Documentation

Linglib.Phenomena.Agreement.Studies.Toosarvandani2023

Toosarvandani (2023): The Interpretation and Grammatical Representation of Animacy #

@cite{toosarvandani-2023} @cite{harbour-2016} @cite{kratzer-2009} @cite{foley-toosarvandani-2022}

Language 99(4): 760–808.

Formalizes the core empirical and theoretical contributions of Toosarvandani's analysis of Santiago Laxopa Zapotec (and other Southeastern Sierra Zapotec varieties), in which third-person plural pronouns encode a four-way animacy distinction (elder humans, nonelder humans, animals, inanimates) that exhibits ASSOCIATIVITY — the cluster of interpretive properties otherwise associated with first- and second-person plural pronouns.

Key formal apparatus (paper §3–§4) #

File structure #

Note on absolute vs relative PCC #

(86) is the formal Agree condition. It predicts same-rank pairings (e.g. 1>1, 2>2) are licit because A trivially has all of A's features. The paper acknowledges (footnote 22, p. 798) that the empirical person-based PCC in many languages is ABSOLUTE — any local-person object is illicit, even with a local-person subject. The animacy-based PCC in Zapotec is RELATIVE and is what (86) directly derives. Cross-linguistic variation between absolute and relative PCC is attributed to the probe's featural relativization (§4.3, not formalized here).

def Toosarvandani2023.npow {α : Type u_1} [DecidableEq α] (atoms : Finset α) :
Finset (Finset α)

Nonempty powerset: all nonempty subsets of atoms. Lean-side encoding of the lattice of plural individuals — every nonempty sum of atoms satisfying a feature predicate. The paper works in a Linkian/Krifkan lattice (47); we represent that lattice as the powerset of an atomic domain.

Equations
Instances For
    def Toosarvandani2023.oplus {α : Type u_1} [DecidableEq α] (F G : Finset (Finset α)) :
    Finset (Finset α)

    ⊕ (oplus): pointwise lattice join of two feature denotations (50): [⊕ G F] = λx . ∃y∃z[x = y ⊔ z, y ∈ ⟦F⟧, z ∈ ⟦G⟧]. Following @cite{harbour-2016}, generalising @cite{kratzer-2009}'s sum operator. Generates heterogeneous pluralities: oplus SPEAKER ANIMATE includes {speaker, animal}.

    Defined directly via Finset.image (rather than the equivalent Finset.image₂ (· ∪ ·)) so that decide can unfold concrete instances without an extra image₂ layer. The structural lemmas (oplus_subset, oplus_assoc) inherit from mathlib via defeq.

    Equations
    Instances For
      theorem Toosarvandani2023.oplus_eq_image₂ {α : Type u_1} [DecidableEq α] (F G : Finset (Finset α)) :
      oplus F G = Finset.image₂ (fun (x1 x2 : Finset α) => x1 x2) F G

      oplus agrees with Finset.image₂ (· ∪ ·) definitionally. Used to inherit mathlib's image₂_* lemmas.

      theorem Toosarvandani2023.oplus_subset {α : Type u_1} [DecidableEq α] {F F' G G' : Finset (Finset α)} (hF : F F') (hG : G G') :
      oplus F G oplus F' G'

      Monotonicity of ⊕: pointwise on both arguments. Inherits from Finset.image₂_subset.

      theorem Toosarvandani2023.oplus_assoc {α : Type u_1} [DecidableEq α] (F G H : Finset (Finset α)) :
      oplus (oplus F G) H = oplus F (oplus G H)

      Associativity of ⊕: inherits from Finset.image₂_assoc and Finset.union_assoc.

      theorem Toosarvandani2023.npow_mono {α : Type u_1} [DecidableEq α] {s t : Finset α} (h : s t) :
      npow s npow t

      Monotonicity of npow: powerset_mono + erase_subset_erase.

      theorem Toosarvandani2023.mem_oplus_iff {α : Type u_1} [DecidableEq α] {F G : Finset (Finset α)} {z : Finset α} :
      z oplus F G xF, yG, x y = z

      Membership in oplus: z ∈ oplus F G ↔ ∃ x ∈ F, ∃ y ∈ G, x ∪ y = z. Inherits from Finset.mem_image₂ via oplus_eq_image₂.

      theorem Toosarvandani2023.mem_oplus_of_mem {α : Type u_1} [DecidableEq α] {F G : Finset (Finset α)} {x y : Finset α} (hx : x F) (hy : y G) :
      x y oplus F G

      Constructor form of membership: x ∈ F → y ∈ G → x ∪ y ∈ oplus F G.

      theorem Toosarvandani2023.npow_union_mem {α : Type u_1} [DecidableEq α] {X s t : Finset α} (hs : s npow X) (ht : t npow X) :
      s t npow X

      npow X is closed under nonempty union: union of two nonempty subsets of X is itself a nonempty subset of X.

      theorem Toosarvandani2023.oplus_subset_npow {α : Type u_1} [DecidableEq α] {X : Finset α} {F G : Finset (Finset α)} (hF : F npow X) (hG : G npow X) :
      oplus F G npow X

      If F ⊆ npow X and G ⊆ npow X, then oplus F G ⊆ npow X. The pointwise union of nonempty subsets of X lands back in npow X.

      theorem Toosarvandani2023.oplus_npow_self_subset {α : Type u_1} [DecidableEq α] (X : Finset α) :
      oplus (npow X) (npow X) npow X

      npow X is closed under self-oplus: union of two elements of npow X is again in npow X.

      def Toosarvandani2023.lexComp {α : Type u_1} [DecidableEq α] (G F : Finset (Finset α)) :
      Finset (Finset α)

      Lexical Complementarity (52): for feature denotations ⟦F⟧ ⊂ ⟦G⟧, use of G is restricted to ⟦G⟧ \ ⟦F⟧. Forestalls a pronoun from being used to refer to individuals picked out by a more featurally specified pronoun (Harbour 2016:80).

      Equations
      Instances For
        def Toosarvandani2023.singularFilter {α : Type u_1} [DecidableEq α] (den : Finset (Finset α)) :
        Finset (Finset α)

        SINGULAR (56a): ⟦SINGULAR⟧ = λx . ∀y[y ≤ x → x = y]. Picks out the atomic individuals. In the Finset encoding, atomic = singleton. Number features compose with person/animacy features by ∩ (intersection), not ⊕.

        Equations
        Instances For
          def Toosarvandani2023.pluralFilter {α : Type u_1} [DecidableEq α] (den : Finset (Finset α)) :
          Finset (Finset α)

          PLURAL (56b): ⟦PLURAL⟧ = λx . ∃y[y ≤ x ∧ x ≠ y]. Picks out the nonatomic individuals. In the Finset encoding, nonatomic = card > 1.

          Equations
          Instances For

            A φ-domain: a finite set of individuals with speaker/addressee roles and animacy predicates forming the entailment hierarchy ELDER ⊂ HUMAN ⊂ ANIMATE (58, p. 786). Speaker and addressee are required to be human (per the paper's footnote 15, p. 787).

            • Ind : Type

              The type of atomic individuals.

            • instDE : DecidableEq self.Ind
            • instFT : Fintype self.Ind
            • speaker : self.Ind

              The speaker.

            • addressee : self.Ind

              The addressee.

            • isElder : self.IndBool

              Elder predicate (salient social role per (58a)).

            • isHuman : self.IndBool

              Human predicate (58b).

            • isAnimate : self.IndBool

              Animate predicate (58c).

            • elder_human (x : self.Ind) : self.isElder x = trueself.isHuman x = true

              Hierarchy: ELDER entails HUMAN (commentary after (58)).

            • human_animate (x : self.Ind) : self.isHuman x = trueself.isAnimate x = true

              Hierarchy: HUMAN entails ANIMATE.

            • speaker_human : self.isHuman self.speaker = true

              Speaker is human (footnote 15).

            • addressee_human : self.isHuman self.addressee = true

              Addressee is human (footnote 15).

            • speaker_ne : self.speaker self.addressee

              Speaker and addressee are distinct.

            Instances For

              Santiago Laxopa Zapotec instance #

              Six representative individuals spanning the four-way animacy scale. This is Toosarvandani's didactic minimum for the Zapotec lattice — not real lexical data.

              Six individuals: speaker, addressee, an elder, a nonelder human, an animal, and an inanimate.

              Instances For
                @[implicit_reducible]
                Equations
                def Toosarvandani2023.instReprZapInd.repr :
                ZapIndStd.Format
                Equations
                • One or more equations did not get rendered due to their size.
                Instances For
                  @[implicit_reducible]
                  Equations
                  • One or more equations did not get rendered due to their size.

                  The Zapotec PhiDomain with 4-way animacy.

                  Equations
                  • One or more equations did not get rendered due to their size.
                  Instances For
                    @[reducible, inline]

                    Convenience abbreviation.

                    Equations
                    Instances For
                      def Toosarvandani2023.speakerDen (pd : PhiDomain) :
                      Finset (Finset pd.Ind)

                      ⟦SPEAKER⟧ (48a): singleton sums containing the speaker.

                      Equations
                      Instances For
                        def Toosarvandani2023.participantDen (pd : PhiDomain) :
                        Finset (Finset pd.Ind)

                        ⟦PARTICIPANT⟧ (48b): nonempty subsets of {speaker, addressee}.

                        Equations
                        Instances For
                          def Toosarvandani2023.piDen (pd : PhiDomain) :
                          Finset (Finset pd.Ind)

                          ⟦π⟧ (48c): all nonempty sums of individuals — the maximal feature denotation.

                          Equations
                          Instances For
                            def Toosarvandani2023.elderDen (pd : PhiDomain) :
                            Finset (Finset pd.Ind)

                            ⟦ELDER⟧ (58a): individuals holding a salient social role. With (58a)'s semantics, the denotation includes SAPs as well as nonconversational elders, since SAPs trivially satisfy any salient-role test (paper p. 787). The pronoun-level restriction to nonconversational elders arises only after lexical complementarity (62).

                            Equations
                            Instances For
                              def Toosarvandani2023.humanDen (pd : PhiDomain) :
                              Finset (Finset pd.Ind)

                              ⟦HUMAN⟧ (58b). SAPs are included via speaker_human/addressee_human.

                              Equations
                              Instances For
                                def Toosarvandani2023.animateDen (pd : PhiDomain) :
                                Finset (Finset pd.Ind)

                                ⟦ANIMATE⟧ (58c).

                                Equations
                                Instances For
                                  def Toosarvandani2023.elderPron (pd : PhiDomain) :
                                  Finset (Finset pd.Ind)

                                  The 3SG.EL feature stack from (61a): π ⊕ ANIMATE ⊕ HUMAN ⊕ ELDER. We elide the SINGULAR component (which is intersected separately) and return the elder-pronoun denotation prior to LC. The actual pronoun denotation in (62a) is lexComp elderPron over the next-less-marked pronoun — but for the elder pronoun itself, LC is trivial since it's the most marked.

                                  Equations
                                  • One or more equations did not get rendered due to their size.
                                  Instances For
                                    def Toosarvandani2023.humanPron (pd : PhiDomain) :
                                    Finset (Finset pd.Ind)

                                    The 3SG.HU stack from (61b): π ⊕ ANIMATE ⊕ HUMAN.

                                    Equations
                                    Instances For
                                      def Toosarvandani2023.animalPron (pd : PhiDomain) :
                                      Finset (Finset pd.Ind)

                                      The 3SG.AN stack from (61c): π ⊕ ANIMATE.

                                      Equations
                                      Instances For
                                        def Toosarvandani2023.inanimatePron (pd : PhiDomain) :
                                        Finset (Finset pd.Ind)

                                        The 3SG.IN stack from (61d): π only.

                                        Equations
                                        Instances For
                                          def Toosarvandani2023.thirdOnly (pd : PhiDomain) (den : Finset (Finset pd.Ind)) :
                                          Finset (Finset pd.Ind)

                                          Remove sums containing the speaker or addressee — restricts a pronoun denotation to third-person reference.

                                          Equations
                                          Instances For
                                            def Toosarvandani2023.thirdElderPron (pd : PhiDomain) :
                                            Finset (Finset pd.Ind)

                                            Third-person elder, human, animal, inanimate denotations (post-thirdOnly).

                                            Equations
                                            Instances For
                                              def Toosarvandani2023.thirdElderLC (pd : PhiDomain) :
                                              Finset (Finset pd.Ind)

                                              Elder pronoun after LC: most marked, no subtraction needed (62a).

                                              Equations
                                              Instances For
                                                def Toosarvandani2023.thirdHumanLC (pd : PhiDomain) :
                                                Finset (Finset pd.Ind)

                                                Human pronoun after LC (62b): human \ elder.

                                                Equations
                                                Instances For
                                                  def Toosarvandani2023.thirdAnimalLC (pd : PhiDomain) :
                                                  Finset (Finset pd.Ind)

                                                  Animal pronoun after LC (62c): animal \ human.

                                                  Equations
                                                  Instances For

                                                    Concrete checks on the Zapotec instance, proved structurally via explicit mem_oplus_of_mem witness construction rather than decide. The kernel decide on Finset-of-Finset membership over a 6-atom universe is too heavy (each side up to 63 elements); the structural witnesses build up {x} = {x} ∪ {x} decompositions through the nested oplus.

                                                    An elder + nonelder human group {e, h} is in the elder pronoun denotation. Heterogeneity: elder pronouns can refer to mixed groups of elders and others. Paper §2.3 + (62a).

                                                    Elder + animal group {e, a} is in the elder pronoun denotation. Witness construction: e = {e}, y = {e, a} = {e} ∪ {a} where {e} ∈ humanDen (e is human, being elder) and {a} ∈ animalPron (a is in animateDen ∩ piDen).

                                                    Base-feature containment: each animacy denotation is contained in the next less-marked one. These follow from npow_mono plus a small decide on the underlying 6-atom filter.

                                                    Denotation containment chain. Each pronoun is oplus of progressively less-marked feature denotations. The proof pattern: replace the most-marked component (e.g. elder in elderPron) with the next component (human) using oplus_subset + base-feature monotonicity, then use oplus_assoc and oplus_npow_self_subset to absorb the duplicated component.

                                                    SPEAKER ⊂ PARTICIPANT at the feature-denotation level.

                                                    ⊕ is associative on any Zapotec denotations. Direct corollary of the generic oplus_assoc; no decide needed.

                                                    The PCC (Person Case Constraint), here extended with animacy, derived from feature containment per (86):

                                                    > A functional head F Agrees with two pronouns A and B, where A is
                                                    > higher than B, iff A has all of the features of B.
                                                    
                                                    Person and animacy features occupy the same structural position (D),
                                                    forming a SINGLE hierarchy. Feature containment — not numeric ranking —
                                                    determines licitness. 
                                                    

                                                    Decompositional φ-features in the nested geometry D → [π [ANIMATE [HUMAN [ELDER [PARTICIPANT [SPEAKER]]]]]] (60).

                                                    Instances For
                                                      @[implicit_reducible]
                                                      Equations
                                                      Equations
                                                      • One or more equations did not get rendered due to their size.
                                                      Instances For
                                                        @[implicit_reducible]
                                                        Equations
                                                        • One or more equations did not get rendered due to their size.

                                                        Pronoun types in the extended person/animacy hierarchy.

                                                        Instances For
                                                          @[implicit_reducible]
                                                          Equations
                                                          Equations
                                                          • One or more equations did not get rendered due to their size.
                                                          Instances For
                                                            @[implicit_reducible]
                                                            Equations
                                                            • One or more equations did not get rendered due to their size.

                                                            All 6 pronoun types.

                                                            Equations
                                                            • One or more equations did not get rendered due to their size.
                                                            Instances For

                                                              Feature specification per pronoun type. SAPs include all animacy features because speaker/addressee are at least elder-rank under (58a)'s "salient social role" predicate. The feature sets form a strict containment chain: 1P ⊃ 2P ⊃ 3.EL ⊃ 3.HU ⊃ 3.AN ⊃ 3.IN.

                                                              Equations
                                                              Instances For
                                                                def Toosarvandani2023.pccLicit (subj obj : PronType) :
                                                                Bool

                                                                PCC licitness via feature containment (86): subject Agrees with object iff subject has all of object's features. Implemented directly via Finset.Subset decidability.

                                                                Equations
                                                                Instances For
                                                                  theorem Toosarvandani2023.pcc_licit_count :
                                                                  (List.flatMap (fun (subj : PronType) => List.filter (fun (obj : PronType) => pccLicit subj obj) PronType.all) PronType.all).length = 21

                                                                  21 licit subj×obj pairings (15 strict + 6 same-rank). Drift sentry for the per-cell theorems above.

                                                                  theorem Toosarvandani2023.pcc_refl (p : PronType) :
                                                                  pccLicit p p = true

                                                                  Reflexivity: A ⊆ A always, so pccLicit p p = true.

                                                                  theorem Toosarvandani2023.pcc_trans (p1 p2 p3 : PronType) :
                                                                  pccLicit p1 p2 = truepccLicit p2 p3 = truepccLicit p1 p3 = true

                                                                  Transitivity: features p3 ⊆ features p2 and features p2 ⊆ features p1 imply features p3 ⊆ features p1.

                                                                  The Zapotec feature mapping forms a total containment chain. This is the domain-specific fact that makes PCC total.

                                                                  theorem Toosarvandani2023.pcc_total (p1 p2 : PronType) :
                                                                  pccLicit p1 p2 = true pccLicit p2 p1 = true

                                                                  Totality: at least one direction is licit. Follows from the containment chain via Finset.Subset.

                                                                  theorem Toosarvandani2023.pcc_antisymm (p1 p2 : PronType) :
                                                                  pccLicit p1 p2 = truepccLicit p2 p1 = truep1 = p2

                                                                  Antisymmetry on PronType: mutual licitness implies same pronoun type. This requires Zapotec-specific injectivity of the feature mapping (the abstract claim is just feature-set equality).

                                                                  Within third person, denotation nesting (semantic, from ⊕ composition) mirrors PCC licitness (syntactic, from feature containment). The two main contributions of the paper — compositional animacy semantics and derived PCC — are formally connected: more features → smaller denotation → higher on hierarchy → can be subject.

                                                                  The three semantic-side facts are `elderPron_sub_humanPron`,
                                                                  `humanPron_sub_animalPron`, `animalPron_sub_piDen` (§4c above). The
                                                                  syntactic-side facts are `pcc_3el_3hu`, `pcc_3hu_3an`, `pcc_3an_3in`
                                                                  (licit) and `pcc_3hu_3el`, `pcc_3an_3hu`, `pcc_3in_3an` (illicit), all
                                                                  in §5. Earlier drafts bundled them into a 9-conjunct
                                                                  `denotation_pcc_full_chain` aggregator; that was a
                                                                  no-aggregate-count-style sentry and is omitted to keep `decide`
                                                                  recursion within mathlib defaults. 
                                                                  

                                                                  Third-person restriction strictly shrinks the elder pronoun denotation: {i} is a witness that lives in elderPron (via the all-singletons decomposition {i} ∪ {i} ∪ {i} ∪ {i}) but is filtered out of thirdElderPron because it contains the speaker. Structural rather than card-based to avoid computing two ~63-element Finset cardinalities in the kernel.

                                                                  LC produces disjoint third-person pronoun denotations.

                                                                  Person/animacy features compose via ⊕; number features compose via ∩. These two modes MUST apply in a specific order: ⊕ first, then ∩. If SINGULAR were interleaved within the ⊕ chain, ⊕ would re-introduce pluralities by joining the singletons SINGULAR preserved (paper §3.2, derivation (63)). This non-commutativity is why person and number occupy distinct functional heads.

                                                                  Correct composition (57): ⊕ all person features, then ∩ SINGULAR. First-person singular = {speaker}.

                                                                  ⊕ and SINGULAR do not commute. Interleaving SINGULAR within the ⊕ chain gives different (wrong) results because ⊕ creates new pluralities from the singletons SINGULAR preserved. (63).

                                                                  The wrong order produces plural sums: ⊕ SPEAKER applied AFTER SINGULAR creates {speaker, addressee} from singleton {addressee}, undoing SINGULAR's filtering.

                                                                  PCC respects person prominence: when subject is strictly more prominent in person (not just animacy), PCC is licit.

                                                                  Map PronType to binary person features (±participant, ±author).

                                                                  Equations
                                                                  Instances For

                                                                    The two independent decompositions of person agree: PronType → PersonFeatures and PronType → PersonLevel → Features.

                                                                    The 4-way Zapotec animacy system REFINES the 3-way typological scale. Elder and human collapse at AnimacyLevel.human, but PCC distinguishes them. Toosarvandani's central empirical contribution.

                                                                    All PronType person features are well-formed (no [−participant, +author]).