Documentation

Linglib.Phenomena.Allomorphy.Studies.SmithMoskalEtAl2019

Smith, Moskal, Xu, Kang & Bobaljik (2019) — Case and Number Suppletion in Pronouns #

@cite{smith-moskal-xu-kang-bobaljik-2019}

@cite{smith-moskal-xu-kang-bobaljik-2019} extend @cite{bobaljik-2012}'s structural-containment account of *ABA in adjectival degree suppletion (good–better–best / *good–better–goodest) to two further empirical domains: pronominal case suppletion (using @cite{caha-2009}'s case hierarchy as the structural backbone) and pronominal number suppletion (using @cite{harbour-2008} / @cite{noyer-1992} for the number-feature geometry).

The cross-domain extension is not seamless: the paper identifies three points where the empirical generalizations require theoretical refinement of the Bobaljik 2012 framework:

  1. §3.6 — AAB attestation diverges across domains. AAB patterns (e.g., a paradigm where positive and comparative share a root and the superlative is suppletive) are systematically unattested in adjectival degree but are attested in pronominal case and pronominal number. This is the divergence formalized below.

  2. §3.7 — domain-based locality replaces structural/linear adjacency. The locality predicate from @cite{bobaljik-2012} (and @cite{embick-2010}) is too restrictive once AAB is admitted. The paper proposes the weaker domain-based locality from @cite{moskal-2015}.

  3. §4.3.1 — markedness × suppletion. Cross-linguistic variation in pronominal number suppletion correlates with independent evidence for variation in the internal complexity / markedness of the number head — connecting suppletion theory to the feature recursion of @cite{harbour-2014} (already substrate in this codebase, see Theories/Syntax/Minimalist/Agreement/FeatureRecursion.lean).

This file formalizes (1) directly: the DM-derivation in Theories/Morphology/DM/ContainmentVI.lean predicts AAB exclusion in every domain it applies to (because vi_cmpr_eq_sprl forces the two inner cells to share a root); the empirical data the paper reports falsifies that prediction in case and number. (2) and (3) are stubbed as substrate-addition TODOs.

Scope of the formalization #

The DM-flavored derivation of *ABA in Theories/Morphology/DM/ContainmentVI.lean forces CMPR-cell = SPRL-cell for any VI-generated root pattern (the core theorem vi_cmpr_eq_sprl). This is strictly stronger than contiguity: it excludes both *ABA and *AAB.

The two theorems below state the same content at two granularities: the general "no VI-generable pattern has CMPR ≠ SPRL", and the specific corollary "the AAB pattern is not VI-generable." Both are direct consequences of vi_cmpr_eq_sprl from the migration's Phase 4 substrate.

DM Vocabulary Insertion under root-out locality cannot generate any pattern whose CMPR cell differs from its SPRL cell.

Specific corollary: the AAB pattern (POS = CMPR ≠ SPRL) cannot be VI-generated under DM root-out locality.

aab : DegreePattern := ⟨0, 0, 1⟩ lives in Core/Morphology/DegreeContainment.lean.

@cite{smith-moskal-xu-kang-bobaljik-2019} §3.6 distinguishes two kinds of AAB pattern in pronominal case suppletion:

We encode two genuine AAB witnesses:

Projecting onto the 3-cell ABS/ERG/DAT hierarchy, both patterns have the shape [0, 0, 1] (positive and middle cells share root-class 0, suppletive third cell takes root-class 1).

Wardaman 3SG: ABS=narnaj, ERG=narnaj-(j)i, DAT=gunga. @cite{smith-moskal-xu-kang-bobaljik-2019} Table 25 (data from @cite{merlan-1994}).

Equations
Instances For

    Khinalugh 2SG: ABS=vi, ERG=va, DAT=oX(ir). @cite{smith-moskal-xu-kang-bobaljik-2019} Table 24.

    Equations
    Instances For

      Both genuine-AAB witnesses are contiguous in the substrate sense (no *ABA violation): cells at positions 0 and 2 do not share a root they don't also share with position 1.

      The defining AAB shape: cells 1 and 2 differ (suppletion in the third position but not the second). This is the structural feature that the DM derivation in Theories/Morphology/DM/ContainmentVI excludes — vi_cmpr_eq_sprl forces the second and third cells to coincide for any VI-generated root pattern.

      §3.6 cross-domain divergence theorem. The DM derivation in Degree.vi_cmpr_eq_sprl (PART II of Theories/Morphology/DM/ContainmentVI.lean) predicts, for any VI-generable root pattern, that the second and third cells coincide. Lifted to case (where the 3-cell projection is UNMARKED–DEPENDENT–OBLIQUE, e.g. ABS–ERG–DAT in ergative languages), this prediction would exclude AAB cells [A, A, B] where the second cell equals the first but the third cell differs.

      @cite{smith-moskal-xu-kang-bobaljik-2019} §3.6 establishes that AAB is robustly attested in pronominal case suppletion (Table 9: 10 instances, including Wardaman 3SG and the Nakh-Daghestanian 2SG patterns). The existence of a contiguous AAB-shaped case pattern witnesses the falsification of the lifted DM derivation: no vi_cmpr_eq_sprl-style theorem can hold for case morphology.

      The paper's positive proposal (§3.7) is to weaken the locality predicate from structural adjacency (Bobaljik 2012) / linear adjacency (Embick 2010) to domain-based locality (@cite{moskal-2015}); see § 4 below.

      @cite{smith-moskal-xu-kang-bobaljik-2019} §4 surveys pronominal number suppletion and finds the same AAB-attestation profile that §3.6 reports for case: "we find extremely clear-cut examples of ABB, ABC and AAB patterns, alongside AAA. We do not find any unambiguously robust evidence of ABA patterns." Table 32 quantifies: 3 attested AAB number paradigms (vs 48 ABB, 19 ABC, numerous AAA, 1 dubious ABA from Yagua).

      §4 Table 46 lists the three concrete AAB number witnesses:

      We encode Yagua 2 — the cleanest morphological case: PL jiryéy transparently contains the SG root jiy plus a plural suffix -éy, while DL sáada is suppletive (no shared formative). This projects to [0, 0, 1] over the SG/PL/DL hierarchy: positions 0 (SG) and 1 (PL) share root-class 0 (the jiy root); position 2 (DL) takes root-class 1 (the sáada root).

      The number paradigms are 3-cell over SG/PL/DL; the cell-ordering reflects the containment structure SG–PL–DL or SG–DL–PL depending on the language (the paper notes both orderings are attested, motivating the §4.3.1 reanalysis of number representation that connects to Harbour's @cite{harbour-2014} feature recursion). For Yagua, the SG–PL–DL ordering matches the table caption directly.

      Yagua 2nd person number paradigm: SG=jiy, PL=jiry-éy, DL=sáada. @cite{smith-moskal-xu-kang-bobaljik-2019} Table 46 (data from @cite{payne-payne-1990}). The PL is transparently jiy + -éy; the DL is suppletive. Projects to [0, 0, 1] over SG/PL/DL.

      Equations
      Instances For

        §4 number-side analog of case_aab_attested_falsifies_dm. Same structural divergence: AAB is attested in pronominal number suppletion (3 instances per Table 32, with Wambaya / Yagua / Dehu listed in Table 46), falsifying the DM derivation lifted to number. The Yagua 2 witness is morphologically transparent — PL = SG + suffix; DL is suppletive — exactly the AAB shape that vi_cmpr_eq_sprl would predict cannot arise.

        @cite{smith-moskal-xu-kang-bobaljik-2019} §3.7 proposes that the locality predicate driving suppletion contiguity is not structural adjacency (@cite{bobaljik-2012}) or linear adjacency (@cite{embick-2010}), but domain-based (@cite{moskal-2015a-dissertation}). The substrate for this lives in Core/Morphology/DomainLocality.lean (DomainPartition, SameDomain, ViolatesABAWithin, IsContiguousWithin, DomainPartition.trivial).

        This section instantiates the substrate for the case and number domains the paper discusses. The partitions are anchored on independently-motivated linguistic content (per mathlib-reviewer audit recommendation: "must derive from an independently-motivated case-hierarchy split, not threshold-on-cell-index"):

        Scope of the formalization #

        The substrate's converse-direction theorem — "domain-aware DM generates AAB patterns when positions are split" — requires extending LocalVIRule (Theories.Morphology.DM.ContainmentVI.Degree) with a domain field so that a rule's locality bound becomes partition-aware rather than the unconditional Bobaljik bound condGrade.rank ≤ DegreeGrade.cmpr.rank. That's a separate substrate addition deferred to a follow-up.

        What this section ships:

        The 3-cell ergative case paradigm SMSE 2019 analyses: position 0 is ABS, position 1 is ERG, position 2 is DAT. Positions outside this range have no case interpretation (none); they default to the non-oblique domain in caseDomainPartition.

        Equations
        Instances For

          Case partition: derived from Core.Case.IsOblique via caseAtPos. ABS and ERG are off-hierarchy in containmentRank (IsOblique is False for them); DAT contains GEN's representation in the Caha order so IsOblique .dat is True. The boundary thus corresponds to @cite{caha-2009}'s Unmarked-Dependent vs Oblique split — as a consequence of the order substrate, not as a stipulated threshold.

          Equations
          Instances For

            Number partition: SG (position 0) + PL (position 1) in domain false (non-dual); DL (position 2) in domain true (dual). The boundary corresponds to @cite{harbour-2014}'s feature-recursion split where dual is the marked extension over SG/PL.

            Equations
            Instances For

              Under the case partition, ERG and DAT (positions 1 and 2) lie in DIFFERENT domains. This is the structural feature that makes the partition admit AAB patterns like Wardaman 3SG [0, 0, 1].

              Under the number partition, PL and DL (positions 1 and 2) lie in DIFFERENT domains.

              The case AAB witness (Wardaman 3SG) is contiguous under the case partition. (Trivially so — [0, 0, 1] is *ABA-contiguous in the universal sense, and IsContiguousWithin is strictly weaker. The substantive claim — that domain-aware DM generates this pattern — requires the deferred LocalVIRule extension.)

              The number AAB witness (Yagua 2) is contiguous under the number partition. Same caveat as the case-side.

              What's deferred #

              The substrate above sets up the partition + the structural facts that the partitions split the relevant cells. The substantive converse-direction theorem — "under the case partition, there exist domain-aware VI rule lists generating Wardaman 3SG-shape patterns" — requires Theories.Morphology.DM.ContainmentVI.Degree.LocalVIRule to be extended with a domain field so its locality bound becomes partition-relativized. The current LocalVIRule.locality field is structurally Bobaljik-style (condGrade.rank ≤ cmpr.rank, unconditional) and forces vi_cmpr_eq_sprl regardless of partition. A domain-aware variant requires a partition-aware cap-refinement on the rule type itself, sketched below.

              A concrete DomainLocalVIRule shape for that follow-up:

              structure DomainLocalVIRule {Tag : Type*} [DecidableEq Tag]
                  (π : DomainPartition Tag) where
                formClass : Nat
                condGrade : DegreeGrade
                specificity : Nat
                domainLocality : ∃ targetGrade : DegreeGrade,
                  SameDomain π condGrade.rank targetGrade.rank
              

              with viWinner_eq_within_domain proving the conditional analog of vi_cmpr_eq_sprl and a constructive domain_locality_admits_aab showing AAB rule lists exist when positions are split.