Documentation

Linglib.Studies.Lambert2026

[Lam26]: Multitier phonotactics with logic and algebra #

Lambert (2026) classifies attested phonotactic constraints — bounded and unbounded stress, harmony, and tone across ~13 languages — into the multitier (Boolean closure of tier-projected) extensions of definite, generalized definite, and finite-or-cofinite classes. Headline empirical claims:

The propositional logic is IsBTC 𝒞 — the Boolean closure of IsTierBased 𝒞 — for 𝒞 in {IsDefinite k, IsGeneralizedDefinite k, Language.IsStrictlyLocal · k, Language.IsStrictlyPiecewise · k, IsFiniteOrCofinite}; the algebraic side is the syntactic-semigroup characterization of each class via Eilenberg [Eil76] variety equations (e.g., D = ⟦sx̄ = x̄⟧, ℒℐ = ⟦x^ω y x^ω z x^ω = x^ω y x^ω⟧ per [Str85] and [Alm95]). The Lean substrate (IsBTC, IsTierBased) lives in Subregular/Language/Multitier.lean; the algebraic characterization is queued for a future SyntacticMonoid PR.

Disclaimer 1: McCollum (2019) Uyghur gradience (linglib audit) #

This disclaimer is not a scope qualification carried by Lambert (2026); the paper does not cite McCollum. It is a linglib-internal audit annotation: Lambert's BTD analysis is faithful to [MM18]'s categorical idealization, and a separate literature line — [McC19a] — argues the suffix backness assignment is not categorical in the way the multitier-definite formula requires. The "arbitrarily specified, statistical tendency to be back" clause that Mayer & Major report for the no-V no-C case is precisely the locus where McCollum's gradient data resists categorical analysis. The headline theorem uyghur_backness_isBTD characterizes the categorical pattern only; the gradience is out of scope.

Disclaimer 2: Karanga Shona scope restriction #

The BTLI analysis applies to the verb-stem domain (post-hyphen material in Lambert (2026) (45)). [Jar20]'s motivation for a melody local class extends across morphological boundaries and to longer melodic patterns; the BTLI characterization is not a refutation of the broader melody-local programme but a delimited result for the verb-stem surface pattern. The headline theorem karanga_shona_verb_stem_isBTLI is named accordingly.

Cross-framework dialogue #

The multitier substrate is the prohibition reading of constraints scaled to Boolean closure. Cross-references the new file makes explicit (rather than silently diverging from existing linglib formalizations):

Audit calibration note #

Per linglib's domain-expert agent calibration: the McCollum-2019 and Karanga-Shona-scope concerns are flagged HIGH but should be treated PROVISIONAL — they are corrections to scope, not refutations of the formal results. The Lean theorems below state the formal claims; the empirical disclaimers live in this docstring and the per-theorem docstrings.

Sandwich-word helpers #

The Tsuut'ina/Luganda counterexamples below are bookended words replicate kL aL ++ mid ++ replicate kR aR. These thin local wrappers specialise the generic List bookend lemmas (Core/Data/List/Bookend.lean) to the Edge-projection view used in the proofs.

Iban syllable type [Oma69]: stressed (σ́) or unstressed (σ). The two-letter alphabet of Lambert (2026) §2.1.

Instances For
    @[implicit_reducible]
    Equations
    @[implicit_reducible]
    Equations
    def Lambert2026.instReprIbanSyl.repr :
    IbanSylStd.Format
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      The Iban stress-final language [Oma69]: the right-edge D_1 language whose permitted length-1 suffix is [stressed].

      Equations
      Instances For

        Iban stress-final ∈ D_1 (Lambert 2026 §2.1, paper p. 4 example (2)).

        Amara stress [Thu66]: penultimate-syllable stress with the monosyllabic exception (single syllable bears stress). The D_2 language whose permitted length-2 suffixes are a stressed monosyllable or a stressed-then- unstressed penult.

        Equations
        • One or more equations did not get rendered due to their size.
        Instances For

          Amara stress-penult ∈ D_2 (Lambert 2026 §2.1).

          The Finnish stress-initial language [STY08]: primary stress fixed to the initial syllable — the reverse-definite dual of Iban, the left-edge RD_1 language whose permitted length-1 prefix is [stressed] [Lam26] §2.2.

          Equations
          Instances For

            Finnish stress-initial ∈ K_1 (Lambert 2026 §2.2, paper p. 5 example (6)).

            Uyghur segment classes relevant to backness harmony [MM18] [Lam26] (33)-(35): front vowels, back vowels, transparent (i, e), front dorsal consonants, back dorsal consonants, suffix-marked harmonizing vowels and consonants, and a residual "other" class for non-harmonizing material.

            Instances For
              @[implicit_reducible]
              Equations
              def Lambert2026.instReprUyghurSeg.repr :
              UyghurSegStd.Format
              Equations
              • One or more equations did not get rendered due to their size.
              Instances For

                Tier predicates #

                Lambert's V_f ∪ V_b is the harmonising-vowel tier; C_f ∪ C_b is the dorsal tier; S_f and S_b are the front- and back-suffix tiers (V_X ∪ C_X marked as suffix). Each is a Boolean predicate on UyghurSeg.

                Front-suffix tier S_f: front-marked suffix material (vowel or dorsal).

                Equations
                Instances For

                  Back-suffix tier S_b: back-marked suffix material (vowel or dorsal).

                  Equations
                  Instances For

                    Atomic tier-projected definite languages #

                    Each of the four atomic predicates in (35) is a IsTierBased (Language.IsDefinite · 1) test: a single permitted length-1 suffix set on the filtered tier (Language.isDefinite_setOf_right), lifted via IsTierBased.

                    def Lambert2026.tierEmptyLang (T : UyghurSegBool) :
                    Language UyghurSeg

                    The language "tier T is empty": after filtering by T, the word is []. Encoded as IsTierBased (IsDefinite 1) via the singleton 1-suffix {[]} (the only word whose right-1-suffix is [] is the empty word).

                    Equations
                    Instances For
                      def Lambert2026.tierEndsWithLang (T : UyghurSegBool) (x : UyghurSeg) :
                      Language UyghurSeg

                      The language "tier T ends with x": after filtering by T, the word's last element is x. Encoded as IsTierBased (IsDefinite 1) via the singleton 1-suffix {[x]}.

                      Equations
                      Instances For

                        tierEmptyLang T is in IsTierBased (IsDefinite 1). The base 1-definite language is the singleton {[]} (only [] has right-1- suffix []).

                        tierEndsWithLang T x is in IsTierBased (IsDefinite 1). The base 1-definite language is the singleton {[x]}.

                        The Uyghur backness language as conjunction of (35a)-(35b) #

                        The English implication A → B is Boolean Aᶜ ∨ B; written using mathlib's Language lattice (Boolean algebra), Aᶜ ⊔ B. The full language is the intersection of four such implications.

                        The Uyghur backness harmony language as the conjunction of the four implications in Lambert (2026) (35a)-(35b). Categorical idealisation — see file docstring for the [McC19a] gradience disclaimer.

                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For

                          Uyghur backness harmony ∈ BTD₁ (Lambert 2026 §4.3, eq. (35), refining [MM18]). Constructive witness: the formalised uyghurBacknessLang is the intersection of four implications, each Aᶜ ⊔ B where A and B are IsTierBased (IsDefinite 1) (each is a tier-projected single-suffix test). The BTD membership follows from IsBTC.{base, compl, union, inter} applied to the four atomic tier-projected definite languages.

                          Karanga Shona tone alphabet [Odd84] [Lam26] §5.6: low (ℓ) and high (h).

                          Instances For
                            @[implicit_reducible]
                            Equations
                            def Lambert2026.instReprKShoTone.repr :
                            KShoToneStd.Format
                            Equations
                            Instances For

                              Atomic IsGeneralizedDefinite languages (uniform k = 5) #

                              Each component of Lambert's φ_F ∨ L_m ∨ H_m reduces to a Boolean combination of edge-anchored substring tests and tier-projected substring tests. We encode each as a Language KShoTone and prove it's IsGeneralizedDefinite 5 directly via List.take_take / List.drop_drop-style structural arguments. The uniform k = 5 is chosen as 1 + max(prefix length, suffix length, tier-projection length, max φ_F word length) = 1 + 4.

                              def Lambert2026.startsWithLang {α : Type u_1} (xs : List α) :
                              Language α

                              "Word starts with xs": the language {w | w.take xs.length = xs}. Originally introduced for KShoTone but α-generic; also reused for LugandaTone in §10 Kagoshima Japanese.

                              Equations
                              Instances For
                                def Lambert2026.endsWithLang {α : Type u_1} (xs : List α) :
                                Language α

                                "Word ends with xs": the language {w | w.drop (w.length - xs.length) = xs}. α-generic, reused for LugandaTone in §10 Kagoshima Japanese.

                                Equations
                                Instances For
                                  def Lambert2026.tierEqualLang {α : Type u_1} (T : αBool) (xs : List α) :
                                  Language α

                                  "Tier-projection by T equals exactly xs": the language {w | w.filter T = xs}. α-generic, reused for LugandaTone in §10 Kagoshima Japanese.

                                  Equations
                                  Instances For

                                    Boolean tier predicate for h-tier (high tones only).

                                    Equations
                                    Instances For

                                      IsGeneralizedDefinite witnesses at k = 5 #

                                      Unfolding helper: Edge.left.takeAt k w = w.take k and Edge.right.takeAt k w = w.drop (w.length - k) by rfl. The hypotheses from IsGeneralizedDefinite k come in the Edge.takeAt form; we unfold via show at the top of each proof.

                                      theorem Lambert2026.startsWithLang_isGenDef {α : Type u_1} (xs : List α) (k : ) (hk : xs.length k) :

                                      startsWithLang xs is IsGeneralizedDefinite k for any k ≥ xs.length. Proof: same k-prefix on both words determines the xs.length-prefix via List.take_take. α-generic.

                                      theorem Lambert2026.endsWithLang_isGenDef {α : Type u_1} (xs : List α) (k : ) (hk : xs.length k) :

                                      endsWithLang xs is IsGeneralizedDefinite k for any k ≥ xs.length. Symmetric to startsWithLang_isGenDef; the underlying identity is w.drop (w.length - xs.length) = (w.drop (w.length - k)).drop (k - xs.length) when xs.length ≤ k ≤ w.length. The general case splits on whether w is shorter than k.

                                      theorem Lambert2026.tierEqualLang_isTierBased {α : Type u_1} (T : αBool) (xs : List α) (k : ) (hk : xs.length < k) :
                                      Language.IsTierBased (fun (x : Language α) => x.IsGeneralizedDefinite k) (tierEqualLang T xs)

                                      tierEqualLang T xs is IsTierBased (IsGeneralizedDefinite k) for any k > xs.length (strict — without strictness, e.g. {[h, h]} is not GeneralizedDefinite 2 since [h, h, h] and [h, h] share both 2-prefix and 2-suffix).

                                      The seven fully specified words from Lambert (2026) §5.6 (just above eq. (46), paper p. 19). Max length 4 (hhℓh).

                                      Equations
                                      • One or more equations did not get rendered due to their size.
                                      Instances For

                                        The finite-language part φ_F of Lambert's witness — the seven fully specified words.

                                        Equations
                                        Instances For

                                          phi_F is IsGeneralizedDefinite 5. Max word length is 4, so for k = 5 > 4, any word of length ≤ 4 has k-prefix = whole word. Two words with same 5-prefix and length ≤ 4 are equal; any word with length > 4 has a 5-prefix of length 5 (or whole) which differs from any short word's 5-prefix.

                                          The Karanga Shona verb-stem tone language as the disjunction φ_F ∨ L_m ∨ H_m from Lambert (2026) §5.6 (formula appearing just after eq. (49), paper p. 19).

                                          Equations
                                          • One or more equations did not get rendered due to their size.
                                          Instances For

                                            Karanga Shona verb-stem tone ∈ BTLI₅ (Lambert 2026 §5.6, refining [Jar20]). Constructive witness for the disjunction φ_F ∨ L_m ∨ H_m at uniform k = 5. Each disjunct lifts to IsBTC (IsGeneralizedDefinite 5) via IsTierBased.of_class + IsBTC.base; the disjunction is closed by IsBTC.union.

                                            Sibilant-harmony grammars over the shared Sibilant alphabet #

                                            Both asymmetric directions plus the symmetric foil, over the shared Subregular.Sibilant substrate. Lambert's classification draws the symmetric-vs-asymmetric comparison: the symmetric grammar is the [Han10] Navajo profile (TSLGrammar.agree, disagreement forbidden in both directions), the anterior-first asymmetric grammar the [Coo78] Tsuut'ina profile.

                                            Forbidden-pair relation: anterior immediately preceding posterior on the tier (the [Coo78] Tsuut'ina adjacency).

                                            Equations
                                            Instances For

                                              Tsuut'ina-style asymmetric harmony: anterior-before-posterior forbidden on the tier, the reverse permitted.

                                              Equations
                                              Instances For

                                                Dual forbidden-pair relation: posterior immediately preceding anterior.

                                                Equations
                                                Instances For

                                                  Symmetric sibilant harmony: any tier-adjacent disagreement forbidden — the [Han10] Navajo profile, the foil for the asymmetric comparison.

                                                  Equations
                                                  Instances For

                                                    The symmetric language is contained in the anterior-first asymmetric language: forbidding disagreement in both directions rules out everything the one-direction constraint does, and more.

                                                    Dual inclusion against the posterior-first asymmetric language.

                                                    @[reducible, inline]

                                                    The Tsuut'ina sibilant alphabet ([Coo78]) is the shared three-class Subregular.Sibilant (anterior s, posterior ʃ, neutral non-sibilant).

                                                    Equations
                                                    Instances For

                                                      The TSL_2 grammar for Tsuut'ina asymmetric sibilant harmony ([Coo78]; Lambert's asymmetric classification): anterior preceding posterior on the sibilant tier is forbidden, the reverse permitted.

                                                      Equations
                                                      Instances For

                                                        The Tsuut'ina asymmetric sibilant harmony language. Defined as the language of the TSL_2 witness so that the membership theorem is definitional.

                                                        Equations
                                                        Instances For

                                                          Tsuut'ina asymmetric harmony ∈ TSL_2. Definitional witness: the TSL_2 grammar tsuutinaTSLGrammar.

                                                          Refutation: Tsuut'ina ∉ BTLI #

                                                          Lambert's parameterised counterexample: for every k, the words ʃᵏ⁺¹sᵏ⁺¹ (accepted) and ʃᵏ s ʃ sᵏ (rejected) share the length-k tier-prefix and length-k tier-suffix on every Bool tier. Both witnesses are sandwiches with asymmetric bookends posterior on the left and anterior on the right; widths differ between witnesses (k+1 for accepted, k for rejected). The 8-tier enumeration (3 alphabet classes × 2 keep/drop) collapses to 4 since the witnesses contain no neutrals.

                                                          The local sandwich helpers (over Core/Data/List/Bookend.lean) handle the (true, true) case directly via takeAt_*_sandwich (the bookends are wide enough); the other three cases reduce to filtered words being equal (after a replicate_succ rewrite to merge the middle contribution into the bookend).

                                                          @[reducible, inline]

                                                          The accepted Tsuut'ina parameterised witness ʃᵏ⁺¹ sᵏ⁺¹. Sibilant tier projection: posterior^(k+1) ++ anterior^(k+1) — no anterior precedes any posterior.

                                                          Equations
                                                          Instances For
                                                            @[reducible, inline]

                                                            The rejected Tsuut'ina parameterised witness ʃᵏ s ʃ sᵏ. The internal [anterior, posterior] is the forbidden adjacency on the sibilant tier.

                                                            Equations
                                                            Instances For

                                                              Tsuut'ina asymmetric harmony ∉ BTLI (Lambert's parametrised counterexample).

                                                              Luganda tone alphabet [HK10]: low (ℓ) and high (h), following [Jar20]'s identification of "intermediate" with low in the input.

                                                              Instances For
                                                                @[implicit_reducible]
                                                                Equations
                                                                Equations
                                                                • One or more equations did not get rendered due to their size.
                                                                Instances For

                                                                  The Luganda high-tone plateauing predicate [Lam26] (37): no [h, ℓ, h] subsequence (at most one high span), and if any high tone appears then there is a later low ([h, ℓ] is a subsequence).

                                                                  Equations
                                                                  • One or more equations did not get rendered due to their size.
                                                                  Instances For

                                                                    The Luganda high-tone plateauing language.

                                                                    Equations
                                                                    Instances For
                                                                      @[simp]

                                                                      Membership in lugandaLang is membership in lugandaPred.

                                                                      Luganda high-tone plateauing ∈ PT_3 (Lambert 2026 (37)). The predicate depends only on length-≤-3 subsequence presence: the length-3 [h, ℓ, h], the length-2 [h, ℓ], and the length-1 [h].

                                                                      The proof reduces each conjunct of lugandaPred to the corresponding subseqSet 3 membership question, then transfers via subseqSet_eq_iff.

                                                                      Refutation: Luganda ∉ BTLI #

                                                                      Lambert (2026) §5.1 parameterised counterexample: for every k, the words ℓᵏ ℓ h h ℓ ℓᵏ (accepted) and ℓᵏ ℓ h ℓ h ℓ ℓᵏ (rejected) share the length-k tier-prefix and length-k tier-suffix on every Bool tier. Both witnesses are sandwiches with bookend low and width k; the local sandwich helpers (over Core/Data/List/Bookend.lean) handle the bookend-keeps-tier-affix case directly, leaving only the small filtered-middle equality as per-witness work.

                                                                      Membership separation is predicate-level (not TSL-grammar level): lugandaPred requires no [h, ℓ, h] subsequence and (if any high appears) a [h, ℓ] subsequence. Accepted satisfies both; rejected fails the first because its explicit middle [ℓ, h, ℓ, h, ℓ] contains [h, ℓ, h] as a subsequence.

                                                                      @[reducible, inline]

                                                                      The accepted Luganda parameterised witness ℓᵏ ℓ h h ℓ ℓᵏ. The two high tones are adjacent inside the middle, so no [h, ℓ, h] subsequence can be formed (no low position lies strictly between them).

                                                                      Equations
                                                                      • One or more equations did not get rendered due to their size.
                                                                      Instances For
                                                                        @[reducible, inline]

                                                                        The rejected Luganda parameterised witness ℓᵏ ℓ h ℓ h ℓ ℓᵏ. The explicit middle [ℓ, h, ℓ, h, ℓ] contains [h, ℓ, h] as a subsequence — exactly the structural pattern lugandaPred forbids.

                                                                        Equations
                                                                        • One or more equations did not get rendered due to their size.
                                                                        Instances For

                                                                          Luganda high-tone plateauing ∉ BTLI (Lambert 2026 §5.1). Parametrised-word witness: ℓᵏ ℓ h h ℓ ℓᵏ is accepted while ℓᵏ ℓ h ℓ h ℓ ℓᵏ is rejected, but the two share all length-k tier-affixes on every tier.

                                                                          Lambert 2026 §5.2 ([Din06]): Prinmi pitch-accent lexically selects a high-tone position within a domain (morpheme or span of adjacent morphemes); high may spread progressively to the next syllable. The resulting pattern enforces:

                                                                          1. Obligatoriness: at least one high tone (h).
                                                                          2. At most one high span: no [h, ℓ, h] subsequence (same as Luganda §5.1).
                                                                          3. Span length ≤ 2: no [h, h, h] subsequence (new conjunct not present in Luganda).

                                                                          Lambert: "the same words witness this nonmembership as for high-tone plateauing: ℓᵏ h h ℓᵏ is valid but ℓᵏ h ℓ h ℓᵏ is not, despite the two having the same k-affixes on every tier." So we reuse lugandaAccepted / lugandaRejected as witnesses and luganda_tierAffixes as the indistinguishability proof — the substrate Sandwich's not_sublist_sandwich discharges both the [h, ℓ, h] and the new [h, h, h] non-subsequence claims uniformly.

                                                                          Alphabet: LugandaTone reused per Lambert's unified /h notation across §5.

                                                                          Disclaimer (Lambert §5.2): [Din06] assumes maximally quadrisyllabic domains with significant compounding; that finite-domain restriction would make Prinmi co/finite (a stronger classification). The PT_3-and-not-BTLI result formalised here applies to the unbounded analysis Lambert presents; the bounded analysis is out of scope.

                                                                          The Prinmi pitch-accent predicate [Lam26] (39):

                                                                          • [h] <+ w — at least one high tone (obligatoriness).
                                                                          • ¬ [h, ℓ, h] <+ w — at most one high span.
                                                                          • ¬ [h, h, h] <+ w — high span length ≤ 2 syllables.
                                                                          Equations
                                                                          • One or more equations did not get rendered due to their size.
                                                                          Instances For

                                                                            The Prinmi pitch-accent language.

                                                                            Equations
                                                                            Instances For
                                                                              @[simp]

                                                                              Membership in prinmiLang is membership in prinmiPred.

                                                                              Prinmi pitch-accent ∈ PT_3 (Lambert 2026 (39)). All three conjuncts depend only on length-≤-3 subsequence presence: the length-1 [h] and the two length-3 patterns.

                                                                              Prinmi pitch-accent ∉ BTLI (Lambert 2026 §5.2). Same witnesses and tier-affix proof as Luganda §5.1 — Lambert: "The same words witness this nonmembership as for high-tone plateauing."

                                                                              Lambert 2026 §5.3 ([Don97]): Arigibi (Trans-New Guinea) allows at most one mora with high tone (position lexically specified; words with no high tone are allowed). The phonotactic constraint is ¬h..h — no [high, high] subsequence anywhere.

                                                                              Lambert: "exactly analogous to culminativity in isolation, and as such it is piecewise testable and tier-based co/finite, as demonstrated in §2.3" (Lambert's §2.3 is unbounded culminativity).

                                                                              Boolean tier predicate selecting LugandaTone.high. Shared by §8 Arigibi and §9 Chuave.

                                                                              Equations
                                                                              Instances For

                                                                                The Arigibi pitch-accent language: at most one high mora.

                                                                                Equations
                                                                                Instances For
                                                                                  @[simp]

                                                                                  Membership in arigibiLang is ¬ [h, h] <+ w.

                                                                                  Arigibi pitch-accent ∈ PT_2 (Lambert 2026 §5.3, formula ¬h..h). The constraint depends only on length-2 subseq [h, h].

                                                                                  Arigibi pitch-accent ∈ BTN (Lambert 2026 §5.3). On the high tier, the projection has length ≤ 1, i.e., is either [] or [high] — a finite set, hence co/finite.

                                                                                  Lambert 2026 §5.5 ([Don97]): Chuave (Trans-New Guinea) exhibits obligatoriness — every word must contain at least one high-tone mora. There is no restriction on placement; multiple high spans are allowed. The phonotactic constraint is the simplest possible: the formula h (at least one high). This is both:

                                                                                  The Chuave obligatoriness language: at least one mora has high tone.

                                                                                  Equations
                                                                                  Instances For
                                                                                    @[simp]

                                                                                    Membership in chuaveLang is high ∈ w.

                                                                                    Chuave obligatoriness ∈ PT_1 (Lambert 2026 §5.5). The constraint high ∈ w is the singleton subseq presence [high] <+ w.

                                                                                    Chuave obligatoriness ∈ BTN (Lambert 2026 §5.5, formula ¬ [⋊⋉]_{h}). On the high tier isLugHigh, the projection w.filter isLugHigh is non-empty iff high ∈ w. The non-empty set {xs | xs ≠ []} is co/finite (complement is {[]}).

                                                                                    Lambert 2026 §5.4 ([Kaw15], [haraguchi-1977]): Kagoshima Japanese has a pitch-accent system with exactly one high tone per word, appearing on the final or penultimate mora.

                                                                                    Lambert formula (42), order-based PT_3:

                                                                                    Lambert formula (43), tier-based multitier definite:

                                                                                    The PT_3 result is direct from subseqSet_eq_iff. The multitier characterization uses tierEqualLang isLugHigh [.high] (high tier = singleton [h]) intersected with the disjunction of two endsWithLang cases. The §4 helpers (startsWithLang, endsWithLang, tierEqualLang) were generalized to α : Type* for this section so they apply at type LugandaTone without duplication.

                                                                                    We prove kagoshima_multitier_isBTLI (multitier generalized definite, k = 2). Lambert states the stronger BTD₂ classification (multitier definite — right-edge-only); BTD substrate for endsWithLang is queued for follow-up. The order/tier formulation equivalence (formulas 42 ↔ 43) is stated as a TODO theorem kagoshima_lang_eq_mt.

                                                                                    Alphabet: LugandaTone reused per Lambert's unified /h notation across §5.

                                                                                    The Kagoshima Japanese pitch-accent predicate [Lam26] (42): at least one high, no two highs, no [h, ℓ, ℓ] subseq.

                                                                                    Equations
                                                                                    • One or more equations did not get rendered due to their size.
                                                                                    Instances For

                                                                                      The Kagoshima Japanese pitch-accent language.

                                                                                      Equations
                                                                                      Instances For
                                                                                        @[simp]

                                                                                        Membership in kagoshimaLang is membership in kagoshimaPred.

                                                                                        Kagoshima Japanese pitch-accent ∈ PT_3 (Lambert 2026 (42)). All three conjuncts depend only on length-≤-3 subseq presence: length-1 [h], length-2 [h, h], length-3 [h, ℓ, ℓ].

                                                                                        The Kagoshima multitier-encoded language: high-tier-projection equals exactly [h] AND word ends with [h, ℓ] or [h]. Lambert 2026 (43).

                                                                                        Equations
                                                                                        • One or more equations did not get rendered due to their size.
                                                                                        Instances For

                                                                                          Kagoshima multitier characterization ∈ BTLI₂ (Lambert 2026 (43)). Each component lifts to IsBTLI 2:

                                                                                          Equivalence (formula 42 ↔ 43) is queued for follow-up. Lambert states the order-based predicate kagoshimaLang and the multitier predicate kagoshimaMTLang describe the same language, but the proof requires structural reasoning about how "no [h, ℓ, ℓ] subseq" combined with "exactly one high" forces the unique high to lie in the final two positions. Both formulations are independently classified above (PT_3 and BTLI_2 respectively).

                                                                                          Audit-flagged cross-framework engagement points. These are stated here so the disagreements (and convergences) with sibling linglib formalizations are visible rather than buried.

                                                                                          Each is tagged with the auditor file path it cross-references. The proofs are deferred — typically to a follow-up substrate PR (e.g. OTBound.lean) or to the chronologically-later sibling study.

                                                                                          Lambert 2026 multitier classification structurally weaker than SS-MTSL (de Santo & Graf 2019) on Uyghur: every BTD language admits an SS-MTSL acceptor, but BTD is strictly more powerful as a phonotactic classifier — Uyghur backness harmony is BTD but [MM18] shows it is not SS-MTSL. (Stated as TODO; activates when SS-MTSL substrate lands in linglib.)

                                                                                          String-input information loss for autosegmental tone analyses: two distinct autosegmental representations (e.g. multiply-linked vs. singly- linked-with-spread on a tone tier) can have the same surface string. The BTC classification of a string language therefore is not a faithful summary of an autosegmental analysis of the same surface tone pattern. Cross-references Phonology/Autosegmental/RegisterTier.lean, GrammaticalTone.lean, and Studies/Lionnet2025.lean.