[Lam26]: Multitier phonotactics with logic and algebra #

Lambert (2026) classifies attested phonotactic constraints — bounded and unbounded stress, harmony, and tone across ~13 languages — into the multitier (Boolean closure of tier-projected) extensions of definite, generalized definite, and finite-or-cofinite classes. Headline empirical claims:

Uyghur backness harmony is multitier definite (BTD) — strictly weaker than the multiple-tier-based strictly local class of [DSG19], settling (categorically) the challenge raised by [MM18].
Karanga Shona tone is multitier generalized definite (BTLI) — no more complex than the default-to-opposite unbounded stress patterns, refining the melody-local analysis of [Jar20].

The propositional logic is IsBTC 𝒞 — the Boolean closure of IsTierBased 𝒞 — for 𝒞 in {IsDefinite k, IsGeneralizedDefinite k, Language.IsStrictlyLocal · k, Language.IsStrictlyPiecewise · k, IsFiniteOrCofinite}; the algebraic side is the syntactic-semigroup characterization of each class via Eilenberg [Eil76] variety equations (e.g., D = ⟦sx̄ = x̄⟧, ℒℐ = ⟦x^ω y x^ω z x^ω = x^ω y x^ω⟧ per [Str85] and [Alm95]). The Lean substrate (IsBTC, IsTierBased) lives in Subregular/Language/Multitier.lean; the algebraic characterization is queued for a future SyntacticMonoid PR.

Disclaimer 1: McCollum (2019) Uyghur gradience (linglib audit) #

This disclaimer is not a scope qualification carried by Lambert (2026); the paper does not cite McCollum. It is a linglib-internal audit annotation: Lambert's BTD analysis is faithful to [MM18]'s categorical idealization, and a separate literature line — [McC19a] — argues the suffix backness assignment is not categorical in the way the multitier-definite formula requires. The "arbitrarily specified, statistical tendency to be back" clause that Mayer & Major report for the no-V no-C case is precisely the locus where McCollum's gradient data resists categorical analysis. The headline theorem uyghur_backness_isBTD characterizes the categorical pattern only; the gradience is out of scope.

Disclaimer 2: Karanga Shona scope restriction #

The BTLI analysis applies to the verb-stem domain (post-hyphen material in Lambert (2026) (45)). [Jar20]'s motivation for a melody local class extends across morphological boundaries and to longer melodic patterns; the BTLI characterization is not a refutation of the broader melody-local programme but a delimited result for the verb-stem surface pattern. The headline theorem karanga_shona_verb_stem_isBTLI is named accordingly.

Cross-framework dialogue #

The multitier substrate is the prohibition reading of constraints scaled to Boolean closure. Cross-references the new file makes explicit (rather than silently diverging from existing linglib formalizations):

OT: linglib's Constraint framework places no complexity bound; Lambert says all phonotactics live in IsBTC. The supraregular counter-witness theorem and the positive mkForbidPairsOnTier ⊆ TSL_2 theorem are queued for a future Phonology/Subregular/OTBound.lean.
Harmonic Serialism: Studies/McPhersonLamont2026.lean proves Poko surface tone HS-derivable but parallel-OT-impossible. Lambert's static BTC characterization, applied to Poko's surface stringset, would clarify static description ≠ alternation explanation. Cross-reference to be added when Poko's surface stringset is independently classified.
Autosegmental: linglib's Phonology/Autosegmental/{ RegisterTier, GrammaticalTone}.lean formalize multiply-linked tone representations. Lambert (2026) §5 self-confesses that string-based analysis loses information for tone; the loss theorem is stated as lambert_string_input_loses_tone_associations (sorry'd) below.
OCP: Phonology/Subregular/OCP.lean carries a prohibition-vs-merger distinction; IsBTC is the mathematical home of the prohibition family at scale. The OCP file's docstring should gain a "see also: BTC" link in a follow-up retrofit.
Structure-sensitive MTSL [DSG19]: not formalized in linglib. Lambert's "BTD strictly supersedes SS-MTSL on Uyghur" is recorded as a TODO theorem (btd_supersedes_ss_mtsl_on_uyghur) for when SS-MTSL substrate lands.

Audit calibration note #

Per linglib's domain-expert agent calibration: the McCollum-2019 and Karanga-Shona-scope concerns are flagged HIGH but should be treated PROVISIONAL — they are corrections to scope, not refutations of the formal results. The Lean theorems below state the formal claims; the empirical disclaimers live in this docstring and the per-theorem docstrings.

Sandwich-word helpers #

The Tsuut'ina/Luganda counterexamples below are bookended words replicate kL aL ++ mid ++ replicate kR aR. These thin local wrappers specialise the generic List bookend lemmas (Core/Data/List/Bookend.lean) to the Edge-projection view used in the proofs.

[Lam26]: Multitier phonotactics with logic and algebra #

Disclaimer 1: McCollum (2019) Uyghur gradience (linglib audit) #

Disclaimer 2: Karanga Shona scope restriction #

Cross-framework dialogue #

Audit calibration note #

Sandwich-word helpers #

Tier predicates #

Atomic tier-projected definite languages #

The Uyghur backness language as conjunction of (35a)-(35b) #

Atomic IsGeneralizedDefinite languages (uniform k = 5) #

IsGeneralizedDefinite witnesses at k = 5 #

Sibilant-harmony grammars over the shared Sibilant alphabet #

Refutation: Tsuut'ina ∉ BTLI #

Refutation: Luganda ∉ BTLI #

Atomic IsGeneralizedDefinite languages (uniform `k = 5`) #

IsGeneralizedDefinite witnesses at `k = 5` #

Sibilant-harmony grammars over the shared `Sibilant` alphabet #