Moroney (2021): Definiteness and Quantification — Evidence from Shan #
@cite{moroney-2021}
Shan (Southwestern Tai, Kra-Dai) bare nouns can be interpreted as indefinite, definite, generic, or kind-denoting. The key finding is that bare nouns in Shan express BOTH unique and anaphoric definiteness — contra @cite{jenks-2018}'s prediction that languages without overt definite articles mark at most one type of definiteness.
Core contributions formalized here #
Revised definiteness marking typology (Table 4.1/4.4): adds an "unmarked" category where bare nouns express both unique and anaphoric definiteness. Languages: Shan, Serbian, Kannada.
Bare noun interpretation distribution (Table 2.3): Shan and English bare nouns agree on low ∃, kind, and generic readings. They differ ONLY on definite readings — Shan bare nouns can be definite, English cannot.
Type-shifting analysis: all bare nouns are base type ⟨s,⟨e,t⟩⟩. Definite readings arise via unblocked ι type-shift (no overt "the" to block it). Kind readings via ∩. Existential via DPP (Derived Predicate Predication,
NMP.DPP), which yields obligatory low scope.Cross-linguistic definiteness data (Table 4.4): Shan uses bare nouns in ALL @cite{schwarz-2009} definite use types. Demonstrative-noun phrases (N Clf Dem) are optional in anaphoric/relational-bridging/donkey contexts where German requires the strong article and Mandarin/Thai require demonstratives.
Equations
- Moroney2021.instDecidableEqDefForm x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- Moroney2021.instReprDefForm = { reprPrec := Moroney2021.instReprDefForm.repr }
Equations
- One or more equations did not get rendered due to their size.
- Moroney2021.instReprDefForm.repr Moroney2021.DefForm.bare prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Moroney2021.DefForm.bare")).group prec✝
- Moroney2021.instReprDefForm.repr Moroney2021.DefForm.dem prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Moroney2021.DefForm.dem")).group prec✝
Instances For
Cross-linguistic datum: what form does language L use for definite use
type U? Connects @cite{hawkins-1978}'s use types (already in
Features.Definiteness.DefiniteUseType) to actual morphological expression.
- language : String
- useType : Features.Definiteness.DefiniteUseType
- bridgingSubtype : Option Features.Definiteness.BridgingSubtype
- form : DefForm
Instances For
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
German data (@cite{schwarz-2009}): weak article for situational uniqueness and part-whole bridging; strong article for anaphora, producer-product bridging, and donkey anaphora.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Thai data (@cite{jenks-2015}): bare nouns for uniqueness contexts, demonstrative-noun phrases for anaphoric/relational contexts.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Mandarin data (@cite{jenks-2018}): bare nouns for uniqueness contexts,
demonstrative-noun phrases for anaphoric/relational/donkey contexts.
Same pattern as Thai — Mandarin is classified as .markedAnaphoric
in @cite{jenks-2018}'s typology.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Mandarin and Thai have the same definiteness expression pattern: bare for uniqueness, demonstrative for anaphoric/relational/donkey.
Shan data (@cite{moroney-2021} Table 4.4): bare nouns in ALL contexts. Demonstratives optional in anaphoric and relational-bridging contexts. This is the key empirical finding — Shan bare nouns cover ALL of Schwarz's definite use types, unlike Mandarin/Thai (anaphoric requires dem) or German (weak/strong articles).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Shan bare nouns are acceptable in every definite use type.
German requires a distinct article form for every context — no bare nouns.
The five possible interpretations of bare nouns.
- lowExistential : BareNounInterp
- highExistential : BareNounInterp
- definite : BareNounInterp
- kind : BareNounInterp
- generic : BareNounInterp
Instances For
Equations
- Moroney2021.instDecidableEqBareNounInterp x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- Moroney2021.instReprBareNounInterp = { reprPrec := Moroney2021.instReprBareNounInterp.repr }
Availability of a bare noun interpretation in Shan vs English.
- interp : BareNounInterp
- shanCount : Bool
- shanMass : Bool
- englishCount : Bool
- englishMass : Bool
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Table 2.3: bare noun interpretation distribution in Shan and English.
Shan and English agree on four of five readings. The sole difference is the definite reading: Shan ✓ (via unblocked ι), English ✗ (ι blocked by overt the).
Equations
- One or more equations did not get rendered due to their size.
Instances For
The definite interpretation is the ONLY point where Shan and English bare nouns differ (Table 2.3).
High scope existential is universally unavailable for bare nouns — a consequence of DPP/DKP locality (@cite{chierchia-1998}). The existential introduced by DPP applies at the point of composition (vP level), so it cannot scope above negation.
Shan has no overt determiners: all type-shifts are unblocked.
Contrast with English (MeaningPreservation.englishBlocking): the presence of
the blocks covert ι, and a/some block covert ∃. In Shan, the
absence of articles means the blocking principle imposes no constraints
on covert type-shifting. Crucially, both ι AND ι^x are unblocked —
this is what allows Shan bare nouns to express both unique and anaphoric
definiteness (@cite{moroney-2021} §4.3).
Derived from Fragments.Shan.Definiteness.blocking — the single source
of truth for Shan's article inventory.
Instances For
When a Shan bare noun is used in a context requiring unique definiteness,
the preferred type-shift is ι (definite), by Meaning Preservation
({∩, ι, ι^x} > ∃). Number-neutral nouns allow both ι and ∩, but ∩
requires a kind-compatible predicate (downDefined).
Compare: English singular nouns get none (MeaningPreservation.dayal_consistent_english_bare_singular_out).
When a Shan bare noun is used with a kind-compatible predicate,
∩ is selected (it appears first in availableShifts for number-neutral
nouns with downDefined).
Number-neutral nouns in Shan make BOTH ∩ and ι available simultaneously when the predicate is kind-compatible. This correctly predicts the ambiguity between definite and kind readings for Shan bare nouns.
The Shan–English definiteness contrast derived from blocking.
Same base noun type (⟨s,⟨e,t⟩⟩), same type-shifting operations, different article inventories. Shan has no "the" → ι unblocked → bare nouns can be definite. English has "the" → ι blocked → bare nouns cannot be definite (must use overt determiner).
The Shan–Thai anaphoric definiteness contrast derived from blocking.
Shan: ι^x is unblocked → bare nouns can be anaphorically definite. Thai: ι^x is blocked by demonstrative → demonstrative required for anaphoric definiteness. Both languages have unblocked ι (unique definiteness via bare nouns).
∃ is available as a last resort in Shan (when ∩ and ι are inapplicable), but by Meaning Preservation it is always dispreferred. This means bare nouns default to definite/kind, not existential — the existential reading arises only via DPP at vP.
Shan's unmarked strategy correctly maps to ArticleType.none_.
Features.Definiteness.articleTypeToDistinguishedPresup correctly returns
zero morphologically distinguished presupposition types for Shan.
The central Moroney insight: morphological marking ≠ semantic availability.
Shan morphologically distinguishes zero presupposition types (no articles) but semantically expresses both unique and anaphoric definiteness (via covert type-shifting). The bridge between article inventory and semantic availability is the blocking principle: no articles → no blocking → all type-shifts (ι, ι^x, ∩) available.
Moroney's new category is genuinely distinct from the three existing ones.
The four Table 4.4 languages classify into the four strategy cells
when the strategy is computed from each language's
Fragments.{Lang}.Definiteness.articleInventory. The classification is
not stipulated — it is derived by ArticleInventory.toMarkingStrategy
from the morphological inventory bools.
The inventory-derived ArticleType agrees with Schwarz's stipulated
typology for the four Table 4.4 languages. The classification is
derived rather than assigned by fiat — toArticleType composes
toMarkingStrategy with the strategy → articleType collapse.
The type-shift system and the canonical referent selector agree:
- ι (unique definiteness) corresponds to
russellIotaList domain R— the Russellian iota over the bare restrictor - ι^x (anaphoric definiteness) corresponds to
russellIotaList domain (R ∧ Q)— the Russellian iota over the intersection of restrictor and anaphoric filter
When Q is vacuously true, the intersected predicate R ∧ true equals R,
so ι^x reduces to ι at the referent-selector layer. The denotational
counterpart (presupOfReferent of these selectors) inherits this collapse
by congruence.
DPP yields obligatory low scope existential: the existential
introduced by DPP applies at the vP level, so it cannot scope above
negation. This is why highExistential is universally unavailable for
bare nouns (@cite{chierchia-1998}; @cite{moroney-2021} §2.3).
The theorem derives the universal blocking from the data table rather than stipulating it.
Concrete witness of FakeMass behavior: Shan bare count nouns like
mǎa 'dog' are CUM (the sum of two dogs is dogs) but not g-homogeneous
(a dog's leg is part of a dog but is not itself a dog).
We construct a three-element partial order: two atoms a, b and their
join ab = a ⊔ b. The predicate isDog holds of a, b, and ab
(CUM), but fails g-homogeneity at ab because its proper parts a and
b could have sub-parts (in a richer model) that are not dogs. Here we
use the atoms directly: ab has proper parts a and b which ARE dogs,
so g-homogeneity holds vacuously on this small model. The genuine failure
requires non-atomic non-P parts, which we model by adding a non-dog atom
c with c ≤ ab (representing a dog-leg).
- a : FakeMassEntity
- b : FakeMassEntity
- c : FakeMassEntity
- ab : FakeMassEntity
Instances For
Equations
- Moroney2021.instDecidableEqFakeMassEntity x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- Moroney2021.instReprFakeMassEntity = { reprPrec := Moroney2021.instReprFakeMassEntity.repr }
Equations
- One or more equations did not get rendered due to their size.
The blocking principle connects article inventory to available type-shifts,
and ArticleInventory.toMarkingStrategy connects inventory to marking
strategy. This theorem shows the full correspondence for the four
Table 4.4 languages: the same inventory bools that determine the
marking strategy also determine which type-shifts are blocked.
This is the structural core of Moroney's analysis: article inventory is the single parameter from which both the typological classification AND the available interpretations of bare nouns are derived.
Shan demonstratives refine the bare definite by adding a spatial filter to the referent selector:
- Bare noun:
russellIotaList domain restrictor— any unique satisfier - nâj:
russellIotaList domain (restrictor && spatialPred .proximal) - nân:
russellIotaList domain (restrictor && spatialPred .distal)
The demonstrative is always optional in Shan because the bare noun already provides a definite reading via unblocked ι. The demonstrative adds information (spatial restriction) but never replaces an unavailable reading (unlike Thai/Mandarin where demonstratives are required for anaphoric definiteness).
When the bare definite already selects a referent that satisfies the
demonstrative's spatial predicate, the demonstrative agrees with the
bare form (handled by Fragments.Shan.Definiteness.dem_refines_bare).
Shan is a CLF-for-N language: the classifier atomizes the noun denotation (@cite{little-moroney-royer-2022}; @cite{moroney-2021} Ch. 3).
The classifier semantics module provides clfForNoun as a thin wrapper
around Mereology.atomize. This bridge confirms that Shan classifiers
use the atomization strategy (CLF-for-N), connecting the Shan fragment's
ClassifierStrategy.forNoun to the denotation function.
The §1–§7 derivation works at the level of DefMarkingParams (three
booleans). Core.Nominal.ArticleInventory is the upstream object — it
records the morphological inventory directly (indefinite article, unique
article, anaphoric article, syncretism flag, demonstrative paradigm,
possessive paradigm) and derives the DefMarkingParams reading.
This section verifies that the inventory-derived classifications agree
with the parameters used in §7 for all four languages, and connects the
licensing predicate ArticleInventory.licensesKind to Moroney's central
empirical finding: Shan licenses anaphoric definiteness without any
anaphoric article.
Shorthand handles for the four Table 4.4 inventories, each defined in
its language fragment (Fragments.{Lang}.Definiteness.articleInventory).
Centralizing the names here keeps the §14 theorems readable without
duplicating fragment-level data.
Instances For
Instances For
Instances For
Instances For
Instances For
Inventory-derived strategies match §7's derive_all_languages for the
four Table 4.4 languages. The inventory subsumes the params layer
(the §7 *Params defs are now inv.toMarkingParams projections, so
the agreement theorem that previously lived here is rfl-tautological
and has been removed).
Mandarin is in .markedAnaphoric — same cell as Thai. (Not part of
Moroney's Table 4.4 but anchors the Jenks 2018 typological backdrop.)
Moroney's central observation, stated against the article inventory:
Shan has no article that licenses an .anaphoric NominalKind,
yet expresses anaphoric definiteness through bare nouns and optional
demonstratives. The licensing predicate makes this morphologically
visible — .anaphoric is not licensed by Shan's inventory.
Bare nominals are licensed for Shan (and every language) — this is the morphological substrate for Moroney's analysis: Shan's anaphoric definites surface as bare nouns.
Demonstratives are licensed in Shan (the nâj/nân paradigm).
Combined with shan_bare_licensed, this gives the morphological
inventory of strategies Shan deploys for definite reference.
English licenses .anaphoric via the syncretic the (uniqueArticle ∧
syncretism), without an independent strong article. Contrasts with
Shan (no licensing form at all) and German (independent strong form).
German licenses .anaphoric via its independent strong article (no
syncretism). The unique vs anaphoric distinction is morphologically
marked.
The English and Mandarin inventories both collapse to ArticleType.weakOnly,
witnessing the lossiness of ArticleType relative to DefMarkingStrategy:
the inventories differ (English has a unique article, Mandarin does not),
and the strategies differ (.generallyMarked vs .markedAnaphoric),
yet toArticleType collapses both to .weakOnly.
The English and Mandarin inventories themselves are distinct, even
though their ArticleType classifications collide. They differ on
hasUniqueArticle (English True, Mandarin False).
Shan-specific consequence of Core.Nominal.interpret_bare_eq_unique:
a bare definite description and a uniqueness definite over the same
restrictor select the same referent. This is the Core-API analogue of
Moroney's claim that bare nouns in Shan express weak/uniqueness
definiteness via unblocked ι.
Shan-specific consequence of Core.Nominal.interpret_demonstrative_eq_anaphoric:
the demonstrative's deictic feature is a presupposition filter, not a
referent selector. Demonstrative- and anaphoric-marked descriptions
over the same restrictor and discourse index pick the same entity.
This is the type-theoretic correlate of Moroney's claim that nâj/nân
add spatial content rather than substituting a different selector.
@cite{jenks-2018} §7 proposed a typology of definiteness marking with
three attested cells (.generallyMarked, .bipartite, .markedAnaphoric)
and one unattested cell (a language overtly marking only unique definites).
The empirical core of @cite{moroney-2021} is the discovery that Shan
instantiates a fourth attested cell — .unmarked — that Jenks's
three-cell space had no slot for: bare nouns express both unique and
anaphoric definiteness without any obligatory morphological marking.
The theorems below state the refutation against the substrate. Shan
derives .unmarked (already proved in §7 / §14); .unmarked is
distinct from each of Jenks's three cells (already proved in §6); the
new content is the joint statement that Shan instantiates a strategy
not in the Jenks-attested set.
Shan's morphologically-derived strategy is not in the
@cite{jenks-2018}-attested set (imported from
Phenomena.Definiteness.Studies.Jenks2018.jenksAttestedStrategies).
The Moroney refutation in one statement: Shan instantiates a marking strategy that @cite{jenks-2018}'s typology predicted to be unattested. This is the formal content of the prose claim "contra @cite{jenks-2018}'s prediction" in this file's module docstring.