Documentation

Linglib.Phenomena.Definiteness.Studies.Moroney2021

Moroney (2021): Definiteness and Quantification — Evidence from Shan #

@cite{moroney-2021}

Shan (Southwestern Tai, Kra-Dai) bare nouns can be interpreted as indefinite, definite, generic, or kind-denoting. The key finding is that bare nouns in Shan express BOTH unique and anaphoric definiteness — contra @cite{jenks-2018}'s prediction that languages without overt definite articles mark at most one type of definiteness.

Core contributions formalized here #

  1. Revised definiteness marking typology (Table 4.1/4.4): adds an "unmarked" category where bare nouns express both unique and anaphoric definiteness. Languages: Shan, Serbian, Kannada.

  2. Bare noun interpretation distribution (Table 2.3): Shan and English bare nouns agree on low ∃, kind, and generic readings. They differ ONLY on definite readings — Shan bare nouns can be definite, English cannot.

  3. Type-shifting analysis: all bare nouns are base type ⟨s,⟨e,t⟩⟩. Definite readings arise via unblocked ι type-shift (no overt "the" to block it). Kind readings via ∩. Existential via DPP (Derived Predicate Predication, NMP.DPP), which yields obligatory low scope.

  4. Cross-linguistic definiteness data (Table 4.4): Shan uses bare nouns in ALL @cite{schwarz-2009} definite use types. Demonstrative-noun phrases (N Clf Dem) are optional in anaphoric/relational-bridging/donkey contexts where German requires the strong article and Mandarin/Thai require demonstratives.

What form a language uses to express definiteness in a given context.

Instances For
    @[implicit_reducible]
    Equations
    @[implicit_reducible]
    Equations
    def Moroney2021.instReprDefForm.repr :
    DefFormStd.Format
    Equations
    Instances For

      Cross-linguistic datum: what form does language L use for definite use type U? Connects @cite{hawkins-1978}'s use types (already in Features.Definiteness.DefiniteUseType) to actual morphological expression.

      Instances For
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For
          def Moroney2021.instDecidableEqDefExpressionDatum.decEq (x✝ x✝¹ : DefExpressionDatum) :
          Decidable (x✝ = x✝¹)
          Equations
          • One or more equations did not get rendered due to their size.
          Instances For

            German data (@cite{schwarz-2009}): weak article for situational uniqueness and part-whole bridging; strong article for anaphora, producer-product bridging, and donkey anaphora.

            Equations
            • One or more equations did not get rendered due to their size.
            Instances For

              Thai data (@cite{jenks-2015}): bare nouns for uniqueness contexts, demonstrative-noun phrases for anaphoric/relational contexts.

              Equations
              • One or more equations did not get rendered due to their size.
              Instances For

                Mandarin data (@cite{jenks-2018}): bare nouns for uniqueness contexts, demonstrative-noun phrases for anaphoric/relational/donkey contexts. Same pattern as Thai — Mandarin is classified as .markedAnaphoric in @cite{jenks-2018}'s typology.

                Equations
                • One or more equations did not get rendered due to their size.
                Instances For
                  theorem Moroney2021.mandarin_thai_same_pattern :
                  List.map (fun (x : DefExpressionDatum) => x.form) mandarinData = List.map (fun (x : DefExpressionDatum) => x.form) thaiData

                  Mandarin and Thai have the same definiteness expression pattern: bare for uniqueness, demonstrative for anaphoric/relational/donkey.

                  Shan data (@cite{moroney-2021} Table 4.4): bare nouns in ALL contexts. Demonstratives optional in anaphoric and relational-bridging contexts. This is the key empirical finding — Shan bare nouns cover ALL of Schwarz's definite use types, unlike Mandarin/Thai (anaphoric requires dem) or German (weak/strong articles).

                  Equations
                  • One or more equations did not get rendered due to their size.
                  Instances For

                    Shan bare nouns are acceptable in every definite use type.

                    German requires a distinct article form for every context — no bare nouns.

                    The five possible interpretations of bare nouns.

                    Instances For
                      @[implicit_reducible]
                      Equations
                      Equations
                      • One or more equations did not get rendered due to their size.
                      Instances For

                        Availability of a bare noun interpretation in Shan vs English.

                        • shanCount : Bool
                        • shanMass : Bool
                        • englishCount : Bool
                        • englishMass : Bool
                        Instances For
                          Equations
                          • One or more equations did not get rendered due to their size.
                          Instances For
                            def Moroney2021.instDecidableEqInterpAvailability.decEq (x✝ x✝¹ : InterpAvailability) :
                            Decidable (x✝ = x✝¹)
                            Equations
                            • One or more equations did not get rendered due to their size.
                            Instances For

                              Table 2.3: bare noun interpretation distribution in Shan and English.

                              Shan and English agree on four of five readings. The sole difference is the definite reading: Shan ✓ (via unblocked ι), English ✗ (ι blocked by overt the).

                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For

                                The definite interpretation is the ONLY point where Shan and English bare nouns differ (Table 2.3).

                                High scope existential is universally unavailable for bare nouns — a consequence of DPP/DKP locality (@cite{chierchia-1998}). The existential introduced by DPP applies at the point of composition (vP level), so it cannot scope above negation.

                                Shan has no overt determiners: all type-shifts are unblocked.

                                Contrast with English (MeaningPreservation.englishBlocking): the presence of the blocks covert ι, and a/some block covert ∃. In Shan, the absence of articles means the blocking principle imposes no constraints on covert type-shifting. Crucially, both ι AND ι^x are unblocked — this is what allows Shan bare nouns to express both unique and anaphoric definiteness (@cite{moroney-2021} §4.3).

                                Derived from Fragments.Shan.Definiteness.blocking — the single source of truth for Shan's article inventory.

                                Equations
                                Instances For
                                  theorem Moroney2021.shan_neutral_prefers_iota :
                                  have ctx := { number := Semantics.Kinds.MeaningPreservation.NumberFeature.neutral, downDefined := false, iotaBlocked := false, iotaAnaphoricBlocked := false, existsBlocked := false, instantiationAccessible := true }; Semantics.Kinds.MeaningPreservation.selectShift ctx = some Semantics.Kinds.MeaningPreservation.TypeShift.iota

                                  When a Shan bare noun is used in a context requiring unique definiteness, the preferred type-shift is ι (definite), by Meaning Preservation ({∩, ι, ι^x} > ∃). Number-neutral nouns allow both ι and ∩, but ∩ requires a kind-compatible predicate (downDefined).

                                  Compare: English singular nouns get none (MeaningPreservation.dayal_consistent_english_bare_singular_out).

                                  theorem Moroney2021.shan_neutral_kind_prefers_down :
                                  have ctx := { number := Semantics.Kinds.MeaningPreservation.NumberFeature.neutral, downDefined := true, iotaBlocked := false, iotaAnaphoricBlocked := false, existsBlocked := false, instantiationAccessible := true }; Semantics.Kinds.MeaningPreservation.selectShift ctx = some Semantics.Kinds.MeaningPreservation.TypeShift.down

                                  When a Shan bare noun is used with a kind-compatible predicate, ∩ is selected (it appears first in availableShifts for number-neutral nouns with downDefined).

                                  Number-neutral nouns in Shan make BOTH ∩ and ι available simultaneously when the predicate is kind-compatible. This correctly predicts the ambiguity between definite and kind readings for Shan bare nouns.

                                  theorem Moroney2021.shan_english_definiteness_contrast :
                                  Semantics.Kinds.MeaningPreservation.selectShift { number := Semantics.Kinds.MeaningPreservation.NumberFeature.neutral, downDefined := false, iotaBlocked := false, iotaAnaphoricBlocked := false, existsBlocked := false, instantiationAccessible := true } = some Semantics.Kinds.MeaningPreservation.TypeShift.iota Semantics.Kinds.MeaningPreservation.selectShift { number := Semantics.Kinds.MeaningPreservation.NumberFeature.sg, downDefined := false, iotaBlocked := true, iotaAnaphoricBlocked := true, existsBlocked := true, instantiationAccessible := true } = none

                                  The Shan–English definiteness contrast derived from blocking.

                                  Same base noun type (⟨s,⟨e,t⟩⟩), same type-shifting operations, different article inventories. Shan has no "the" → ι unblocked → bare nouns can be definite. English has "the" → ι blocked → bare nouns cannot be definite (must use overt determiner).

                                  theorem Moroney2021.shan_thai_anaphoric_contrast :
                                  Semantics.Kinds.MeaningPreservation.TypeShift.iotaAnaphoric Semantics.Kinds.MeaningPreservation.availableShifts { number := Semantics.Kinds.MeaningPreservation.NumberFeature.neutral, downDefined := false, iotaBlocked := false, iotaAnaphoricBlocked := false, existsBlocked := false, instantiationAccessible := true } Semantics.Kinds.MeaningPreservation.TypeShift.iotaAnaphoricSemantics.Kinds.MeaningPreservation.availableShifts { number := Semantics.Kinds.MeaningPreservation.NumberFeature.neutral, downDefined := false, iotaBlocked := false, iotaAnaphoricBlocked := true, existsBlocked := false, instantiationAccessible := true }

                                  The Shan–Thai anaphoric definiteness contrast derived from blocking.

                                  Shan: ι^x is unblocked → bare nouns can be anaphorically definite. Thai: ι^x is blocked by demonstrative → demonstrative required for anaphoric definiteness. Both languages have unblocked ι (unique definiteness via bare nouns).

                                  theorem Moroney2021.shan_exists_is_last_resort :
                                  (Semantics.Kinds.MeaningPreservation.availableShifts { number := Semantics.Kinds.MeaningPreservation.NumberFeature.neutral, downDefined := false, iotaBlocked := false, iotaAnaphoricBlocked := false, existsBlocked := false, instantiationAccessible := true }).head? = some Semantics.Kinds.MeaningPreservation.TypeShift.iota Semantics.Kinds.MeaningPreservation.TypeShift.exists Semantics.Kinds.MeaningPreservation.availableShifts { number := Semantics.Kinds.MeaningPreservation.NumberFeature.neutral, downDefined := false, iotaBlocked := false, iotaAnaphoricBlocked := false, existsBlocked := false, instantiationAccessible := true }

                                  ∃ is available as a last resort in Shan (when ∩ and ι are inapplicable), but by Meaning Preservation it is always dispreferred. This means bare nouns default to definite/kind, not existential — the existential reading arises only via DPP at vP.

                                  The central Moroney insight: morphological marking ≠ semantic availability.

                                  Shan morphologically distinguishes zero presupposition types (no articles) but semantically expresses both unique and anaphoric definiteness (via covert type-shifting). The bridge between article inventory and semantic availability is the blocking principle: no articles → no blocking → all type-shifts (ι, ι^x, ∩) available.

                                  The four Table 4.4 languages classify into the four strategy cells when the strategy is computed from each language's Fragments.{Lang}.Definiteness.articleInventory. The classification is not stipulated — it is derived by ArticleInventory.toMarkingStrategy from the morphological inventory bools.

                                  theorem Moroney2021.type_shift_referent_agreement (E : Type) (domain : List E) (restrictor : EBool) :
                                  (Core.Nominal.russellIotaList domain fun (e : E) => restrictor e && true) = Core.Nominal.russellIotaList domain restrictor

                                  The type-shift system and the canonical referent selector agree:

                                  • ι (unique definiteness) corresponds to russellIotaList domain R — the Russellian iota over the bare restrictor
                                  • ι^x (anaphoric definiteness) corresponds to russellIotaList domain (R ∧ Q) — the Russellian iota over the intersection of restrictor and anaphoric filter

                                  When Q is vacuously true, the intersected predicate R ∧ true equals R, so ι^x reduces to ι at the referent-selector layer. The denotational counterpart (presupOfReferent of these selectors) inherits this collapse by congruence.

                                  theorem Moroney2021.dpp_scope_below_neg (interp : InterpAvailability) :
                                  interp interpretationTableinterp.interp = BareNounInterp.highExistentialinterp.shanCount = false interp.shanMass = false interp.englishCount = false interp.englishMass = false

                                  DPP yields obligatory low scope existential: the existential introduced by DPP applies at the vP level, so it cannot scope above negation. This is why highExistential is universally unavailable for bare nouns (@cite{chierchia-1998}; @cite{moroney-2021} §2.3).

                                  The theorem derives the universal blocking from the data table rather than stipulating it.

                                  Concrete witness of FakeMass behavior: Shan bare count nouns like mǎa 'dog' are CUM (the sum of two dogs is dogs) but not g-homogeneous (a dog's leg is part of a dog but is not itself a dog).

                                  We construct a three-element partial order: two atoms a, b and their join ab = ab. The predicate isDog holds of a, b, and ab (CUM), but fails g-homogeneity at ab because its proper parts a and b could have sub-parts (in a richer model) that are not dogs. Here we use the atoms directly: ab has proper parts a and b which ARE dogs, so g-homogeneity holds vacuously on this small model. The genuine failure requires non-atomic non-P parts, which we model by adding a non-dog atom c with cab (representing a dog-leg).

                                  Instances For
                                    @[implicit_reducible]
                                    Equations
                                    Equations
                                    • One or more equations did not get rendered due to their size.
                                    Instances For
                                      @[implicit_reducible]
                                      Equations
                                      • One or more equations did not get rendered due to their size.

                                      isDog is not g-homogeneous: ab is a dog, c < ab, but no dog z ≤ c exists (since c is an atom and isDog c = False).

                                      The blocking principle connects article inventory to available type-shifts, and ArticleInventory.toMarkingStrategy connects inventory to marking strategy. This theorem shows the full correspondence for the four Table 4.4 languages: the same inventory bools that determine the marking strategy also determine which type-shifts are blocked.

                                      This is the structural core of Moroney's analysis: article inventory is the single parameter from which both the typological classification AND the available interpretations of bare nouns are derived.

                                      theorem Moroney2021.demonstrative_adds_spatial_info {E : Type} (domain : List E) (restrictor : EBool) (spatialPred : Core.Deixis.FeatureEBool) :
                                      (Fragments.Shan.Definiteness.demDenotation domain Fragments.Shan.Definiteness.naj restrictor spatialPred = Core.Nominal.russellIotaList domain fun (e : E) => restrictor e && spatialPred Core.Deixis.Feature.proximal e) (Fragments.Shan.Definiteness.demDenotation domain Fragments.Shan.Definiteness.nan restrictor spatialPred = Core.Nominal.russellIotaList domain fun (e : E) => restrictor e && spatialPred Core.Deixis.Feature.distal e) Fragments.Shan.Definiteness.bareDefinite domain restrictor = Core.Nominal.russellIotaList domain restrictor

                                      Shan demonstratives refine the bare definite by adding a spatial filter to the referent selector:

                                      • Bare noun: russellIotaList domain restrictor — any unique satisfier
                                      • nâj: russellIotaList domain (restrictor && spatialPred .proximal)
                                      • nân: russellIotaList domain (restrictor && spatialPred .distal)

                                      The demonstrative is always optional in Shan because the bare noun already provides a definite reading via unblocked ι. The demonstrative adds information (spatial restriction) but never replaces an unavailable reading (unlike Thai/Mandarin where demonstratives are required for anaphoric definiteness).

                                      When the bare definite already selects a referent that satisfies the demonstrative's spatial predicate, the demonstrative agrees with the bare form (handled by Fragments.Shan.Definiteness.dem_refines_bare).

                                      Shan is a CLF-for-N language: the classifier atomizes the noun denotation (@cite{little-moroney-royer-2022}; @cite{moroney-2021} Ch. 3).

                                      The classifier semantics module provides clfForNoun as a thin wrapper around Mereology.atomize. This bridge confirms that Shan classifiers use the atomization strategy (CLF-for-N), connecting the Shan fragment's ClassifierStrategy.forNoun to the denotation function.

                                      The §1–§7 derivation works at the level of DefMarkingParams (three booleans). Core.Nominal.ArticleInventory is the upstream object — it records the morphological inventory directly (indefinite article, unique article, anaphoric article, syncretism flag, demonstrative paradigm, possessive paradigm) and derives the DefMarkingParams reading.

                                      This section verifies that the inventory-derived classifications agree with the parameters used in §7 for all four languages, and connects the licensing predicate ArticleInventory.licensesKind to Moroney's central empirical finding: Shan licenses anaphoric definiteness without any anaphoric article.

                                      @[reducible, inline]

                                      Shorthand handles for the four Table 4.4 inventories, each defined in its language fragment (Fragments.{Lang}.Definiteness.articleInventory). Centralizing the names here keeps the §14 theorems readable without duplicating fragment-level data.

                                      Equations
                                      Instances For

                                        Inventory-derived strategies match §7's derive_all_languages for the four Table 4.4 languages. The inventory subsumes the params layer (the §7 *Params defs are now inv.toMarkingParams projections, so the agreement theorem that previously lived here is rfl-tautological and has been removed).

                                        Mandarin is in .markedAnaphoric — same cell as Thai. (Not part of Moroney's Table 4.4 but anchors the Jenks 2018 typological backdrop.)

                                        Moroney's central observation, stated against the article inventory: Shan has no article that licenses an .anaphoric NominalKind, yet expresses anaphoric definiteness through bare nouns and optional demonstratives. The licensing predicate makes this morphologically visible — .anaphoric is not licensed by Shan's inventory.

                                        Bare nominals are licensed for Shan (and every language) — this is the morphological substrate for Moroney's analysis: Shan's anaphoric definites surface as bare nouns.

                                        Demonstratives are licensed in Shan (the nâj/nân paradigm). Combined with shan_bare_licensed, this gives the morphological inventory of strategies Shan deploys for definite reference.

                                        English licenses .anaphoric via the syncretic the (uniqueArticle ∧ syncretism), without an independent strong article. Contrasts with Shan (no licensing form at all) and German (independent strong form).

                                        German licenses .anaphoric via its independent strong article (no syncretism). The unique vs anaphoric distinction is morphologically marked.

                                        The English and Mandarin inventories both collapse to ArticleType.weakOnly, witnessing the lossiness of ArticleType relative to DefMarkingStrategy: the inventories differ (English has a unique article, Mandarin does not), and the strategies differ (.generallyMarked vs .markedAnaphoric), yet toArticleType collapses both to .weakOnly.

                                        The English and Mandarin inventories themselves are distinct, even though their ArticleType classifications collide. They differ on hasUniqueArticle (English True, Mandarin False).

                                        Shan-specific consequence of Core.Nominal.interpret_bare_eq_unique: a bare definite description and a uniqueness definite over the same restrictor select the same referent. This is the Core-API analogue of Moroney's claim that bare nouns in Shan express weak/uniqueness definiteness via unblocked ι.

                                        Shan-specific consequence of Core.Nominal.interpret_demonstrative_eq_anaphoric: the demonstrative's deictic feature is a presupposition filter, not a referent selector. Demonstrative- and anaphoric-marked descriptions over the same restrictor and discourse index pick the same entity. This is the type-theoretic correlate of Moroney's claim that nâj/nân add spatial content rather than substituting a different selector.

                                        @cite{jenks-2018} §7 proposed a typology of definiteness marking with three attested cells (.generallyMarked, .bipartite, .markedAnaphoric) and one unattested cell (a language overtly marking only unique definites). The empirical core of @cite{moroney-2021} is the discovery that Shan instantiates a fourth attested cell — .unmarked — that Jenks's three-cell space had no slot for: bare nouns express both unique and anaphoric definiteness without any obligatory morphological marking.

                                        The theorems below state the refutation against the substrate. Shan derives .unmarked (already proved in §7 / §14); .unmarked is distinct from each of Jenks's three cells (already proved in §6); the new content is the joint statement that Shan instantiates a strategy not in the Jenks-attested set.

                                        The Moroney refutation in one statement: Shan instantiates a marking strategy that @cite{jenks-2018}'s typology predicted to be unattested. This is the formal content of the prose claim "contra @cite{jenks-2018}'s prediction" in this file's module docstring.