Article-and-demonstrative typology: substrate types and WALS data
@cite{wals-2013} (Chs 37, 38, 41, 42, 43) @cite{bhat-2013} @cite{diessel-2013} @cite{dryer-haspelmath-2013} @cite{greenberg-1978} @cite{himmelmann-1997}
Type-level enums + per-language profile struct for definiteness marking, indefinite articles, and demonstrative systems across @cite{wals-2013} chapters 37, 38, 41, 42, 43, plus WALS distribution data, the principal cross-linguistic generalizations, and the demonstrative→article→affix grammaticalization cline.
Schema
- DefiniteArticleType (Ch 37): how (or whether) definiteness is marked
- IndefiniteArticleType (Ch 38): indefinite article strategy
- DemDistanceSystem (Ch 41): number of distance contrasts
- DemOrientationType: distance- vs person-oriented (for 3-way systems)
- DemFormRelation (Ch 42): pronominal vs adnominal demonstrative form
- PronounDemRelation (Ch 43): 3rd-pronoun ↔ demonstrative relationship
- ArticleDemProfile: per-language bundle (all five chapters)
- GrammaticalizationStage: stages of definiteness grammaticalization
Per-language data lives in Fragments/{Lang}/Reference.lean.
Definite article type (WALS Ch 37, @cite{dryer-haspelmath-2013}). The marked categories correspond to points on the grammaticalization cline: demonstrative → definite word → definite affix.
- definiteWord : DefiniteArticleType
Definite word distinct from demonstratives (e.g., English the).
- definiteAffix : DefiniteArticleType
Definite affix on the noun (e.g., Danish -en, Arabic al-).
- demonstrativeUsed : DefiniteArticleType
No dedicated definite article; a demonstrative is used for definiteness (e.g., Ojibwa, Swahili).
- noDefButIndef : DefiniteArticleType
No definite article, but language has an indefinite article.
- noArticle : DefiniteArticleType
Neither definite nor indefinite article.
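As a rough illustration, an enum with these five constructors might be declared as follows in Lean 4. This is a sketch, not the source declaration, which this page does not show: the `Sketch` namespace is hypothetical, and the `deriving` clause is an assumption based on the `DecidableEq` instance listed for the type.

```lean
namespace Sketch

/-- Hypothetical reconstruction of the Ch 37 enum; constructor names
    follow the list above. -/
inductive DefiniteArticleType where
  | definiteWord
  | definiteAffix
  | demonstrativeUsed
  | noDefButIndef
  | noArticle
  deriving DecidableEq, Repr

-- Decidable equality lets constructors be compared by `decide` or in `if`.
example :
    DefiniteArticleType.definiteWord ≠ DefiniteArticleType.definiteAffix := by
  decide

-- Displays the constructor through the derived `Repr` instance.
#eval DefiniteArticleType.demonstrativeUsed

end Sketch
```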
Equations
- Typology.instDecidableEqDefiniteArticleType x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Indefinite article type (WALS Ch 38, @cite{dryer-haspelmath-2013}). Languages either have a dedicated indefinite word, use the numeral 'one' as an indefinite (the most common grammaticalization path), have an indefinite affix, or lack indefinite articles entirely.
- indefiniteWord : IndefiniteArticleType
Indefinite word distinct from the numeral 'one' (e.g., English a).
- numeralOne : IndefiniteArticleType
Numeral 'one' used as indefinite article (e.g., German ein).
- indefiniteAffix : IndefiniteArticleType
Indefinite affix on noun.
- noIndefButDef : IndefiniteArticleType
No indefinite article, but language has a definite article.
- noArticle : IndefiniteArticleType
Neither indefinite nor definite article.
Equations
- Typology.instDecidableEqIndefiniteArticleType x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Number of distance contrasts in adnominal demonstratives (WALS Ch 41, @cite{diessel-2013}). Two-way systems are by far the most common (54.3%), followed by three-way (37.6%).
- noContrast : DemDistanceSystem
No distance contrast (e.g., Modern German dieser).
- twoWay : DemDistanceSystem
Two-way contrast: proximal vs distal (e.g., English this/that).
- threeWay : DemDistanceSystem
Three-way contrast (e.g., Japanese ko/so/a, Latin hic/iste/ille).
- fourWay : DemDistanceSystem
Four-way contrast (e.g., Hausa).
- fiveOrMore : DemDistanceSystem
Five or more distance contrasts.
Equations
- Typology.instDecidableEqDemDistanceSystem x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- Typology.instReprDemDistanceSystem = { reprPrec := Typology.instReprDemDistanceSystem.repr }
Whether a three-way demonstrative system is distance- or person-oriented. Per @cite{diessel-2013}, about two-thirds of three-way systems are distance-oriented and one-third are person-oriented; Japanese is the canonical person-oriented system.
- distanceOriented : DemOrientationType
All terms encode distance from speaker (proximal/medial/distal).
- personOriented : DemOrientationType
One term encodes proximity to the hearer (near-speaker / near-hearer / distal).
- notApplicable : DemOrientationType
Not applicable (system is not three-way).
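The relationship between the distance system and the orientation value can be sketched as a consistency check. The function `orientationConsistent` and the `Sketch` namespace are hypothetical and not part of the module; the rule it encodes (an orientation is given exactly when the system is three-way) is an assumption read off the `notApplicable` docstring.

```lean
namespace Sketch

inductive DemDistanceSystem where
  | noContrast | twoWay | threeWay | fourWay | fiveOrMore
  deriving DecidableEq, Repr

inductive DemOrientationType where
  | distanceOriented | personOriented | notApplicable
  deriving DecidableEq, Repr

/-- Hypothetical well-formedness check: an orientation value is given
    exactly when the distance system is three-way. -/
def orientationConsistent : DemDistanceSystem → DemOrientationType → Bool
  | .threeWay, .notApplicable => false
  | .threeWay, _              => true
  | _,         .notApplicable => true
  | _,         _              => false

-- Japanese ko/so/a: three-way and person-oriented.
example : orientationConsistent .threeWay .personOriented = true := rfl
-- English this/that: two-way, so orientation does not apply.
example : orientationConsistent .twoWay .notApplicable = true := rfl

end Sketch
```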
Equations
- Typology.instDecidableEqDemOrientationType x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- Typology.instReprDemOrientationType = { reprPrec := Typology.instReprDemOrientationType.repr }
Relationship between pronominal and adnominal demonstratives (WALS Ch 42, @cite{diessel-2013}). English uses the same forms; French uses different stems (ce/cette vs celui/celle); Turkish uses the same stems but different inflectional features.
- sameForms : DemFormRelation
Same forms for pronominal and adnominal use (e.g., English).
- differentStems : DemFormRelation
Different stems (e.g., French ce/celui).
- differentInflection : DemFormRelation
Same stems but different inflectional features (e.g., Turkish).
Equations
- Typology.instDecidableEqDemFormRelation x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- Typology.instReprDemFormRelation = { reprPrec := Typology.instReprDemFormRelation.repr }
Relationship between third-person pronouns and demonstratives (WALS Ch 43, @cite{bhat-2013}). In "two-person languages", 3rd-person pronouns are demonstrative-derived; in "three-person languages", 3rd-person pronouns form an independent person paradigm.
- unrelated : PronounDemRelation
Third-person pronouns are unrelated to demonstratives.
- relatedAll : PronounDemRelation
Related to all demonstratives.
- relatedRemote : PronounDemRelation
Related to remote demonstratives.
- relatedNonRemote : PronounDemRelation
Related to non-remote demonstratives.
- relatedGender : PronounDemRelation
Related by gender markers.
- relatedNonhuman : PronounDemRelation
Related for non-human reference.
Equations
- Typology.instDecidableEqPronounDemRelation x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- Typology.instReprPronounDemRelation = { reprPrec := Typology.instReprPronounDemRelation.repr }
Whether 3rd-person pronouns show ANY relationship to demonstratives (Bhat's "two-person" vs "three-person" distinction).
Equations
- Typology.PronounDemRelation.unrelated.isRelated = false
- x✝.isRelated = true
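The two equations above correspond to a match with one special case and a catch-all. A minimal self-contained sketch, assuming the six constructor names that appear in the WALS 43A conversion equations on this page (the `Sketch` namespace is hypothetical):

```lean
namespace Sketch

/-- Constructor names taken from the WALS 43A conversion equations. -/
inductive PronounDemRelation where
  | unrelated
  | relatedAll
  | relatedRemote
  | relatedNonRemote
  | relatedGender
  | relatedNonhuman
  deriving DecidableEq, Repr

/-- `false` only for `unrelated`; every other value counts as related. -/
def PronounDemRelation.isRelated : PronounDemRelation → Bool
  | .unrelated => false
  | _          => true

example : PronounDemRelation.unrelated.isRelated = false := rfl
example : PronounDemRelation.relatedGender.isRelated = true := rfl

end Sketch
```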
A language's article-and-demonstrative profile across @cite{wals-2013} Chs 37, 38, 41, 42, 43. WALS samples vary by chapter, so each field is optional.
- language : String
- family : String
- iso : String
ISO 639-3 code.
- defArticle : Option DefiniteArticleType
Ch 37: definite article type.
- indefArticle : Option IndefiniteArticleType
Ch 38: indefinite article type.
- demDistance : Option DemDistanceSystem
Ch 41: distance contrasts in demonstratives.
- demOrientation : Option DemOrientationType
Ch 41 subtype: distance- vs person-oriented (for three-way systems).
- demFormType : Option DemFormRelation
Ch 42: pronominal vs adnominal demonstrative form.
- pronDemRelation : Option PronounDemRelation
Ch 43: 3rd-person pronoun ~ demonstrative relationship.
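To illustrate how optional fields accommodate differing WALS samples, here is a trimmed-down sketch of a profile. It keeps only three of the six optional fields for brevity, lives in a hypothetical `Sketch` namespace, and uses the English values quoted in the docstrings on this page; `partialEntry` and its placeholder strings are invented purely for illustration.

```lean
namespace Sketch

inductive DefiniteArticleType where
  | definiteWord | definiteAffix | demonstrativeUsed | noDefButIndef | noArticle
  deriving DecidableEq, Repr

inductive IndefiniteArticleType where
  | indefiniteWord | numeralOne | indefiniteAffix | noIndefButDef | noArticle
  deriving DecidableEq, Repr

inductive DemDistanceSystem where
  | noContrast | twoWay | threeWay | fourWay | fiveOrMore
  deriving DecidableEq, Repr

/-- Trimmed-down profile: only three of the six optional fields. -/
structure ArticleDemProfile where
  language : String
  family : String
  iso : String
  defArticle : Option DefiniteArticleType
  indefArticle : Option IndefiniteArticleType
  demDistance : Option DemDistanceSystem
  deriving Repr

/-- English, using the example values quoted in the docstrings. -/
def english : ArticleDemProfile :=
  { language := "English",
    family := "Indo-European",
    iso := "eng",
    defArticle := some .definiteWord,      -- "the"
    indefArticle := some .indefiniteWord,  -- "a"
    demDistance := some .twoWay }          -- "this" / "that"

/-- Placeholder entry: a language absent from a chapter's sample
    leaves the corresponding field as `none`. -/
def partialEntry : ArticleDemProfile :=
  { language := "Example",
    family := "Unknown",
    iso := "xxx",
    defArticle := none,
    indefArticle := none,
    demDistance := none }

example : english.defArticle = some DefiniteArticleType.definiteWord := rfl
example : partialEntry.demDistance = none := rfl

end Sketch
```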
Equations
- Typology.instReprArticleDemProfile = { reprPrec := Typology.instReprArticleDemProfile.repr }
Does this language have any form of definite marking (word, affix, or demonstrative used as definite)?
Does this language have any indefinite article (word, numeral 'one', or affix)?
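Since the equations for these two predicates are not rendered on this page, the following is only a plausible sketch of their logic. The names `hasDefiniteMarking` and `hasIndefiniteArticle` are hypothetical, and the functions here take the optional field directly rather than a full profile; treating `none` as `false` is also an assumption.

```lean
namespace Sketch

inductive DefiniteArticleType where
  | definiteWord | definiteAffix | demonstrativeUsed | noDefButIndef | noArticle
  deriving DecidableEq, Repr

inductive IndefiniteArticleType where
  | indefiniteWord | numeralOne | indefiniteAffix | noIndefButDef | noArticle
  deriving DecidableEq, Repr

/-- Word, affix, or demonstrative-as-definite all count as definite marking.
    A missing value (`none`) is treated as "not known to have marking". -/
def hasDefiniteMarking : Option DefiniteArticleType → Bool
  | some .definiteWord | some .definiteAffix | some .demonstrativeUsed => true
  | _ => false

/-- Word, numeral 'one', or affix all count as an indefinite article. -/
def hasIndefiniteArticle : Option IndefiniteArticleType → Bool
  | some .indefiniteWord | some .numeralOne | some .indefiniteAffix => true
  | _ => false

example : hasDefiniteMarking (some .demonstrativeUsed) = true := rfl
example : hasDefiniteMarking (some .noDefButIndef) = false := rfl
example : hasIndefiniteArticle none = false := rfl

end Sketch
```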
Convert WALS 37A definite-article values into the substrate enum.
Equations
- Typology.fromWALS37A Data.WALS.F37A.DefiniteArticleType.definiteWordDistinctFromDemonstrative = Typology.DefiniteArticleType.definiteWord
- Typology.fromWALS37A Data.WALS.F37A.DefiniteArticleType.demonstrativeWordUsedAsDefiniteArticle = Typology.DefiniteArticleType.demonstrativeUsed
- Typology.fromWALS37A Data.WALS.F37A.DefiniteArticleType.definiteAffix = Typology.DefiniteArticleType.definiteAffix
- Typology.fromWALS37A Data.WALS.F37A.DefiniteArticleType.noDefiniteButIndefiniteArticle = Typology.DefiniteArticleType.noDefButIndef
- Typology.fromWALS37A Data.WALS.F37A.DefiniteArticleType.noDefiniteOrIndefiniteArticle = Typology.DefiniteArticleType.noArticle
Convert WALS 38A indefinite-article values into the substrate enum.
Equations
- Typology.fromWALS38A Data.WALS.F38A.IndefiniteArticleType.indefiniteWordDistinctFromOne = Typology.IndefiniteArticleType.indefiniteWord
- Typology.fromWALS38A Data.WALS.F38A.IndefiniteArticleType.indefiniteWordSameAsOne = Typology.IndefiniteArticleType.numeralOne
- Typology.fromWALS38A Data.WALS.F38A.IndefiniteArticleType.indefiniteAffix = Typology.IndefiniteArticleType.indefiniteAffix
- Typology.fromWALS38A Data.WALS.F38A.IndefiniteArticleType.noIndefiniteButDefiniteArticle = Typology.IndefiniteArticleType.noIndefButDef
- Typology.fromWALS38A Data.WALS.F38A.IndefiniteArticleType.noDefiniteOrIndefiniteArticle = Typology.IndefiniteArticleType.noArticle
Convert WALS 41A distance-contrast values into the substrate enum.
Equations
- Typology.fromWALS41A Data.WALS.F41A.DistanceContrastsInDemonstratives.noDistanceContrast = Typology.DemDistanceSystem.noContrast
- Typology.fromWALS41A Data.WALS.F41A.DistanceContrastsInDemonstratives.twoWayContrast = Typology.DemDistanceSystem.twoWay
- Typology.fromWALS41A Data.WALS.F41A.DistanceContrastsInDemonstratives.threeWayContrast = Typology.DemDistanceSystem.threeWay
- Typology.fromWALS41A Data.WALS.F41A.DistanceContrastsInDemonstratives.fourWayContrast = Typology.DemDistanceSystem.fourWay
- Typology.fromWALS41A Data.WALS.F41A.DistanceContrastsInDemonstratives.fiveWayContrast = Typology.DemDistanceSystem.fiveOrMore
Convert WALS 42A pronominal/adnominal-form values into the substrate enum.
Equations
- Typology.fromWALS42A Data.WALS.F42A.PronominalAndAdnominalDemonstratives.identical = Typology.DemFormRelation.sameForms
- Typology.fromWALS42A Data.WALS.F42A.PronominalAndAdnominalDemonstratives.differentStem = Typology.DemFormRelation.differentStems
- Typology.fromWALS42A Data.WALS.F42A.PronominalAndAdnominalDemonstratives.differentInflection = Typology.DemFormRelation.differentInflection
Convert WALS 43A 3rd-person-pronoun values into the substrate enum.
Equations
- Typology.fromWALS43A Data.WALS.F43A.ThirdPersonPronounsAndDemonstratives.unrelated = Typology.PronounDemRelation.unrelated
- Typology.fromWALS43A Data.WALS.F43A.ThirdPersonPronounsAndDemonstratives.relatedForAllDemonstratives = Typology.PronounDemRelation.relatedAll
- Typology.fromWALS43A Data.WALS.F43A.ThirdPersonPronounsAndDemonstratives.relatedToRemoteDemonstratives = Typology.PronounDemRelation.relatedRemote
- Typology.fromWALS43A Data.WALS.F43A.ThirdPersonPronounsAndDemonstratives.relatedToNonRemoteDemonstratives = Typology.PronounDemRelation.relatedNonRemote
- Typology.fromWALS43A Data.WALS.F43A.ThirdPersonPronounsAndDemonstratives.relatedByGenderMarkers = Typology.PronounDemRelation.relatedGender
- Typology.fromWALS43A Data.WALS.F43A.ThirdPersonPronounsAndDemonstratives.relatedForNonHumanReference = Typology.PronounDemRelation.relatedNonhuman
WALS Ch 37: definite article distribution (@cite{dryer-haspelmath-2013}, n = 566).
- definiteWord : Nat
- demonstrativeUsed : Nat
- definiteAffix : Nat
- noDefButIndef : Nat
- noArticle : Nat
Equations
- c.total = c.definiteWord + c.demonstrativeUsed + c.definiteAffix + c.noDefButIndef + c.noArticle
WALS Ch 37 counts (566 languages).
Equations
- Typology.walsDefiniteArticle = { definiteWord := 197, demonstrativeUsed := 56, definiteAffix := 84, noDefButIndef := 41, noArticle := 188 }
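The stated sample size can be checked against the five counts. A minimal sketch, assuming a structure shaped like the field list above (the name `DefiniteArticleCounts` and the `Sketch` namespace are hypothetical; the page does not show the structure's actual name):

```lean
namespace Sketch

structure DefiniteArticleCounts where
  definiteWord : Nat
  demonstrativeUsed : Nat
  definiteAffix : Nat
  noDefButIndef : Nat
  noArticle : Nat
  deriving Repr

def DefiniteArticleCounts.total (c : DefiniteArticleCounts) : Nat :=
  c.definiteWord + c.demonstrativeUsed + c.definiteAffix
    + c.noDefButIndef + c.noArticle

def walsDefiniteArticle : DefiniteArticleCounts :=
  { definiteWord := 197, demonstrativeUsed := 56, definiteAffix := 84,
    noDefButIndef := 41, noArticle := 188 }

-- 197 + 56 + 84 + 41 + 188 = 566, matching the stated n.
example : walsDefiniteArticle.total = 566 := rfl

end Sketch
```

Stating the total as a checked `example` means any later edit to one count that breaks the sum fails at compile time.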
WALS Ch 38: indefinite article distribution (@cite{dryer-haspelmath-2013}, n = 473).
- indefiniteWord : Nat
- numeralOne : Nat
- indefiniteAffix : Nat
- noIndefButDef : Nat
- noArticle : Nat
Equations
- c.total = c.indefiniteWord + c.numeralOne + c.indefiniteAffix + c.noIndefButDef + c.noArticle
WALS Ch 38 counts (473 languages).
Equations
- Typology.walsIndefiniteArticle = { indefiniteWord := 91, numeralOne := 90, indefiniteAffix := 23, noIndefButDef := 81, noArticle := 188 }
WALS Ch 41: demonstrative distance contrasts (@cite{diessel-2013}, n = 234).
- noContrast : Nat
- twoWay : Nat
- threeWay : Nat
- fourWay : Nat
- fiveOrMore : Nat
Equations
- Typology.instReprDemDistanceCounts = { reprPrec := Typology.instReprDemDistanceCounts.repr }
Equations
- c.total = c.noContrast + c.twoWay + c.threeWay + c.fourWay + c.fiveOrMore
WALS Ch 41 counts (234 languages).
Equations
- Typology.walsDemDistance = { noContrast := 7, twoWay := 127, threeWay := 88, fourWay := 8, fiveOrMore := 4 }
WALS Ch 42: pronominal/adnominal demonstrative form (@cite{diessel-2013}, n = 201).
- sameForms : Nat
- differentStems : Nat
- differentInflection : Nat
Equations
- Typology.instReprDemFormCounts = { reprPrec := Typology.instReprDemFormCounts.repr }
Equations
- c.total = c.sameForms + c.differentStems + c.differentInflection
WALS Ch 42 counts (201 languages).
Equations
- Typology.walsDemForm = { sameForms := 143, differentStems := 37, differentInflection := 21 }
WALS Ch 43: 3rd-person pronoun ~ demonstrative relationship (@cite{bhat-2013}, n = 225).
- unrelated : Nat
- relatedAll : Nat
- relatedRemote : Nat
- relatedNonRemote : Nat
- relatedGender : Nat
- relatedNonhuman : Nat
Equations
- Typology.instReprPronounDemCounts = { reprPrec := Typology.instReprPronounDemCounts.repr }
Equations
- c.total = c.unrelated + c.relatedAll + c.relatedRemote + c.relatedNonRemote + c.relatedGender + c.relatedNonhuman
Total count of languages where 3rd-person pronouns show any relationship to demonstratives.
Equations
- c.totalRelated = c.relatedAll + c.relatedRemote + c.relatedNonRemote + c.relatedGender + c.relatedNonhuman
WALS Ch 43 counts (225 languages).
Equations
- Typology.walsPronounDem = { unrelated := 100, relatedAll := 52, relatedRemote := 18, relatedNonRemote := 14, relatedGender := 24, relatedNonhuman := 17 }
Two-way demonstrative systems (proximal/distal) are the most common type: 127 of 234 languages (54.3%). @cite{diessel-2013}: "the vast majority of the world's languages employ two or three distance-marked demonstratives".
Two-way and three-way systems together account for over 90% of languages (215 of 234 = 91.9%); no-contrast, four-way, and five-or-more systems together are under 10% (19 of 234 = 8.1%).
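These percentage claims can be stated without division by cross-multiplying over natural numbers. A sketch under the assumption that the counts structure looks like the Ch 41 field list on this page (`DemDistanceCounts` and the `Sketch` namespace are hypothetical names):

```lean
namespace Sketch

structure DemDistanceCounts where
  noContrast : Nat
  twoWay : Nat
  threeWay : Nat
  fourWay : Nat
  fiveOrMore : Nat

def DemDistanceCounts.total (c : DemDistanceCounts) : Nat :=
  c.noContrast + c.twoWay + c.threeWay + c.fourWay + c.fiveOrMore

def walsDemDistance : DemDistanceCounts :=
  { noContrast := 7, twoWay := 127, threeWay := 88, fourWay := 8, fiveOrMore := 4 }

-- Two-way systems are an absolute majority: 2 * 127 = 254 > 234.
example : 2 * walsDemDistance.twoWay > walsDemDistance.total := by decide

-- Two-way plus three-way exceed 90%: 100 * 215 = 21500 > 90 * 234 = 21060.
example :
    100 * (walsDemDistance.twoWay + walsDemDistance.threeWay)
      > 90 * walsDemDistance.total := by decide

-- The other three types stay under 10%: 100 * 19 = 1900 < 10 * 234 = 2340.
example :
    100 * (walsDemDistance.noContrast + walsDemDistance.fourWay
      + walsDemDistance.fiveOrMore) < 10 * walsDemDistance.total := by decide

end Sketch
```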
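Languages with definite articles tend to also have indefinite articles, but the two categories are asymmetric: 81 languages have a definite but no indefinite article (Ch 38, n = 473), vs. 41 languages with an indefinite but no definite article (Ch 37, n = 566). The two counts come from different chapter samples, so the comparison is indicative rather than exact; it is consistent with definiteness marking being the typologically prior category.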
Languages with some form of definite marking (word, affix, or demonstrative) outnumber those without. 337 of 566 languages (59.5%) have definite marking.
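The arithmetic behind the two generalizations above can be written as small checked facts. The definitions below are a hypothetical restatement of the counts quoted on this page as plain `Nat` constants; note that the 81 and 41 figures come from different chapter samples.

```lean
namespace Sketch

-- Ch 37 counts (n = 566) as quoted on this page.
def definiteWord : Nat := 197
def demonstrativeUsed : Nat := 56
def definiteAffix : Nat := 84
def noDefButIndef : Nat := 41   -- indefinite article only
def ch37Total : Nat := 566

-- Ch 38 count (n = 473): definite article only.
def noIndefButDef : Nat := 81

/-- Languages with any definite marking: word, demonstrative, or affix. -/
def definiteMarked : Nat := definiteWord + demonstrativeUsed + definiteAffix

example : definiteMarked = 337 := rfl
-- 337 of 566 is a majority: 2 * 337 = 674 > 566.
example : 2 * definiteMarked > ch37Total := by decide
-- Definite-only languages outnumber indefinite-only ones: 81 > 41.
example : noIndefButDef > noDefButIndef := by decide

end Sketch
```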
In most languages (143 of 201 = 71.1%), pronominal and adnominal demonstratives have the same forms (@cite{diessel-2013}). Differential marking via different stems (37) or different inflection (21) is the minority.
The grammaticalization hierarchy for definiteness marking (@cite{greenberg-1978}, @cite{himmelmann-1997}):
- Stage 0: No definiteness marking (bare nouns; e.g., Mandarin, Russian)
- Stage 1: Demonstrative used for definiteness (e.g., Swahili, Ojibwa)
- Stage 2: Definite word distinct from demonstrative (e.g., English)
- Stage 3: Definite affix (e.g., Danish, Arabic)
Each stage represents further grammaticalization: phonological reduction, semantic bleaching (loss of deictic content), and increased obligatoriness.
- noMarking : GrammaticalizationStage
- demonstrative : GrammaticalizationStage
- definiteWord : GrammaticalizationStage
- definiteAffix : GrammaticalizationStage
Equations
- Typology.instDecidableEqGrammaticalizationStage x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Map a DefiniteArticleType to its grammaticalization stage.
Equations
- Typology.DefiniteArticleType.noArticle.stage = Typology.GrammaticalizationStage.noMarking
- Typology.DefiniteArticleType.noDefButIndef.stage = Typology.GrammaticalizationStage.noMarking
- Typology.DefiniteArticleType.demonstrativeUsed.stage = Typology.GrammaticalizationStage.demonstrative
- Typology.DefiniteArticleType.definiteWord.stage = Typology.GrammaticalizationStage.definiteWord
- Typology.DefiniteArticleType.definiteAffix.stage = Typology.GrammaticalizationStage.definiteAffix
Numeric rank for the grammaticalization stage (higher = more grammaticalized).
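The rank equations are not rendered on this page, so the sketch below assumes the natural numbering 0 to 3 that follows the Stage 0 to Stage 3 labels. The `stage` function mirrors the equations listed for it; the names `stage` and `rank` and the `Sketch` namespace are placed here for illustration only.

```lean
namespace Sketch

inductive DefiniteArticleType where
  | definiteWord | definiteAffix | demonstrativeUsed | noDefButIndef | noArticle
  deriving DecidableEq, Repr

inductive GrammaticalizationStage where
  | noMarking | demonstrative | definiteWord | definiteAffix
  deriving DecidableEq, Repr

/-- Mirrors the `stage` equations listed on this page. -/
def DefiniteArticleType.stage : DefiniteArticleType → GrammaticalizationStage
  | .noArticle | .noDefButIndef => .noMarking
  | .demonstrativeUsed => .demonstrative
  | .definiteWord => .definiteWord
  | .definiteAffix => .definiteAffix

/-- Assumed ranks 0 to 3, following the Stage 0 to Stage 3 numbering. -/
def GrammaticalizationStage.rank : GrammaticalizationStage → Nat
  | .noMarking => 0
  | .demonstrative => 1
  | .definiteWord => 2
  | .definiteAffix => 3

-- An affixal article ranks above a free definite word on the cline.
example :
    DefiniteArticleType.definiteWord.stage.rank
      < DefiniteArticleType.definiteAffix.stage.rank := by decide

-- Both article-less types collapse to rank 0.
example : DefiniteArticleType.noDefButIndef.stage.rank = 0 := rfl

end Sketch
```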
All three marked stages of the grammaticalization cline (Stages 1 to 3) are well attested cross-linguistically: 56 languages use demonstratives as definite markers (Stage 1), 197 have distinct definite words (Stage 2), and 84 have definite affixes (Stage 3). The transitional Stage 1 is smaller than both later stages.