Documentation

Linglib.Typology.Reference

Article-and-demonstrative typology — substrate types and WALS data #

@cite{wals-2013} (Chs 37, 38, 41, 42, 43) @cite{bhat-2013} @cite{diessel-2013} @cite{dryer-haspelmath-2013} @cite{greenberg-1978} @cite{himmelmann-1997}

Type-level enums + per-language profile struct for definiteness marking, indefinite articles, and demonstrative systems across @cite{wals-2013} chapters 37, 38, 41, 42, 43, plus WALS distribution data, the principal cross-linguistic generalizations, and the demonstrative→article→affix grammaticalization cline.

Schema #

Per-language data lives in Fragments/{Lang}/Reference.lean.

Definite article type (WALS Ch 37, @cite{dryer-haspelmath-2013}). Categories ordered along the grammaticalization cline: demonstrative → definite word → definite affix.

  • definiteWord : DefiniteArticleType

    Definite word distinct from demonstratives (e.g., English the).

  • definiteAffix : DefiniteArticleType

    Definite affix on the noun (e.g., Danish -en, Arabic al-).

  • demonstrativeUsed : DefiniteArticleType

    No dedicated definite article; a demonstrative is used for definiteness (e.g., Ojibwa, Swahili).

  • noDefButIndef : DefiniteArticleType

    No definite article, but language has an indefinite article.

  • noArticle : DefiniteArticleType

    Neither definite nor indefinite article.

Instances For
    @[implicit_reducible]
    Equations
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      Indefinite article type (WALS Ch 38, @cite{dryer-haspelmath-2013}). Languages either have a dedicated indefinite word, use the numeral 'one' as an indefinite (the most common grammaticalization path), have an indefinite affix, or lack indefinite articles entirely.

      Instances For
        @[implicit_reducible]
        Equations
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For

          Number of distance contrasts in adnominal demonstratives (WALS Ch 41, @cite{diessel-2013}). Two-way systems are by far the most common (54.3%), followed by three-way (37.6%).

          Instances For
            @[implicit_reducible]
            Equations
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For

              Whether a three-way demonstrative system is distance- or person-oriented. @cite{diessel-2013}: about 2/3 of three-way systems are distance-oriented, 1/3 person-oriented (e.g., Japanese is the canonical person-oriented system).

              • distanceOriented : DemOrientationType

                All terms encode distance from speaker (proximal/medial/distal).

              • personOriented : DemOrientationType

                One term encodes proximity to the hearer (near-speaker / near-hearer / distal).

              • notApplicable : DemOrientationType

                Not applicable (system is not three-way).

              Instances For
                @[implicit_reducible]
                Equations
                Equations
                • One or more equations did not get rendered due to their size.
                Instances For

                  Relationship between pronominal and adnominal demonstratives (WALS Ch 42, @cite{diessel-2013}). English uses the same forms; French uses different stems (ce/cette vs celui/celle); Turkish uses the same stems but different inflectional features.

                  • sameForms : DemFormRelation

                    Same forms for pronominal and adnominal use (e.g., English).

                  • differentStems : DemFormRelation

                    Different stems (e.g., French ce/celui).

                  • differentInflection : DemFormRelation

                    Same stems but different inflectional features (e.g., Turkish).

                  Instances For
                    @[implicit_reducible]
                    Equations
                    Equations
                    • One or more equations did not get rendered due to their size.
                    Instances For

                      Relationship between third-person pronouns and demonstratives (WALS Ch 43, @cite{bhat-2013}). In "two-person languages", 3rd-person pronouns are demonstrative-derived; in "three-person languages", 3rd-person pronouns form an independent person paradigm.

                      • unrelated : PronounDemRelation

                        3rd-person pronouns unrelated to demonstratives (e.g., Ainu, Polish).

                      • relatedAll : PronounDemRelation

                        Related to all demonstratives (e.g., Basque).

                      • relatedRemote : PronounDemRelation

                        Related specifically to remote/distal demonstratives (e.g., Eastern Armenian: 3sg na = distal na).

                      • relatedNonRemote : PronounDemRelation

                        Related specifically to non-remote (proximal/medial) demonstratives (e.g., Brahui).

                      • relatedGender : PronounDemRelation

                        Related via shared gender/noun-class markers (e.g., Venda).

                      • relatedNonhuman : PronounDemRelation

                        Demonstratives used as 3rd-person pronouns for nonhuman reference only (e.g., Jaqaru).

                      Instances For
                        @[implicit_reducible]
                        Equations
                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For

                          Whether 3rd-person pronouns show ANY relationship to demonstratives (Bhat's "two-person" vs "three-person" distinction).

                          Equations
                          Instances For

                            A language's article-and-demonstrative profile across @cite{wals-2013} Chs 37, 38, 41, 42, 43. WALS samples vary by chapter, so each field is optional.

                            • language : String
                            • family : String
                            • iso : String

                              ISO 639-3 code.

                            • defArticle : Option DefiniteArticleType

                              Ch 37: definite article type.

                            • indefArticle : Option IndefiniteArticleType

                              Ch 38: indefinite article type.

                            • demDistance : Option DemDistanceSystem

                              Ch 41: distance contrasts in demonstratives.

                            • demOrientation : Option DemOrientationType

                              Ch 41 subtype: distance- vs person-oriented (for three-way systems).

                            • demFormType : Option DemFormRelation

                              Ch 42: pronominal vs adnominal demonstrative form.

                            • pronDemRelation : Option PronounDemRelation

                              Ch 43: 3rd-person pronoun ~ demonstrative relationship.

                            Instances For
                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For

                                Does this language have any form of definite marking (word, affix, or demonstrative used as definite)?

                                Equations
                                • One or more equations did not get rendered due to their size.
                                Instances For

                                  Does this language have any indefinite article (word, numeral 'one', or affix)?

                                  Equations
                                  • One or more equations did not get rendered due to their size.
                                  Instances For

                                    WALS Ch 37: definite article distribution (@cite{dryer-haspelmath-2013}, n = 566).

                                    • definiteWord : Nat
                                    • demonstrativeUsed : Nat
                                    • definiteAffix : Nat
                                    • noDefButIndef : Nat
                                    • noArticle : Nat
                                    Instances For
                                      Equations
                                      • One or more equations did not get rendered due to their size.
                                      Instances For

                                        WALS Ch 37 counts (566 languages).

                                        Equations
                                        Instances For

                                          WALS Ch 38: indefinite article distribution (@cite{dryer-haspelmath-2013}, n = 473).

                                          • indefiniteWord : Nat
                                          • numeralOne : Nat
                                          • indefiniteAffix : Nat
                                          • noIndefButDef : Nat
                                          • noArticle : Nat
                                          Instances For
                                            Equations
                                            • One or more equations did not get rendered due to their size.
                                            Instances For

                                              WALS Ch 38 counts (473 languages).

                                              Equations
                                              Instances For

                                                WALS Ch 41: demonstrative distance contrasts (@cite{diessel-2013}, n = 234).

                                                • noContrast : Nat
                                                • twoWay : Nat
                                                • threeWay : Nat
                                                • fourWay : Nat
                                                • fiveOrMore : Nat
                                                Instances For
                                                  Equations
                                                  • One or more equations did not get rendered due to their size.
                                                  Instances For

                                                    WALS Ch 41 counts (234 languages).

                                                    Equations
                                                    Instances For

                                                      WALS Ch 42: pronominal/adnominal demonstrative form (@cite{diessel-2013}, n = 201).

                                                      • sameForms : Nat
                                                      • differentStems : Nat
                                                      • differentInflection : Nat
                                                      Instances For
                                                        Equations
                                                        • One or more equations did not get rendered due to their size.
                                                        Instances For

                                                          WALS Ch 42 counts (201 languages).

                                                          Equations
                                                          Instances For

                                                            WALS Ch 43: 3rd-person pronoun ~ demonstrative relationship (@cite{bhat-2013}, n = 225).

                                                            • unrelated : Nat
                                                            • relatedAll : Nat
                                                            • relatedRemote : Nat
                                                            • relatedNonRemote : Nat
                                                            • relatedGender : Nat
                                                            • relatedNonhuman : Nat
                                                            Instances For
                                                              Equations
                                                              • One or more equations did not get rendered due to their size.
                                                              Instances For

                                                                Total count of languages where 3rd-person pronouns show any relationship to demonstratives.

                                                                Equations
                                                                Instances For

                                                                  WALS Ch 43 counts (225 languages).

                                                                  Equations
                                                                  • Typology.walsPronounDem = { unrelated := 100, relatedAll := 52, relatedRemote := 18, relatedNonRemote := 14, relatedGender := 24, relatedNonhuman := 17 }
                                                                  Instances For

                                                                    Two-way demonstrative systems (proximal/distal) are the most common type: 127 of 234 languages (54.3%). @cite{diessel-2013}: "the vast majority of the world's languages employ two or three distance-marked demonstratives".

                                                                    Two-way and three-way systems together account for over 90% of languages; one-way, four-way, and five-or-more systems together are under 10%.

                                                                    Languages with definite articles tend to also have indefinite articles. The asymmetry: 81 languages have a definite but no indefinite article, vs. 41 languages with indefinite but no definite article. Definiteness marking is the typologically prior category.

                                                                    Languages with some form of definite marking (word, affix, or demonstrative) outnumber those without. 337 of 566 languages (59.5%) have definite marking.

                                                                    The most common subtype of pronoun-demonstrative relationship is "related to all demonstratives" (52 languages), where any demonstrative can serve as a 3rd-person pronoun.

                                                                    In most languages (143 of 201 = 71.1%), pronominal and adnominal demonstratives have the same forms (@cite{diessel-2013}). Differential marking via different stems (37) or different inflection (21) is the minority.

                                                                    The grammaticalization hierarchy for definiteness marking (@cite{greenberg-1978}, @cite{himmelmann-1997}):

                                                                    Stage 0 — No definiteness marking (bare nouns; e.g., Mandarin, Russian) Stage 1 — Demonstrative used for definiteness (e.g., Swahili, Ojibwa) Stage 2 — Definite word distinct from demonstrative (e.g., English) Stage 3 — Definite affix (e.g., Danish, Arabic)

                                                                    Each stage represents further grammaticalization: phonological reduction, semantic bleaching (loss of deictic content), and increased obligatoriness.

                                                                    Instances For
                                                                      @[implicit_reducible]
                                                                      Equations
                                                                      Equations
                                                                      • One or more equations did not get rendered due to their size.
                                                                      Instances For

                                                                        All three intermediate stages of the grammaticalization cline are well attested cross-linguistically: 56 languages use demonstratives as definite markers (Stage 1), 197 have distinct definite words (Stage 2), and 84 have definite affixes (Stage 3). The transitional Stage 1 is smaller than both later stages.