Documentation

Linglib.Typology.Numerals

Numeral typology — substrate types and WALS data #

@cite{wals-2013} (Chs 53, 54, 55, 56, 131) @cite{aikhenvald-2000} @cite{greenberg-1978} @cite{stolz-veselinova-2013}

Type-level enums + per-language profile struct for numeral systems across @cite{wals-2013} chapters 53–56 (Gil, Comrie) and 131: ordinal formation, distributive numerals, numeral classifiers, conjunction-quantifier identity, numeral base. Plus WALS distribution data, the principal cross-linguistic generalizations, and the Greenberg suppletion hierarchy for ordinal formation.

ClassifierStatus (WALS Ch 55) and fromWALS55A live in Typology/ClassifierSystem.lean and are re-imported here.

Schema #

Per-language data lives in Fragments/{Lang}/Numerals.lean.

WALS Ch 53: how a language forms ordinal numerals from cardinals. The dominant pattern is "first" suppletive + higher ordinals regular.

  • firstSuppletion : OrdinalFormation

    "first" is suppletive, "second" onward regular (e.g., English).

  • firstSecondSuppletion : OrdinalFormation

    "first" and "second" suppletive, "third" onward regular.

  • allFromCardinals : OrdinalFormation

    All ordinals derived regularly from cardinals.

  • various : OrdinalFormation

    Mixed strategies, no single dominant pattern.

  • noOrdinals : OrdinalFormation

    No productive ordinal formation reported.

Instances For
    @[implicit_reducible]
    Equations
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      WALS Ch 54: whether and how a language marks distributive numerals ("N each" / "N apiece"). Reduplication is the most widespread dedicated strategy.

      Instances For
        @[implicit_reducible]
        Equations
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For

          WALS Ch 56 (Gil): relationship between 'and' and 'all/every'. Identity reflects a deep connection between conjunction (exhaustive pairing) and universal quantification (exhaustive predication).

          Instances For
            @[implicit_reducible]
            Equations
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For

              Areal region — used for areal generalizations about classifier distribution (e.g., Sanches-Slobin's classifier-belt observation).

              Instances For
                @[implicit_reducible]
                Equations
                @[implicit_reducible]
                Equations
                def Typology.instReprRegion.repr :
                RegionNatStd.Format
                Equations
                Instances For

                  Whether a language has obligatory grammatical plural marking on common nouns. Used for the Sanches-Slobin generalization relating numeral classifiers and plural.

                  • obligatory : PluralMarking

                    Plural marking required (e.g., English, Spanish).

                  • optional : PluralMarking

                    Plural marking available but not required (e.g., Korean).

                  • none : PluralMarking

                    No grammatical plural on nouns (e.g., Mandarin, Japanese).

                  Instances For
                    @[implicit_reducible]
                    Equations
                    Equations
                    • One or more equations did not get rendered due to their size.
                    Instances For

                      WALS Ch 131 (Comrie): the base of a language's numeral system. Most languages use a decimal (base-10) system.

                      • decimal : NumeralBase

                        Base 10 (e.g., English, Mandarin, Swahili).

                      • vigesimal : NumeralBase

                        Pure base 20 (e.g., Ainu, Chukchi).

                      • hybridVigesimalDecimal : NumeralBase

                        Mixed base-20 / base-10 (e.g., French, Basque, Georgian).

                      • otherBase : NumeralBase

                        Base 5, 6, or other (rare).

                      • bodyPartSystem : NumeralBase

                        Extended body-part counting system (e.g., Eipo).

                      • restricted : NumeralBase

                        Restricted numeral system (few numerals, no productive base).

                      Instances For
                        @[implicit_reducible]
                        Equations
                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For

                          A language's numeral typology profile across all four WALS dimensions

                          • areal region + plural-marking status.
                          • language : String
                          • iso : String

                            ISO 639-3 code.

                          • Ch 53: ordinal numeral formation.

                          • distributive : DistributiveNumeral

                            Ch 54: distributive numeral marking.

                          • classifier : ClassifierStatus

                            Ch 55: numeral classifier status.

                          • Ch 56: conjunction-quantifier relationship.

                          • region : Region

                            Areal region (for areal generalizations).

                          • pluralMarking : PluralMarking

                            Plural marking on common nouns (for Sanches-Slobin).

                          • numeralBase : Option NumeralBase

                            Ch 131: numeral base (optional; not all languages surveyed).

                          Instances For
                            Equations
                            • One or more equations did not get rendered due to their size.
                            Instances For
                              def Typology.instDecidableEqNumeralProfile.decEq (x✝ x✝¹ : NumeralProfile) :
                              Decidable (x✝ = x✝¹)
                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For

                                Does a language have obligatory numeral classifiers?

                                Equations
                                Instances For

                                  Does a language have any numeral classifiers (obligatory or optional)?

                                  Equations
                                  Instances For

                                    Does a language have obligatory plural marking on common nouns?

                                    Equations
                                    Instances For

                                      Does a language have a morphological distributive numeral form?

                                      Equations
                                      Instances For

                                        Is a language in the East/Southeast Asian region?

                                        Equations
                                        Instances For

                                          WALS Chapter 53 distribution: language counts per ordinal formation type. Total: 321 languages.

                                          • firstSuppletion : Nat
                                          • firstSecondSuppletion : Nat
                                          • allFromCardinals : Nat
                                          • various : Nat
                                          • noOrdinals : Nat
                                          Instances For
                                            Equations
                                            • One or more equations did not get rendered due to their size.
                                            Instances For

                                              WALS Ch 53 counts (321 languages).

                                              Equations
                                              • Typology.ch53Distribution = { firstSuppletion := 99, firstSecondSuppletion := 45, allFromCardinals := 28, various := 83, noOrdinals := 66 }
                                              Instances For

                                                WALS Chapter 54 distribution: language counts per distributive type. Total: 251 languages.

                                                • noDistributive : Nat
                                                • reduplication : Nat
                                                • suffixCount : Nat
                                                • prefixCount : Nat
                                                • otherMeans : Nat
                                                Instances For
                                                  Equations
                                                  • One or more equations did not get rendered due to their size.
                                                  Instances For

                                                    WALS Ch 54 counts (251 languages).

                                                    Equations
                                                    Instances For

                                                      WALS Chapter 56 distribution: language counts per conjunction-quantifier type. Total: 220 languages.

                                                      • identity : Nat
                                                      • differentiation : Nat
                                                      Instances For
                                                        Equations
                                                        • One or more equations did not get rendered due to their size.
                                                        Instances For

                                                          WALS Ch 56 counts (220 languages).

                                                          Equations
                                                          Instances For

                                                            Suppletive "first" is the dominant ordinal formation strategy (WALS Ch 53). Languages with suppletive "first" (alone or with suppletive "second") outnumber languages where all ordinals derive regularly from cardinals.

                                                            Languages with some form of ordinal formation (regular or suppletive) outnumber languages lacking ordinals entirely.

                                                            Languages with dedicated distributive numeral forms outnumber those without, but neither is a negligible minority.

                                                            Differentiation between 'and' and 'all' is the dominant pattern (WALS Ch 56).

                                                            Differentiation accounts for more than three-quarters of the sample.

                                                            Identity between 'and' and 'all' is a non-negligible minority pattern, attested in roughly a fifth of languages (43 out of 220).

                                                            @cite{greenberg-1978}'s implicational universal for ordinal suppletion: if a language has a suppletive ordinal for numeral N, then it has suppletive ordinals for all numerals less than N. Equivalently: suppletion cuts off at some point in the sequence 1st, 2nd, 3rd,… and all ordinals above the cutoff are regular.

                                                            The WALS data captures the coarsest version: suppletion is most likely for "first", less likely for "second", and rare beyond that.

                                                            Instances For
                                                              @[implicit_reducible]
                                                              Equations
                                                              Equations
                                                              • One or more equations did not get rendered due to their size.
                                                              Instances For

                                                                Numeric rank for the suppletion cutoff (higher = more suppletion).

                                                                Equations
                                                                Instances For

                                                                  WALS aggregate confirms the hierarchy: languages with "first"-only suppletion outnumber those with "first+second" suppletion, which in turn outnumber those with no suppletion at all. This reflects the implicational scale: suppletion at higher numerals is rarer.