Documentation

Linglib.Phenomena.Causation.Studies.BeaversEtAl2021

Cross-Linguistic Typology of Change-of-State Verbs #

@cite{dixon-1982} @cite{levin-1993} @cite{beavers-etal-2021} @cite{rose-nichols-2021} @cite{coon-2019}

Empirical data from Beavers, Everdell, Jerro, Kauhanen, @cite{beavers-etal-2021} "States and changes of state: A crosslinguistic study of the roots of verbal meaning." Language 97(3), 439–484.

88-language balanced sample (WALS 100 + additions). For each of 36 property concept (PC) and 36 result root meanings, the authors collected five-item paradigms (underlying root, simple stative, inchoative, causative, result stative) and coded morphological relationships.

Key findings (theory-neutral) #

  1. Simple statives: PC roots overwhelmingly have simple stative forms (median = 95.67% of languages); result roots overwhelmingly lack them (median = 1.59%). Mann-Whitney U = 1266.5, p < 0.001.

  2. Verbal markedness: PC root verbs tend to be marked (median = 56.01%); result root verbs tend to be unmarked (median = 15.20%). Mann-Whitney U = 1291, p < 0.001.

  3. Subclass clustering: PC subclasses (dimension, color, value, physical property, speed) cluster near 100% simple statives. Result subclasses (breaking, cooking, killing, destroying, directed motion) cluster near 0%.

Two classes of change-of-state verb roots, defined by morphological and semantic diagnostics (@cite{beavers-etal-2021} §3.1).

Classification criteria:

  • PC roots: the root of deadjectival CoS verbs (@cite{levin-1993}:245); describe @cite{dixon-1982}'s basic property types
  • Result roots: the root of non-deadjectival CoS verbs; describe specific result states (physical damage, cooking, etc.)
Instances For
    @[implicit_reducible]
    Equations
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      PC subclasses (@cite{dixon-1982}; @cite{beavers-etal-2021} ex. 5).

      Instances For
        @[implicit_reducible]
        Equations
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For

          Result root subclasses (@cite{levin-1993}; @cite{beavers-etal-2021} ex. 6).

          Instances For
            @[implicit_reducible]
            Equations
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For

              The five positions in a CoS verb paradigm (@cite{beavers-etal-2021} eq. 40). Every root meaning is associated with (up to) five forms.

              Instances For
                @[implicit_reducible]
                Equations
                Equations
                • One or more equations did not get rendered due to their size.
                Instances For

                  Morphological relationship codes between forms (@cite{beavers-etal-2021} eq. 41, generalizing @cite{haspelmath-1993}:90–92).

                  Instances For
                    @[implicit_reducible]
                    Equations
                    Equations
                    • One or more equations did not get rendered due to their size.
                    Instances For

                      A root meaning with its crosslinguistic attestation.

                      • gloss : String

                        English gloss(es)

                      • rootClass : CoSRootClass

                        PC or result root

                      • subclass : Option (PCSubclass ResultSubclass)

                        Subclass, if applicable

                      • nSimpleStative :

                        Number of languages with a simple stative for this root (Table A1)

                      • nLanguages :

                        Number of languages with any data for this root (Table A1)

                      • nMarkedVerbal :

                        Number of languages with a marked verbal paradigm (Table A2)

                      • nVerbalLanguages :

                        Number of languages with verbal paradigm data (Table A2)

                      Instances For
                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For

                          Percentage of languages with simple stative (exact ℚ — kernel-decidable).

                          Equations
                          Instances For

                            Percentage of languages with marked verbal paradigm (exact ℚ).

                            Equations
                            Instances For
                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For
                                Equations
                                • One or more equations did not get rendered due to their size.
                                Instances For
                                  Equations
                                  • One or more equations did not get rendered due to their size.
                                  Instances For
                                    Equations
                                    • One or more equations did not get rendered due to their size.
                                    Instances For
                                      Equations
                                      • One or more equations did not get rendered due to their size.
                                      Instances For
                                        Equations
                                        • One or more equations did not get rendered due to their size.
                                        Instances For
                                          Equations
                                          • One or more equations did not get rendered due to their size.
                                          Instances For
                                            Equations
                                            • One or more equations did not get rendered due to their size.
                                            Instances For
                                              Equations
                                              • One or more equations did not get rendered due to their size.
                                              Instances For
                                                Equations
                                                • One or more equations did not get rendered due to their size.
                                                Instances For
                                                  Equations
                                                  • One or more equations did not get rendered due to their size.
                                                  Instances For
                                                    Equations
                                                    • One or more equations did not get rendered due to their size.
                                                    Instances For
                                                      Equations
                                                      • One or more equations did not get rendered due to their size.
                                                      Instances For
                                                        Equations
                                                        • One or more equations did not get rendered due to their size.
                                                        Instances For
                                                          Equations
                                                          • One or more equations did not get rendered due to their size.
                                                          Instances For
                                                            Equations
                                                            • One or more equations did not get rendered due to their size.
                                                            Instances For
                                                              Equations
                                                              • One or more equations did not get rendered due to their size.
                                                              Instances For
                                                                Equations
                                                                • One or more equations did not get rendered due to their size.
                                                                Instances For
                                                                  Equations
                                                                  • One or more equations did not get rendered due to their size.
                                                                  Instances For
                                                                    Equations
                                                                    • One or more equations did not get rendered due to their size.
                                                                    Instances For

                                                                      Summary of a crosslinguistic comparison between PC and result roots. Numeric fields are exact ℚ (mathlib idiom for kernel-decidable arithmetic; @cite{beavers-etal-2021} reports values to 2 decimal places).

                                                                      • measure : String

                                                                        What is being measured

                                                                      • pcMedian :

                                                                        PC root median percentage

                                                                      • resultMedian :

                                                                        Result root median percentage

                                                                      • uStatistic :

                                                                        Mann-Whitney U statistic

                                                                      • pThreshold :

                                                                        One-tailed p-value threshold

                                                                      • nPC :

                                                                        Sample sizes (PC roots, result roots)

                                                                      • nResult :
                                                                      Instances For
                                                                        Equations
                                                                        • One or more equations did not get rendered due to their size.
                                                                        Instances For

                                                                          Simple stative form comparison (§6, Fig. 1). Medians: 95.67% and 1.59% encoded as exact rationals via OfScientific.

                                                                          Equations
                                                                          • One or more equations did not get rendered due to their size.
                                                                          Instances For

                                                                            Verbal markedness comparison (§7, Fig. 5). Medians: 56.01% and 15.20% encoded as exact rationals via OfScientific.

                                                                            Equations
                                                                            • One or more equations did not get rendered due to their size.
                                                                            Instances For

                                                                              Empirical diagnostics for classifying roots. Each diagnostic independently sorts roots into two classes that align with the PC vs result distinction.

                                                                              Instances For
                                                                                @[implicit_reducible]
                                                                                Equations
                                                                                Equations
                                                                                • One or more equations did not get rendered due to their size.
                                                                                Instances For

                                                                                  Stative form + change denial test (§3.3, ex. 10–11). "The bright photo has never brightened" → OK (PC) "#The shattered vase has never shattered" → contradictory (result)

                                                                                  Equations
                                                                                  Instances For

                                                                                    Restitutive 'again' test (§3.4, ex. 15–16). "John sharpened the knife again" → can be just one sharpening (PC) "#Chris thawed the meat again" → necessarily two thawings (result)

                                                                                    Equations
                                                                                    Instances For

                                                                                      The two diagnostics always agree.

                                                                                      A language's CoS verb typological profile.

                                                                                      • language : String
                                                                                      • family : String
                                                                                      • nPCParadigms :

                                                                                        Number of PC root verbal paradigms with data

                                                                                      • nResultParadigms :

                                                                                        Number of result root verbal paradigms with data

                                                                                      • pctPCMarked :

                                                                                        % of PC paradigms that are marked (exact ℚ)

                                                                                      • pctResultMarked :

                                                                                        % of result paradigms that are marked (exact ℚ)

                                                                                      Instances For
                                                                                        Equations
                                                                                        • One or more equations did not get rendered due to their size.
                                                                                        Instances For

                                                                                          The six in-depth case study languages (§4).

                                                                                          Equations
                                                                                          • BeaversEtAl2021.kakataibo = { language := "Kakataibo", family := "Panoan", nPCParadigms := 59, nResultParadigms := 64, pctPCMarked := 23.73, pctResultMarked := 31.25 }
                                                                                          Instances For
                                                                                            Equations
                                                                                            • BeaversEtAl2021.kinyarwanda = { language := "Kinyarwanda", family := "Northeastern Bantu", nPCParadigms := 69, nResultParadigms := 33, pctPCMarked := 4.17, pctResultMarked := 9.09 }
                                                                                            Instances For
                                                                                              Equations
                                                                                              • BeaversEtAl2021.hebrew = { language := "Hebrew (Modern)", family := "Semitic", nPCParadigms := 35, nResultParadigms := 42, pctPCMarked := 97.62, pctResultMarked := 0.98 }
                                                                                              Instances For
                                                                                                Equations
                                                                                                • BeaversEtAl2021.marathi = { language := "Marathi", family := "Indic", nPCParadigms := 40, nResultParadigms := 35, pctPCMarked := 20.41, pctResultMarked := 21.33 }
                                                                                                Instances For
                                                                                                  Equations
                                                                                                  • BeaversEtAl2021.greek = { language := "Greek (Modern)", family := "Indo-European", nPCParadigms := 76, nResultParadigms := 57, pctPCMarked := 2.63, pctResultMarked := 2.15 }
                                                                                                  Instances For
                                                                                                    Equations
                                                                                                    • BeaversEtAl2021.english = { language := "English", family := "Germanic", nPCParadigms := 43, nResultParadigms := 60, pctPCMarked := 0.0, pctResultMarked := 0.0 }
                                                                                                    Instances For

                                                                                                      Attested language types (§7.2, §8).

                                                                                                      Instances For
                                                                                                        @[implicit_reducible]
                                                                                                        Equations
                                                                                                        Equations
                                                                                                        • One or more equations did not get rendered due to their size.
                                                                                                        Instances For

                                                                                                          The fourth logically possible type (result marked, PC unmarked) is UNATTESTED — predicted by the markedness generalization.

                                                                                                          Equations
                                                                                                          Instances For

                                                                                                            Three and only three types are attested.

                                                                                                            All entries in our PC root sample are classified as PC.

                                                                                                            All entries in our result root sample are classified as result.

                                                                                                            theorem BeaversEtAl2021.most_pc_roots_have_statives :
                                                                                                            (List.filter (fun (r : RootMeaning) => decide (r.nSimpleStative * 2 r.nLanguages)) pcRoots).length pcRoots.length - 1

                                                                                                            Most PC roots in the sample have ≥ 50% simple stative attestation.

                                                                                                            No result root in the sample exceeds 10% simple stative attestation.

                                                                                                            The §§8–14 sections below were originally housed in Phenomena/Causation/Studies/Coon2019.lean as bridge content between Coon's Chuj data, the present empirical typology, and RootTypology.lean. They are relocated here per the chronological- dependency rule (Coon 2019 < Beavers et al. 2021 — only the later paper may reference the earlier).

                                                                                                            Map the theory's root type to the empirical root class. These are parallel enums — the bridge makes the correspondence explicit.

                                                                                                            Equations
                                                                                                            Instances For

                                                                                                              The mapping is a bijection (left inverse).

                                                                                                              The mapping is a bijection (right inverse).

                                                                                                              The empirical changeDenialTest agrees with the theory's entailsChange.

                                                                                                              Theory: RootType.entailsChange.result = true (result roots entail change) Empirical: changeDenialTest.result =.negative ("#The shattered vase has never shattered" is contradictory — the state entails prior change)

                                                                                                              The relationship is: entailsChange = true ↔ changeDenial = negative.

                                                                                                              Both diagnostics jointly align with the full semantic correlate package. The bridge form of semantic_determines_morphosyntax.

                                                                                                              theorem BeaversEtAl2021.pc_stative_prediction_matches_data :
                                                                                                              RootType.propertyConcept.hasSimpleStative = true (List.filter (fun (r : RootMeaning) => decide (r.nSimpleStative * 2 r.nLanguages)) pcRoots).length pcRoots.length - 1

                                                                                                              Theory predicts: PC roots have simple statives. Data confirms: 7 of 8 PC sample roots have ≥ 50% attestation. The one exception (oldRoot, age class) has 0 — noted by the present paper as a crosslinguistic outlier.

                                                                                                              Theory predicts: result roots LACK simple statives. Data confirms: all 10 result sample roots have ≤ 10% attestation.

                                                                                                              Theory predicts: PC verbs are morphologically marked; result verbs are unmarked (Markedness Generalization, @cite{beavers-etal-2021}). Data confirms: PC median marked % (56.01) > result median (15.20).

                                                                                                              The theory's markedness complementarity predicts that if a language marks PC verbs, it should NOT also show result verbs as more marked than PC verbs. The fourth logically possible language type (result marked, PC unmarked) is unattested — exactly 3 types are attested. This matches the theory: markedness_complementarity says verbal and stative markedness are always opposite.

                                                                                                              Chuj √TV result roots instantiate the theory's result root predictions: entails change, no simple stative, unmarked verb.

                                                                                                              Chuj √TV PC roots instantiate the theory's PC root predictions: no change entailment, has simple stative, marked verb.

                                                                                                              Every PC root in the empirical sample is classified as PC, and the theory predicts PC roots should have simple statives — they do.

                                                                                                              Every result root in the empirical sample is classified as result, and the theory predicts result roots lack simple statives — they do.