Cross-Linguistic Typology of Change-of-State Verbs #
@cite{dixon-1982} @cite{levin-1993} @cite{beavers-etal-2021} @cite{rose-nichols-2021} @cite{coon-2019}
Empirical data from Beavers, Everdell, Jerro, Kauhanen, @cite{beavers-etal-2021} "States and changes of state: A crosslinguistic study of the roots of verbal meaning." Language 97(3), 439–484.
88-language balanced sample (WALS 100 + additions). For each of 36 property concept (PC) and 36 result root meanings, the authors collected five-item paradigms (underlying root, simple stative, inchoative, causative, result stative) and coded morphological relationships.
Key findings (theory-neutral) #
Simple statives: PC roots overwhelmingly have simple stative forms (median = 95.67% of languages); result roots overwhelmingly lack them (median = 1.59%). Mann-Whitney U = 1266.5, p < 0.001.
Verbal markedness: PC root verbs tend to be marked (median = 56.01%); result root verbs tend to be unmarked (median = 15.20%). Mann-Whitney U = 1291, p < 0.001.
Subclass clustering: PC subclasses (dimension, color, value, physical property, speed) cluster near 100% simple statives. Result subclasses (breaking, cooking, killing, destroying, directed motion) cluster near 0%.
Two classes of change-of-state verb roots, defined by morphological and semantic diagnostics (@cite{beavers-etal-2021} §3.1).
Classification criteria:
- PC roots: the root of deadjectival CoS verbs (@cite{levin-1993}:245); describe @cite{dixon-1982}'s basic property types
- Result roots: the root of non-deadjectival CoS verbs; describe specific result states (physical damage, cooking, etc.)
- propertyConcept : CoSRootClass
- result : CoSRootClass
Instances For
Equations
- BeaversEtAl2021.instDecidableEqCoSRootClass x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
PC subclasses (@cite{dixon-1982}; @cite{beavers-etal-2021} ex. 5).
- dimension : PCSubclass
- age : PCSubclass
- value : PCSubclass
- color : PCSubclass
- physicalProperty : PCSubclass
- speed : PCSubclass
Instances For
Equations
- BeaversEtAl2021.instDecidableEqPCSubclass x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- BeaversEtAl2021.instReprPCSubclass = { reprPrec := BeaversEtAl2021.instReprPCSubclass.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
Result root subclasses (@cite{levin-1993}; @cite{beavers-etal-2021} ex. 6).
- entitySpecificCoS : ResultSubclass
- cooking : ResultSubclass
- breaking : ResultSubclass
- bending : ResultSubclass
- killing : ResultSubclass
- destroying : ResultSubclass
- calibratableCoS : ResultSubclass
- inherentlyDirectedMotion : ResultSubclass
Instances For
Equations
- BeaversEtAl2021.instDecidableEqResultSubclass x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
The five positions in a CoS verb paradigm (@cite{beavers-etal-2021} eq. 40). Every root meaning is associated with (up to) five forms.
- underlyingRoot : ParadigmPosition
- simpleStative : ParadigmPosition
- inchoative : ParadigmPosition
- causative : ParadigmPosition
- resultStative : ParadigmPosition
Instances For
Equations
- BeaversEtAl2021.instDecidableEqParadigmPosition x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Morphological relationship codes between forms (@cite{beavers-etal-2021} eq. 41, generalizing @cite{haspelmath-1993}:90–92).
- input : MorphRelation
- derived : MorphRelation
- transitive : MorphRelation
- labile : MorphRelation
- equipollent : MorphRelation
- unattested : MorphRelation
- suppletive : MorphRelation
Instances For
Equations
- BeaversEtAl2021.instDecidableEqMorphRelation x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
A root meaning with its crosslinguistic attestation.
- gloss : String
English gloss(es)
- rootClass : CoSRootClass
PC or result root
- subclass : Option (PCSubclass ⊕ ResultSubclass)
Subclass, if applicable
- nSimpleStative : ℕ
Number of languages with a simple stative for this root (Table A1)
- nLanguages : ℕ
Number of languages with any data for this root (Table A1)
- nMarkedVerbal : ℕ
Number of languages with a marked verbal paradigm (Table A2)
- nVerbalLanguages : ℕ
Number of languages with verbal paradigm data (Table A2)
Instances For
Equations
- BeaversEtAl2021.instReprRootMeaning = { reprPrec := BeaversEtAl2021.instReprRootMeaning.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
Percentage of languages with simple stative (exact ℚ — kernel-decidable).
Equations
- r.pctSimpleStative = if r.nLanguages = 0 then 0 else ↑r.nSimpleStative * 100 / ↑r.nLanguages
Instances For
Percentage of languages with marked verbal paradigm (exact ℚ).
Equations
- r.pctMarkedVerbal = if r.nVerbalLanguages = 0 then 0 else ↑r.nMarkedVerbal * 100 / ↑r.nVerbalLanguages
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Summary of a crosslinguistic comparison between PC and result roots. Numeric fields are exact ℚ (mathlib idiom for kernel-decidable arithmetic; @cite{beavers-etal-2021} reports values to 2 decimal places).
- measure : String
What is being measured
- pcMedian : ℚ
PC root median percentage
- resultMedian : ℚ
Result root median percentage
- uStatistic : ℚ
Mann-Whitney U statistic
- pThreshold : ℚ
One-tailed p-value threshold
- nPC : ℕ
Sample sizes (PC roots, result roots)
- nResult : ℕ
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Simple stative form comparison (§6, Fig. 1). Medians: 95.67% and 1.59% encoded as exact rationals via OfScientific.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Verbal markedness comparison (§7, Fig. 5). Medians: 56.01% and 15.20% encoded as exact rationals via OfScientific.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Both comparisons are statistically significant.
Empirical diagnostics for classifying roots. Each diagnostic independently sorts roots into two classes that align with the PC vs result distinction.
- positive : DiagnosticResult
- negative : DiagnosticResult
Instances For
Equations
- BeaversEtAl2021.instDecidableEqDiagnosticResult x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Stative form + change denial test (§3.3, ex. 10–11). "The bright photo has never brightened" → OK (PC) "#The shattered vase has never shattered" → contradictory (result)
Equations
Instances For
Restitutive 'again' test (§3.4, ex. 15–16). "John sharpened the knife again" → can be just one sharpening (PC) "#Chris thawed the meat again" → necessarily two thawings (result)
Equations
Instances For
The two diagnostics always agree.
A language's CoS verb typological profile.
- language : String
- family : String
- nPCParadigms : ℕ
Number of PC root verbal paradigms with data
- nResultParadigms : ℕ
Number of result root verbal paradigms with data
- pctPCMarked : ℚ
% of PC paradigms that are marked (exact ℚ)
- pctResultMarked : ℚ
% of result paradigms that are marked (exact ℚ)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
The six in-depth case study languages (§4).
Equations
- BeaversEtAl2021.kakataibo = { language := "Kakataibo", family := "Panoan", nPCParadigms := 59, nResultParadigms := 64, pctPCMarked := 23.73, pctResultMarked := 31.25 }
Instances For
Equations
- BeaversEtAl2021.kinyarwanda = { language := "Kinyarwanda", family := "Northeastern Bantu", nPCParadigms := 69, nResultParadigms := 33, pctPCMarked := 4.17, pctResultMarked := 9.09 }
Instances For
Equations
- BeaversEtAl2021.hebrew = { language := "Hebrew (Modern)", family := "Semitic", nPCParadigms := 35, nResultParadigms := 42, pctPCMarked := 97.62, pctResultMarked := 0.98 }
Instances For
Equations
- BeaversEtAl2021.marathi = { language := "Marathi", family := "Indic", nPCParadigms := 40, nResultParadigms := 35, pctPCMarked := 20.41, pctResultMarked := 21.33 }
Instances For
Equations
- BeaversEtAl2021.greek = { language := "Greek (Modern)", family := "Indo-European", nPCParadigms := 76, nResultParadigms := 57, pctPCMarked := 2.63, pctResultMarked := 2.15 }
Instances For
Equations
- BeaversEtAl2021.english = { language := "English", family := "Germanic", nPCParadigms := 43, nResultParadigms := 60, pctPCMarked := 0.0, pctResultMarked := 0.0 }
Instances For
Attested language types (§7.2, §8).
- asymmetric : LanguageType
- highMarking : LanguageType
- lowMarking : LanguageType
Instances For
Equations
- BeaversEtAl2021.instDecidableEqLanguageType x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
The fourth logically possible type (result marked, PC unmarked) is UNATTESTED — predicted by the markedness generalization.
Equations
Instances For
Three and only three types are attested.
All entries in our PC root sample are classified as PC.
All entries in our result root sample are classified as result.
Most PC roots in the sample have ≥ 50% simple stative attestation.
No result root in the sample exceeds 10% simple stative attestation.
The §§8–14 sections below were originally housed in
Phenomena/Causation/Studies/Coon2019.lean as bridge content
between Coon's Chuj data, the present empirical typology, and
RootTypology.lean. They are relocated here per the chronological-
dependency rule (Coon 2019 < Beavers et al. 2021 — only the later
paper may reference the earlier).
Map the theory's root type to the empirical root class. These are parallel enums — the bridge makes the correspondence explicit.
Equations
Instances For
Map back from empirical to theory.
Equations
Instances For
The mapping is a bijection (left inverse).
The mapping is a bijection (right inverse).
The empirical changeDenialTest agrees with the theory's entailsChange.
Theory: RootType.entailsChange.result = true (result roots entail change)
Empirical: changeDenialTest.result =.negative ("#The shattered vase
has never shattered" is contradictory — the state entails prior change)
The relationship is: entailsChange = true ↔ changeDenial = negative.
The empirical restitutiveAgainTest agrees with the theory's
allowsRestitutiveAgain.
Both diagnostics jointly align with the full semantic correlate package.
The bridge form of semantic_determines_morphosyntax.
Theory predicts: PC roots have simple statives. Data confirms: 7 of 8 PC sample roots have ≥ 50% attestation. The one exception (oldRoot, age class) has 0 — noted by the present paper as a crosslinguistic outlier.
Theory predicts: result roots LACK simple statives. Data confirms: all 10 result sample roots have ≤ 10% attestation.
Theory predicts: PC verbs are morphologically marked; result verbs are unmarked (Markedness Generalization, @cite{beavers-etal-2021}). Data confirms: PC median marked % (56.01) > result median (15.20).
The theory's markedness complementarity predicts that if a language
marks PC verbs, it should NOT also show result verbs as more marked
than PC verbs. The fourth logically possible language type (result
marked, PC unmarked) is unattested — exactly 3 types are attested.
This matches the theory: markedness_complementarity says verbal and
stative markedness are always opposite.
Chuj √TV result roots instantiate the theory's result root predictions: entails change, no simple stative, unmarked verb.
Chuj √TV PC roots instantiate the theory's PC root predictions: no change entailment, has simple stative, marked verb.
The Chuj fragment witnesses the full orthogonality theorem: all four cells of the (arity × changeType) matrix are inhabited.
Per-root class verification: each Chuj root's change entailment matches
its predicted morphosyntactic correlates via grand_unification.
Every PC root in the empirical sample is classified as PC, and the theory predicts PC roots should have simple statives — they do.
Every result root in the empirical sample is classified as result, and the theory predicts result roots lack simple statives — they do.
The subclass taxonomies are aligned: B&KG's PCSubclass has 6
categories (matching their Table 2); the theory's PCClass has 7
(adding humanPropensity from @cite{dixon-1982}, attested in
@cite{hanink-koontz-garboden-2025}). ResultClass and ResultSubclass
match exactly (8 subclasses).