@cite{haspelmath-2021}: Role-reference associations and the explanation of argument coding splits #

@cite{haspelmath-2021}, Linguistics 59(1): 123–174. DOI: 10.1515/ling-2020-0252.

Overview #

Haspelmath proposes a single meta-universal — the Role-Reference Association Universal (Universal 1) — that subsumes differential object marking, split ergativity, ditransitive splits, and person-scenario splits under one generalization: deviations from the usual associations of role rank and referential prominence tend to be coded by longer grammatical forms.

Universal 1 in turn is "evidently a special case of" the broader form-frequency correspondence universal (Universal 68 in §11.2): the "usual" associations ARE the frequent ones, and frequent expressions get shorter forms (Zipf).

The Paper's Numbered Universals #

The paper states the following numbered universals (Figure 1, §11.1):

Meta-universals (§2) #

Universal 1 (statement (5)): Role-Reference Association Universal
Universal 2 (statement (6)): usual role-reference associations

Single-argument coding splits (§3–5) #

Universal 3 (statement (13), §3): Single-argument flagging universal
Universal 4 (statement (14), §4.1): Split P flagging (DOM)
Universal 5 (statement (16), §3, restated §6): Scenario coding universal
Universal 6 (statement (21), §4.2): Split A flagging (DSM)
Universal 7 (statement (26), §5): Split R flagging
Universal 8 (statement (27), §5): Split T flagging

Ditransitive scenario splits (§7) #

Universal 9a (statement (41), §7.1): Ditransitive Person-Role Constraint
Universal 9b (statement (42), §7.1): Ditransitive person-role universal

Relative scenario / inverse / alternations (§8–§10) #

Universal 10 (statement (54), §8): Relative scenario universal
Universal 11 (statement (57), §9): Inverse universal
Universal 12 (statement (61), §10.1): Alternation universal
Universal 13 (statement (62), §10.1): Passive universal
Universal 14 (statement (63), §10.1): Dative alternation universal

The reductive claim #

Universal 68 (statement (68), §11.2): Grammatical form-frequency correspondence universal — Universal 1 is "evidently a special case of" this broader universal.

What This File Formalizes #

Universals 1–14, with U4 and U6 re-expressing model predictions from @cite{aissen-2003} and @cite{de-hoop-malchukov-2008} respectively, and a final §18 contrastive section showing how @cite{marantz-1991}'s dependent case algorithm partitions the empirical territory of "split case marking" with Haspelmath's framework: structural-condition splits (Marantz) vs. prominence-condition splits (Haspelmath).

What This File Does NOT Formalize #

The paper's frequency claims are tendency-claims based on corpus regularities. Haspelmath himself: "I do not focus on documenting the discourse frequencies in this paper... testing this claim more thoroughly is a topic for future comparative corpus research" (p. 126). Lean theorems committing the frequency-class function to specific Nat values would over-reify a tendency-claim. We use Scenario.frequencyClass from the substrate as a discrete proxy and clearly mark its theorems as proxy-checks, not empirical claims about token frequencies.

Haspelmath 2021's deeper explanation of argument-coding splits: the Role-Reference Association Universal (Universal 1) reduces to the general cognitive tendency for frequent expressions to be short.

Three-step chain:

1. **Frequency asymmetry**: some role-reference combinations are more
   frequent than others ("I saw him" > "He saw me"; animate agents >
   inanimate agents).
2. **Form-frequency correspondence**: more frequent expressions tend
   to get shorter forms (diachronic erosion + analogical extension).
3. **Coding asymmetry**: "usual" role-reference associations (= the
   frequent ones) get shorter (often zero) coding; "unusual" ones get
   longer (overt) coding.

Previously housed in `Core/FormFrequency.lean` — demoted to this study
file at 0.230.551 when the consumer count was 1 (only Haspelmath2021
used any of the primitives) and four primitives in the substrate file
(`respectsFormFrequency`, `argumentCodingRespectsFrequency`,
`VoiceDirection`, `DitransitiveFrame`) were completely unused.