Anderson (2006): Auxiliary Verb Constructions #
Cross-linguistic typology of auxiliary verb constructions (AVCs): classification by inflectional distribution between auxiliary and lexical verb, with diachronic grounding in [Hei93]'s grammaticalization framework.
Key contributions formalized #
- Inflectional pattern typology (
InflPatternfrom substrate): auxHeaded, lexHeaded, doubled, split, splitDoubled — defined inTypology/AuxiliaryVerbs.lean, verified per-datum here. - Semantic head invariant (Anderson p. 23, Table 3.1 p. 116): the lexical verb is always the semantic head, regardless of where inflection sits.
- Typed inflectional distribution:
InflDistribution(fromMorphology) records whichMorphCategoryvalues each element hosts. - Grammaticalization grounding: Anderson ch. 7 traces AVCs
onto Heine 1993's cline (Anderson p. 5: "According to Heine
(1993: 48ff.)..."). The cline lives in
Morphology/Grammaticalization.lean, anchored on [Hei93] ch. 3.
Coverage #
Sample of 9 AVC datums across 7 languages, covering all 5 patterns:
- English have eaten (aux-headed) — Anderson ch. 2 canonical example
- Doyayo lex-headed (Anderson ch. 3 ex. 15a, p. 121)
- Doyayo split/doubled (Anderson ch. 5 ex. 129, p. 223)
- Gorum (doubled)
- Jakaltek (split, abs/erg)
- Pipil split/doubled (Anderson ch. 5 ex. 133, p. 224)
- Pipil lex-headed (Anderson ch. 3 ex. 49, p. 130, with weli)
- Finnish split (negative auxiliary ei) [Kar18]
- Hemba split/doubled
2026-04-30 audit fixes #
PDF-verified against Anderson 2006 (book pp. 5, 114, 116, 121, 224) and Heine 1993 (book p. 48ff. via Anderson p. 5):
- Doyayo
.split→.lexHeaded(Ch 3 ex. 15a) + newdoyayoSplitDoubled(Ch 5 ex. 129). Anderson never classifies Doyayo as plain split. - Pipil
.split→.splitDoubled(Ch 5 ex. 133, with subjects doubly marked). FragmentsplitDistributioncorrected to put.agreementon AUX (matches the gloss1-AUX 1-2PL-show). - Cline reattributed to [Hei93] (Anderson p. 5
explicitly cites Heine 1993:48ff.). Substrate
GramStage.toMorphStatusprojection moved toMorphology/Grammaticalization.lean. negStrategyStageis nowNegStrategy.toGramStage : NegStrategy → Option GramStage(inTypology/Negation.lean), with.negParticle => none(not on the verbal cline) — fixes the earlier formaliser-introduced collapse of [Mie05]'s particle-vs-verb distinction.- Dropped Table 2.3 reference: Anderson's Table 2.3 (p. 114) catalogs non-finite forms within AUX-headed AVCs (a single- pattern grid of language exemplars), not a cross-pattern prediction. The pattern→LV-form prediction lives in Ch 2-5 prose, not in any single table.
- English exemplar: switched from will go (modal) to have eaten (perfect) as Anderson's canonical Ch 2 AUX-headed example; gloss corrected from FUT (non-Andersonian for will) to AUX.PRS + PTCP.
- 9×
native_decide→decide: structural reduction is tractable; native_decide bypasses the kernel. finnish_form_from_paradigm: paired withfinnish_1sg_in_paradigmwitness lemma, now provably grounded in the Fragment paradigm rather than passing through the deadnonefallback branch.- Sorace bridge chains through [Sor00]'s
vendlerClassToTypicalTransitivityfor a 2-step Vendler → TransitivityClass derivation. - Cross-framework theorem
anderson_miestamo_agree_on_neg_morphologydocuments the corrected mapping that preserves Miestamo's particle-vs-verb distinction.
A cross-linguistic AVC datum. Study-local: the AuxiliaryVerbs substrate
classifies over InflPattern; this paper's per-language rows bundle the
form/distribution/gloss with it.
- language : String
- form : String
- inflPattern : AuxiliaryVerbs.InflPattern
- distribution : Option Morphology.InflDistribution
- gloss : String
Instances For
Equations
- Anderson2006.instReprAVCDatum = { reprPrec := Anderson2006.instReprAVCDatum.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
- Anderson2006.instBEqAVCDatum.beq x✝¹ x✝ = false
Instances For
Equations
Per-language AVC data #
The 9 sample datums (7 languages) covering all 5 of Anderson's
inflectional patterns. Per-datum verification theorems below ensure
each datum's inflPattern and form/distribution derive from
the Fragment files (changing a Fragment entry breaks exactly one
theorem).
English have eaten — aux-headed (AUX have carries tense and agreement, LV eaten is a past participle). Anderson ch. 2 (p. 40) describes the English perfect as AUX have with the LV in the -ed ~ -en form; the specific have eaten form is this file's instantiation of that pattern, not a verbatim Anderson example.
Equations
- Anderson2006.english = { language := "English", form := "have eaten", inflPattern := AuxiliaryVerbs.InflPattern.auxHeaded, gloss := "AUX.PRS eat.PTCP" }
Instances For
Doyayo lex-headed (Anderson Ch 3 ex. 15a, p. 121).
mi¹ (gi²) kpel¹-ko¹ 'I'm going to pour'. Auxiliary gi²
parenthesized in Anderson's example, carrying only tonal subject
person (Anderson p. 120); LV carries proximate TAM. Form derived from
Doyayo.AuxiliaryVerbs.lexHeadedForm.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Doyayo split/doubled (Anderson Ch 5 ex. 129, p. 223).
hi¹-za¹ hi¹-zaa¹³ hi¹-lɔ-mɔ 'they might come bite you'.
Subject hi¹ doubly marked on AUX and LV; object -mɔ only
on LV. Anderson p. 223: "this pattern... is common in Doyayo."
Equations
- One or more equations did not get rendered due to their size.
Instances For
Gorum (doubled): subject + TAM on both AUX and LV.
Form derived from Gorum.AuxiliaryVerbs.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Jakaltek (split): absolutive on AUX, ergative on LV.
Form derived from Jakaltek.AuxiliaryVerbs.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pipil split/doubled (Anderson Ch 5 ex. 133b, p. 224).
n-yu ni-mitsin-ilwitia 'I'm going to show you'. Subject 1
doubly marked (n- on AUX, ni- on LV); object -mitsin-
only on LV. AUX root yu lexically encodes prospective TAM.
Anderson p. 224: "Subjects are doubly marked... while objects
occur only on lexical verbs."
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pipil lex-headed (Anderson ch. 3 ex. 49, p. 130; Campbell 1985: 139). weli ni-nehnemi wehka 'CAP 1-walk far' 'I can walk far'. AUX weli uninflected; LV carries subject agreement. Anderson's fn. 6 (p. 221) separately documents lex-headed/split-doubled variation in the Pipil progressive — a different AVC from this capability construction.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Finnish negative auxiliary ei (split): person/number on aux,
TAM on lexical verb (connegative form). Anderson §1.7.2 (p. 33-34)
presents negative auxiliaries as a family-level Uralic trait
(with connegative-marked LV, exx. 44-48) and, across other
families, as spanning multiple AVC patterns — Udihe and Neyo
aux-headed, Kokota split, Kwerba lex-headed, 'Iipay doubled.
Finnish ei itself is
not classified by Anderson in §1.7.2 with a specific pattern label;
the split classification here follows [Kar18] §19.5
where the connegative suffix on the LV is the load-bearing diagnostic.
The split nature derives from Finnish.Negation.finnishNegDistribution:
the negative auxiliary hosts negation, tense, and agreement, while
the lexical verb retains stem and aspect.
The 1sg neg-aux form derives from negParadigm; see the
finnish_1sg_form_eq witness lemma for the strong pinning of
the form-construction. Gloss style follows Anderson's Uralic
convention (Neg-1 read-conneg).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Hemba split/doubled: subject doubled on both AUX and LV; tense
on AUX only; mood on LV only. Form derived from
Hemba.AuxiliaryVerbs.
Equations
- One or more equations did not get rendered due to their size.
Instances For
All 9 AVC datums (covering all 5 of Anderson's patterns).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Per-datum InflPattern verification #
Each theorem below is a structural drift sentry: changing a Fragment entry's underlying analysis breaks exactly one theorem.
Per-datum form/distribution Fragment grounding #
Finnish form construction grounding #
The Finnish form construction uses negParadigm.find?, which is
Option-typed. The fallback | none => "en lue" was historically
chosen to coincide with the success-branch result, making
finnish.form = "en lue" pass by rfl even when the lookup
would return none. The witness theorem below grounds the
construction in the actual paradigm content.
The Finnish 1sg negative-auxiliary entry exists in negParadigm.
Strong pinning: the 1sg paradigm entry's form is exactly "en".
Together with finnish_1sg_in_paradigm, this pins finnish.form
to the Fragment's actual paradigm content. A future change to
negParadigm that altered the 1sg form (or removed the entry
entirely) would break this theorem; the dead-branch | none => "en lue" fallback in finnish.form cannot mask the change.
The Finnish form is the 1sg negParadigm entry's form
concatenated with " lue".
Connection to Finnish Fragment #
The Finnish negative auxiliary construction is a split AVC: the auxiliary hosts some inflectional categories and the lexical verb hosts others, with neither element hosting all categories.
Pattern coverage (named witnesses) #
All five of Anderson's inflectional patterns are attested in the
sample. Each pattern is witnessed by a named datum (existence by
witness, not aggregate .any-scan). Adding a new datum doesn't
break this theorem; changing one of the named witnesses' patterns
does.
Structural theorems on distribution #
In Gorum's doubled AVC, aux and lex host exactly the same categories.
In Doyayo's lex-headed AVC, the auxiliary hosts ONLY tonal subject agreement (per Anderson p. 120), and the LV carries TAM.
Anderson Ch 5 §5.2 payoff (Doyayo): subject agreement is
doubled across AUX and LV; object agreement appears on LV only.
Now Lean-checkable directly via Controller-typed .agreement,
rather than via the fragile list-length encoding the 0.230.578
workaround used.
Anderson Ch 5 §5.2 payoff (Pipil): same generalization as
Doyayo — subject doubled across AUX and LV, object on LV only.
The AUX root yu itself encodes TAM lexically (no separate
.tense morpheme on AUX).
In Pipil's lex-headed AVC, the auxiliary hosts no inflection.
In Finnish's split AVC, aux and lex host disjoint category types.
(.stem on the lex side is a base, not an inflectional overlap.)
Anderson Ch 5 abs/erg payoff (Jakaltek): with role-typed
agreement, the absolutive-on-AUX / ergative-on-LV split is now
a category-level distinction (.agreement .obj vs .agreement .subj),
not just a within-category meta-comment.
In Hemba's split/doubled AVC, subject agreement is doubled (on both elements), tense is AUX-only, mood is LV-only. No object agreement in this construction.
Dual headedness #
Anderson distinguishes three notions of head — inflectional, phrasal/syntactic, and semantic (§1.4, pp. 22-24; Table 3.1 on p. 116 tabulates the assignment for the lex-headed pattern). The semantic head (content provider) is always the lexical verb (Anderson p. 23: "It is the lexical verb"); the inflectional host varies by pattern. This mismatch is what makes AVCs typologically distinctive.
The semantic head and inflectional host coincide only in lex-headed AVCs. In all other patterns they diverge: the semantic head is always the lexical verb, but inflection may sit on the auxiliary (or on both elements).
Negative auxiliaries as AVCs #
[And06a] §1.7.2 (p. 33-34) treats negative auxiliaries
across multiple AVC patterns: aux-headed in Udihe, Neyo; split in
Kokota; lex-headed in Kwerba; doubled in 'Iipay. The example rows
live in Data/Examples/Anderson2006.json (Komi (47a,b), Udihe (49),
Kwerba (52a,b), all verified against the book); each row's
infl_pattern feature records the book's classification where it
states one. The NegStrategy → InflPattern mapping lives in
Typology/Negation.lean: NegStrategy.expectedInflPattern encodes
the most common verbal-negator → aux-headed mapping, and the Kwerba
rows witness below that it is a tendency, not a law.
Udihe (49) bi ei-mi sa: is classified aux-headed by Anderson, and the strategy-level projection expects exactly that.
Kwerba (52a,b) shows a negative auxiliary in a lex-headed AVC
(the lexical verb hosts the inflection), so the aux-headed
expectation of NegStrategy.expectedInflPattern is defeasible —
Anderson's own four-pattern list is the counterexample source.
The Komi tense alternation (47a,b) sits entirely on the negative auxiliary: same lexical verb token, different auxiliary form.
LV form predictions across patterns #
Anderson predicts the verb form of the lexical verb from the
inflectional pattern. AUX-headed AVCs have a nonfinite LV
(infinitive/participle/etc.; Anderson Table 2.3 on p. 114
catalogs the LV non-finite-form types within AUX-headed AVCs
specifically — the table is a one-pattern grid of language
exemplars, not a cross-pattern prediction). All other patterns
have a finite LV (carrying at least some inflection, per Anderson
chs. 3-5 prose). The pattern→LV-form prediction is formalized in
Typology.lvVerbForm.
Bridge to Sorace 2000 (auxiliary selection) #
Be/have auxiliary selection (Typology/AuxiliaryVerbs.lean) operates
within aux-headed AVCs: the question of which auxiliary appears
presupposes the auxiliary hosts inflection. [Sor00]'s
sister study Studies/Sorace2000.lean provides
vendlerClassToTypicalTransitivity; the quantified composition
with canonicalSelection is given below.
Sorace's gradient Auxiliary Selection Hierarchy is not yet
formalized in linglib (per Sorace2000.lean docstring); the
contrastive theorem against Anderson's discrete pattern typology
(anderson_silent_on_intermediate_ash) will land when ASH ranks
are added.
Auxiliary selection presupposes aux-headed pattern: the selecting auxiliary hosts tense/agreement (is the inflectional head).
Quantified Sorace bridge: composing vendlerClassToTypicalTransitivity
with canonicalSelection yields .be exactly for achievements,
.have elsewhere (Italian è arrivato instantiates the
achievement case). Exposes the composition
canonicalSelection ∘ vendlerClassToTypicalTransitivity
as a single theorem rather than a hand-picked tuple. Falsifiable
by changing either lookup.
Cross-framework: Miestamo 2005 (negation morpheme classification) #
[Mie05] classifies negation strategies by morpheme
type (NegMorphemeType: .auxVerb, .affix, .particle, ...);
[And06a] via [Hei93]'s grammaticalization
framework places verbal negators on the cline at .auxiliary and
non-verbal negators off the cline (Anderson §1.7.2 covers only
verbal negators). The two frameworks classify by independently-
motivated criteria but, for the strategies linglib's
NegStrategy enum exposes, AGREE on which strategies are
"verbal": Anderson's .toGramStage = some .auxiliary is exactly
Miestamo's .toNegMorphemeType = .auxVerb.
Composition with [Mie05]'s
verbal_constructional_always_derived (in
Linglib/Studies/Miestamo2005.lean) then
yields: any NegStrategy Anderson places at the auxiliary cline
stage, in any Miestamo datum showing constructional asymmetry,
has its asymmetry classified as .derived rather than
.independent — a falsifiable empirical prediction whose chain
runs Anderson's cline → Miestamo's morpheme type → Miestamo's
asymmetry source.
The earlier anderson_miestamo_agree_on_neg_morphology theorem
in this slot (0.230.573) was a stapled conjunction of three
independent facts; the meta-audit at 0.230.576 replaced it with
the quantified-equivalence shape below.
Cross-framework equivalence: Anderson's grammaticalization-cline
placement at .auxiliary and Miestamo's morpheme-type
classification as .auxVerb partition the NegStrategy enum
identically. Both frameworks classify exactly .negVerb
(Finnish ei-style inflecting negators) as the verbal subtype.
Falsifiable by changing either projection: a future split of
NegStrategy.negVerb into Miestamo-style auxVerb-vs-doubleNeg
subtypes would break this without breaking either projection
individually.