Anderson (2006): Auxiliary Verb Constructions #
@cite{anderson-2006} @cite{heine-1993}
Cross-linguistic typology of auxiliary verb constructions (AVCs): classification by inflectional distribution between auxiliary and lexical verb, with diachronic grounding in @cite{heine-1993}'s grammaticalization framework.
Key contributions formalized #
- Inflectional pattern typology (
InflPatternfrom substrate): auxHeaded, lexHeaded, doubled, split, splitDoubled — defined inTypology/AuxiliaryVerbs.lean, verified per-datum here. - Semantic head invariant (Anderson p. 23, Table 3.1 p. 117): the lexical verb is always the semantic head, regardless of where inflection sits.
- Typed inflectional distribution:
InflDistribution(fromCore.Morphology) records whichMorphCategoryvalues each element hosts. - Grammaticalization grounding: Anderson ch. 7 traces AVCs
onto Heine 1993's cline (Anderson p. 5: "According to Heine
(1993: 48ff.)..."). The cline lives in
Theories/Diachronic/Grammaticalization.lean, anchored on @cite{heine-1993} ch. 3.
Coverage #
Sample of 9 AVC datums across 7 languages, covering all 5 patterns:
- English have eaten (aux-headed) — Anderson ch. 2 canonical example
- Doyayo lex-headed (Anderson ch. 3 ex. 15a, p. 121)
- Doyayo split/doubled (Anderson ch. 5 ex. 129, p. 223)
- Gorum (doubled)
- Jakaltek (split, abs/erg)
- Pipil split/doubled (Anderson ch. 5 ex. 133, p. 224)
- Pipil lex-headed (Anderson p. 220-221 fn. 6, with weli)
- Finnish split (negative auxiliary ei) @cite{karlsson-2017}
- Hemba split/doubled
2026-04-30 audit fixes #
PDF-verified against Anderson 2006 (book pp. 5, 114, 117, 121, 224) and Heine 1993 (book p. 48ff. via Anderson p. 5):
- Doyayo
.split→.lexHeaded(Ch 3 ex. 15a) + newdoyayoSplitDoubled(Ch 5 ex. 129). Anderson never classifies Doyayo as plain split. - Pipil
.split→.splitDoubled(Ch 5 ex. 133, with subjects doubly marked). FragmentsplitDistributioncorrected to put.agreementon AUX (matches the gloss1-AUX 1-2PL-show). - Cline reattributed to @cite{heine-1993} (Anderson p. 5
explicitly cites Heine 1993:48ff.). Substrate
GramStage.toMorphStatusprojection moved toTheories/Diachronic/Grammaticalization.lean. negStrategyStagemoved toNegativeAuxiliaries.leanasNegStrategy.toGramStage : NegStrategy → Option GramStage, with.negParticle => none(not on the verbal cline) — fixes the earlier formaliser-introduced collapse of @cite{miestamo-2005}'s particle-vs-verb distinction.- Dropped Table 2.3 reference: Anderson's Table 2.3 (p. 114) catalogs non-finite forms within AUX-headed AVCs (a single- pattern grid of language exemplars), not a cross-pattern prediction. The pattern→LV-form prediction lives in Ch 2-5 prose, not in any single table.
- English exemplar: switched from will go (modal) to have eaten (perfect) as Anderson's canonical Ch 2 AUX-headed example; gloss corrected from FUT (non-Andersonian for will) to AUX.PRS + PTCP.
- 9×
native_decide→decide: structural reduction is tractable; native_decide bypasses the kernel. finnish_form_from_paradigm: paired withfinnish_1sg_in_paradigmwitness lemma, now provably grounded in the Fragment paradigm rather than passing through the deadnonefallback branch.- Sorace bridge chains through @cite{sorace-2000}'s
vendlerClassToTypicalTransitivityfor a 2-step Vendler → TransitivityClass derivation. - Cross-framework theorem
anderson_miestamo_agree_on_neg_morphologydocuments the corrected mapping that preserves Miestamo's particle-vs-verb distinction.
Per-language AVC data #
The 9 sample datums (7 languages) covering all 5 of Anderson's
inflectional patterns. Per-datum verification theorems below ensure
each datum's inflPattern and form/distribution derive from
the Fragment files (changing a Fragment entry breaks exactly one
theorem).
English have eaten — aux-headed (AUX have carries tense and agreement, LV eaten is a past participle). Anderson ch. 2 (p. 40) describes English perfects schematically as have V-ed; the specific have eaten form is this file's instantiation of that pattern, not a verbatim Anderson example.
Equations
- Anderson2006.english = { language := "English", form := "have eaten", inflPattern := Typology.AuxiliaryVerbs.InflPattern.auxHeaded, gloss := "AUX.PRS eat.PTCP" }
Instances For
Doyayo lex-headed (Anderson Ch 3 ex. 15a, p. 121).
mi¹ (gi²) kpel¹-ko¹ 'I'm going to pour'. Auxiliary gi²
uninflected (parenthesized in Anderson's gloss); LV carries
proximate-future TAM. Form derived from
Fragments.Doyayo.AuxiliaryVerbs.lexHeadedForm.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Doyayo split/doubled (Anderson Ch 5 ex. 129, p. 223).
hi¹-za¹ hi¹-zaa³ hi¹-lɔ-mɔ 'they might come bite you'.
Subject hi¹ doubly marked on AUX and LV; object -mɔ only
on LV. Anderson p. 223: "this pattern... is common in Doyayo."
Equations
- One or more equations did not get rendered due to their size.
Instances For
Gorum (doubled): subject + TAM on both AUX and LV.
Form derived from Fragments.Gorum.AuxiliaryVerbs.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Jakaltek (split): absolutive on AUX, ergative on LV.
Form derived from Fragments.Jakaltek.AuxiliaryVerbs.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pipil split/doubled (Anderson Ch 5 ex. 133b, p. 224).
n-yu ni-mitsin-ilwitia 'I'm going to show you'. Subject 1
doubly marked (n- on AUX, ni- on LV); object -mitsin-
only on LV. AUX root yu lexically encodes prospective TAM.
Anderson p. 224: "Subjects are doubly marked... while objects
occur only on lexical verbs."
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pipil lex-headed (Anderson p. 220-221 fn. 6; Campbell 1985: 139). weli ni-nehnemi wehka 'I can walk far'. AUX weli uninflected; LV carries subject agreement. Anderson explicitly contrasts this with the split/doubled pattern in fn. 6 as a coexisting Pipil construction.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Finnish negative auxiliary ei (split): person/number on aux,
TAM on lexical verb (connegative form). Anderson §1.7.2 (p. 33-34)
treats Uralic negative auxiliaries as a class spanning multiple
AVC patterns — Udihe and Neyo as aux-headed, Kokota as split,
Kwerba as lex-headed, 'Iipay as doubled. Finnish ei itself is
not classified by Anderson in §1.7.2 with a specific pattern label;
the split classification here follows @cite{karlsson-2017} §19.5
where the connegative suffix on the LV is the load-bearing diagnostic.
The split nature derives from Fragments.Finnish.Negation.finnishNegDistribution:
the negative auxiliary hosts negation, tense, and agreement, while
the lexical verb retains stem and aspect.
The 1sg neg-aux form derives from negParadigm; see the
finnish_1sg_form_eq witness lemma for the strong pinning of
the form-construction. Gloss style follows Anderson's Uralic
convention (Neg-1 read-conneg).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Hemba split/doubled: subject doubled on both AUX and LV; tense
on AUX only; mood on LV only. Form derived from
Fragments.Hemba.AuxiliaryVerbs.
Equations
- One or more equations did not get rendered due to their size.
Instances For
All 9 AVC datums (covering all 5 of Anderson's patterns).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Per-datum InflPattern verification #
Each theorem below is a structural drift sentry: changing a Fragment entry's underlying analysis breaks exactly one theorem.
Per-datum form/distribution Fragment grounding #
Finnish form construction grounding #
The Finnish form construction uses negParadigm.find?, which is
Option-typed. The fallback | none => "en lue" was historically
chosen to coincide with the success-branch result, making
finnish.form = "en lue" pass by rfl even when the lookup
would return none. The witness theorem below grounds the
construction in the actual paradigm content.
The Finnish 1sg negative-auxiliary entry exists in negParadigm.
Strong pinning: the 1sg paradigm entry's form is exactly "en".
Together with finnish_1sg_in_paradigm, this pins finnish.form
to the Fragment's actual paradigm content. A future change to
negParadigm that altered the 1sg form (or removed the entry
entirely) would break this theorem; the dead-branch | none => "en lue" fallback in finnish.form cannot mask the change.
The Finnish form is the 1sg negParadigm entry's form
concatenated with " lue".
Connection to Finnish Fragment #
The Finnish negative auxiliary construction is a split AVC: the auxiliary hosts some inflectional categories and the lexical verb hosts others, with neither element hosting all categories.
Pattern coverage (named witnesses) #
All five of Anderson's inflectional patterns are attested in the
sample. Each pattern is witnessed by a named datum (existence by
witness, not aggregate .any-scan). Adding a new datum doesn't
break this theorem; changing one of the named witnesses' patterns
does.
Structural theorems on distribution #
In Gorum's doubled AVC, aux and lex host exactly the same categories.
In Doyayo's lex-headed AVC, the auxiliary hosts ONLY tonal subject agreement (per Anderson p. 120), and the LV carries TAM.
Anderson Ch 5 §5.2 payoff (Doyayo): subject agreement is
doubled across AUX and LV; object agreement appears on LV only.
Now Lean-checkable directly via Controller-typed .agreement,
rather than via the fragile list-length encoding the 0.230.578
workaround used.
Anderson Ch 5 §5.2 payoff (Pipil): same generalization as
Doyayo — subject doubled across AUX and LV, object on LV only.
The AUX root yu itself encodes TAM lexically (no separate
.tense morpheme on AUX).
In Pipil's lex-headed AVC, the auxiliary hosts no inflection.
In Finnish's split AVC, aux and lex host disjoint category types.
(.stem on the lex side is a base, not an inflectional overlap.)
Anderson Ch 5 abs/erg payoff (Jakaltek): with role-typed
agreement, the absolutive-on-AUX / ergative-on-LV split is now
a category-level distinction (.agreement .obj vs .agreement .subj),
not just a within-category meta-comment.
In Hemba's split/doubled AVC, subject agreement is doubled (on both elements), tense is AUX-only, mood is LV-only. No object agreement in this construction.
Dual headedness #
Anderson p. 117 Table 3.1 distinguishes three notions of head: inflectional, phrasal/syntactic, and semantic. The semantic head (content provider) is always the lexical verb (Anderson p. 23); the inflectional host varies by pattern. This mismatch is what makes AVCs typologically distinctive.
The semantic head and inflectional host coincide only in lex-headed AVCs. In all other patterns they diverge: the semantic head is always the lexical verb, but inflection may sit on the auxiliary (or on both elements).
Negative auxiliaries as AVCs #
@cite{anderson-2006} §1.7.2 (p. 33-34) treats negative auxiliaries
across multiple AVC patterns: aux-headed in Udihe, Neyo; split in
Kokota; lex-headed in Kwerba; doubled in 'Iipay. The substantive
theorems about NegStrategy → InflPattern live in
Phenomena/AuxiliaryVerbs/NegativeAuxiliaries.lean (negVerb_implies_auxHeaded
encodes the most common verbal-negator → aux-headed mapping; the
other patterns Anderson documents are not yet a Lean-checkable
NegStrategy → InflPattern typology because they require finer
sub-typing of NegStrategy.negVerb). This file does not re-export.
LV form predictions across patterns #
Anderson predicts the verb form of the lexical verb from the
inflectional pattern. AUX-headed AVCs have a nonfinite LV
(infinitive/participle/etc.; Anderson Table 2.3 on p. 114
catalogs the LV non-finite-form types within AUX-headed AVCs
specifically — the table is a one-pattern grid of language
exemplars, not a cross-pattern prediction). All other patterns
have a finite LV (carrying at least some inflection, per Anderson
chs. 3-5 prose). The pattern→LV-form prediction is formalized in
Typology.lvVerbForm.
Bridge to Sorace 2000 (auxiliary selection) #
Be/have auxiliary selection (Selection.lean) operates within
aux-headed AVCs: the question of which auxiliary appears
presupposes the auxiliary hosts inflection. @cite{sorace-2000}'s
sister study Studies/Sorace2000.lean provides
vendlerClassToTypicalTransitivity; the quantified composition
with canonicalSelection is given below.
Sorace's gradient Auxiliary Selection Hierarchy is not yet
formalized in linglib (per Sorace2000.lean docstring); the
contrastive theorem against Anderson's discrete pattern typology
(anderson_silent_on_intermediate_ash) will land when ASH ranks
are added.
Auxiliary selection presupposes aux-headed pattern: the selecting auxiliary hosts tense/agreement (is the inflectional head).
Quantified Sorace bridge: composing vendlerClassToTypicalTransitivity
with canonicalSelection yields .be exactly for achievements,
.have elsewhere. This generalizes the per-datum Italian
arrivare claim to all Vendler classes, exposing the
composition canonicalSelection ∘ vendlerClassToTypicalTransitivity
as a single theorem rather than a hand-picked tuple. Falsifiable
by changing either lookup.
Italian arrivare grounds the achievement → unaccusative → BE
chain: the Selection datum independently classifies arrivare
as unaccusative, and canonicalSelection .unaccusative = .be.
Composes with sorace_canonical_chain .achievement.
Cross-framework: Miestamo 2005 (negation morpheme classification) #
@cite{miestamo-2005} classifies negation strategies by morpheme
type (NegMorphemeType: .auxVerb, .affix, .particle, ...);
@cite{anderson-2006} via @cite{heine-1993}'s grammaticalization
framework places verbal negators on the cline at .auxiliary and
non-verbal negators off the cline (Anderson §1.7.2 covers only
verbal negators). The two frameworks classify by independently-
motivated criteria but, for the strategies linglib's
NegStrategy enum exposes, AGREE on which strategies are
"verbal": Anderson's .toGramStage = some .auxiliary is exactly
Miestamo's .toNegMorphemeType = .auxVerb.
Composition with @cite{miestamo-2005}'s
verbal_constructional_always_derived (in
Linglib/Phenomena/Negation/Studies/Miestamo2005.lean) then
yields: any NegStrategy Anderson places at the auxiliary cline
stage, in any Miestamo datum showing constructional asymmetry,
has its asymmetry classified as .derived rather than
.independent — a falsifiable empirical prediction whose chain
runs Anderson's cline → Miestamo's morpheme type → Miestamo's
asymmetry source.
The earlier anderson_miestamo_agree_on_neg_morphology theorem
in this slot (0.230.573) was a stapled conjunction of three
independent facts; the meta-audit at 0.230.576 replaced it with
the quantified-equivalence shape below.
Cross-framework equivalence: Anderson's grammaticalization-cline
placement at .auxiliary and Miestamo's morpheme-type
classification as .auxVerb partition the NegStrategy enum
identically. Both frameworks classify exactly .negVerb
(Finnish ei-style inflecting negators) as the verbal subtype.
Falsifiable by changing either projection: a future split of
NegStrategy.negVerb into Miestamo-style auxVerb-vs-doubleNeg
subtypes would break this without breaking either projection
individually.