Sakha Two-Modality Case Assignment @cite{baker-vinokurova-2010} #
@cite{baker-vinokurova-2010} argue that Sakha (Turkic) requires both of the case-assignment mechanisms that linguistic theory has on offer: configurational dependent case (Marantz; @cite{marantz-1991}) for ACC and DAT, and Agree with a functional head (Chomsky; @cite{chomsky-2000}) for NOM and GEN. The two modalities are not in competition — they coexist in a single grammar:
- DAT rule (paper (4a)/(85)): if NP1 c-commands NP2 in the same VP-phase and NP2 is unmarked, value NP1 as DAT.
- ACC rule (paper (4b)/(85)): if NP1 c-commands NP2 in any phase and NP1 is unmarked, value NP2 as ACC. (4a) bleeds (4b) on the VP cycle by Elsewhere ordering.
- NOM rule (paper (5)/(86)): finite T Agrees with the highest unvalued NP visible on the CP cycle and values it NOM.
- GEN rule (paper (5)/(86)): D Agrees with the possessor inside DP
and values it GEN. (DP-internal; not modeled by the clausal
algorithm here, but parameterized via
genMode := .agreeD.)
The library's CaseSystemConfig (in DependentCase.lean) is
parameterized so each of the four structural cases gets an independent
mechanism slot. Sakha is the configuration where ACC and DAT are
dependent while NOM and GEN are Agree-based — exactly the
@cite{baker-vinokurova-2010} grammar.
Phase visibility and DOM #
NPs carry both a basePhase (where they were merged) and a
shifted flag (did they move to a higher phase before evaluation).
This captures the Phase Impenetrability Condition: an unshifted NP
inside VP is invisible on the CP cycle, so it cannot be a competitor
for the ACC rule on that cycle.
Differential Object Marking falls out: specific objects shift to the edge of VP (or beyond) and become visible on the CP cycle, where they form a competitor pair with the subject and receive ACC. Nonspecific objects stay inside VP, are invisible to T, and surface unmarked.
This file states the central derivations for monotransitives,
ditransitives, unaccusatives, and DOM. ECM and DP-level GEN are
acknowledged but not formalized; the clausal algorithm covers them
in principle but the present PhasedNP representation does not yet
distinguish embedded vs matrix domains.
Sakha's case system: accusative alignment with the @cite{baker-vinokurova-2010} two-modality split. ACC and DAT are dependent (Marantz); NOM and GEN are Agree-based (Chomsky).
Equations
- One or more equations did not get rendered due to their size.
Instances For
The two-modality thesis stated as a structural property of the config: at least one case is configurational, at least one is Agree-based. Mongolian shares the configurational ACC but uses a nonstructural DAT, so Sakha is the strictest exemplar.
A subject NP merged at the vP edge / SpecTP — visible on the CP cycle.
Equations
- Phenomena.Case.Studies.BakerVinokurova2010.subj label = { label := label, lexicalCase := none, basePhase := Syntax.Case.CasePhase.cp }
Instances For
A VP-internal NP that has shifted (specific object, raised theme).
Equations
- Phenomena.Case.Studies.BakerVinokurova2010.shiftedVP label = { label := label, lexicalCase := none, basePhase := Syntax.Case.CasePhase.vp, shifted := true }
Instances For
A VP-internal NP that has not shifted (nonspecific object).
Equations
- Phenomena.Case.Studies.BakerVinokurova2010.lowVP label = { label := label, lexicalCase := none, basePhase := Syntax.Case.CasePhase.vp }
Instances For
"Masha cake-ACC ate" with a specific object: the object shifts, competes with the subject on the CP cycle, and is valued ACC.
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Specific object surfaces with ACC, by the (4b)/(86) ACC rule on the CP cycle.
The ACC on the specific object is dependent case, not lexical or Agree-based — verifying the Marantz modality is doing the work.
Subject is valued NOM by T-Agree (paper (5)/(86)).
The NOM on the subject is the Chomskyan Agree modality, not the
Marantzian unmarked default — the central contrast with
Mongolian and the structural payoff of nomMode := .agreeT.
"Masha cake ate" with a nonspecific object: the object stays in VP and is invisible to T on the CP cycle, so the ACC rule never fires (no competitor pair). The object surfaces unmarked.
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Nonspecific object: no ACC, surfaces unmarked. PIC-driven DOM.
Subject still gets NOM by T-Agree — the Agree probe always finds the highest CP-visible unvalued NP, which is the subject in both DOM variants.
The DOM alternation: object case differs purely by whether the object has shifted out of VP, with no change to the subject. The grammar does not stipulate "specificity → ACC"; it is derived from phase visibility + the (4b) ACC rule.
The subject case is invariant across the DOM contrast — the same NOM-by-Agree applies whether the object is specific or not.
Ditransitive with a specific theme. Three NPs: subj (CP), goal (VP), theme (VP, shifted). The DAT rule fires on the VP cycle for the goal (highest of two unmarked VP-internals), bleeding ACC at that cycle by Elsewhere. The theme then competes with the subject on the CP cycle and is valued ACC.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Goal NP receives DAT by the (4a)/(85) DAT rule on the VP cycle.
Specific theme receives ACC on the CP cycle (after the goal has been valued DAT and removed from competition).
Subject receives NOM by T-Agree.
The full NOM/DAT/ACC ditransitive pattern derived in one step from
assignCasesPhased: this is the central empirical signature of
@cite{baker-vinokurova-2010}'s analysis, and it follows from the
interaction of the two modalities, not from a stipulated
case-assignment template.
Elsewhere ordering: in the ditransitive, only ONE NP gets ACC despite there being two VP-internal NPs. The (4a) DAT rule bleeds (4b) at the VP cycle.
This is the per-datum verification. The structural reason — that
applyAccRule cannot overwrite any marked NP, regardless of the
input — is applyAccRule_preserves_marked_at in
DependentCase.lean, and the full pipeline analogue is
dat_persists_through_assignCasesPhased.
Unaccusative: theme raises to SpecTP (modeled with basePhase := .cp). With a single visible NP on the CP cycle, no ACC competitor
exists; T-Agree values the theme NOM.
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
No NP receives ACC in the unaccusative — the dependent ACC rule requires two unmarked competitors and there is only one NP.
These derivations exhibit both modalities firing within a single sentence: dependent case for one NP, Agree for another. This is what forces a hybrid grammar — neither pure Marantz nor pure Chomsky covers the full Sakha pattern.
In the specific-object monotransitive, subject and object receive case from different mechanisms: NOM by Agree, ACC by dependent case.
In the ditransitive, all three modal sources are attested: Agree (subject NOM), and two dependent cases (goal DAT, theme ACC). The .lexical and .unmarked sources are absent from this derivation — they would arise for quirky-DAT subjects and for nonspecific themes respectively.
@cite{gong-2022} adopts the Sakha framework for Mongolian but
swaps datMode from .dependent to .nonstructural: Mongolian DAT
is inherent. Holding ACC, NOM, GEN modes constant and varying only
DAT, the algorithm correctly predicts that Mongolian ditransitives
have no DAT-from-the-algorithm — DAT must come pre-loaded as
lexicalCase.
The Mongolian config differs from Sakha only in datMode.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Same ditransitive input, Mongolian config: the goal is no longer valued DAT by the algorithm — it becomes an unmarked VP-internal NP, which then competes with the theme on the VP cycle for ACC.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Without dependent DAT, no NP gets DAT from the algorithm. The DAT on Mongolian goals must be supplied as inherent/lexical case at the lexicon level — exactly @cite{gong-2022}'s claim.
The Sakha vs. Mongolian contrast localizes to a single config
parameter — datMode — exactly as predicted by the parameterized
CaseSystemConfig design.
A structural payoff of distinguishing .agree from .unmarked:
the same surface case (NOM) can have two distinct sources, and the
source matters for downstream computations (visibility to higher
probes, raising-to-object, etc.). Sakha NOM is always .agree; a
default-NOM language (pure Marantz) would have it as .unmarked.
Every NOM in Sakha derivations comes from T-Agree, never from
the unmarked default. This holds across all derivations in this
file by construction (because nomMode := .agreeT), and is the
structural fingerprint of the Chomsky modality.
@cite{baker-vinokurova-2010} (23)–(24): morphological causatives in Sakha exhibit a striking cascade. The causee surfaces with ACC if the base verb is intransitive (one argumental NP in max VP, no DAT competitor) but with DAT if the base verb is transitive (two argumental NPs in max VP, (4a) fires marking the causee as DAT).
This is the cleanest test of the dependent-case modality: adding an NP (the lower theme) changes the case on a different NP (the causee), which is impossible under any version of head-driven Agree case. The algorithmic Mechanism — (4a) bleeding (4b) on the VP cycle — predicts the cascade without any additional stipulation.
(23a) "Sardaana made Aisen cry" — base verb 'cry' is intransitive. Max VP contains only the causee (Aisen). With only one argumental NP visible on the VP cycle, neither (4a) nor (4b) fires. The causee shifts to the CP phase, becomes a competitor for the causer (Sardaana), and is valued ACC by (4b) on the CP cycle.
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
(23b) "Misha made Masha eat soup" — base verb 'eat' is transitive. Max VP contains the causee (Masha) and theme (soup), both argumental. On the VP cycle, (4a) fires: Masha (the higher of the two unmarked NPs) is valued DAT, bleeding (4b). The theme then shifts to the CP phase, competes with the causer (Misha), and is valued ACC.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
The causative cascade: the same causative morpheme produces ACC on the causee in (23a) and DAT on the causee in (23b). The only difference is the transitivity of the base verb — i.e., the number of argumental NPs in max VP. This is the structural signature of dependent case.
@cite{baker-vinokurova-2010} (8)–(9): rules (4a)/(4b) only apply between argumental NPs (those bearing a θ-role w.r.t. some case- assigning head). Bare-NP adverbs like sajyn 'summer' do not count as case competitors, even when c-commanded by another caseless NP.
The PhasedNP isArgumental field captures this: when set to false,
the NP is filtered out of unmarkedVisible and so cannot trigger or
receive dependent case. The very same noun that surfaces as ACC when
functioning as the object of a transitive verb (8c) surfaces unmarked
when functioning as a temporal adverb (8a)/(8b).
Adverbial NP — bears no θ-role w.r.t. a case-assigning head.
Equations
- Phenomena.Case.Studies.BakerVinokurova2010.adverb label = { label := label, lexicalCase := none, basePhase := Syntax.Case.CasePhase.cp, isArgumental := false }
Instances For
(8a) "Bihigi beqehee ystan-nybyt" 'we yesterday jumped'. Two NPs: 'we' (subject, argumental) and 'yesterday' (adverb, non-argumental). The adverb is filtered from case competition; only one argumental NP is visible to T-Agree on the CP cycle.
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
The adverb is not marked ACC: rule (4b) does not see it as a case competitor. The subject is valued NOM by T-Agree, and the adverb falls through to the default sweep with unmarked NOM.
(8c) "Masha sajyn-y axt-ar" 'Masha summer-ACC misses'. Same noun 'summer', now functioning as the object of transitive 'miss' — it bears a θ-role and so counts as argumental. Now (4b) applies and the object is marked ACC, exactly the contrast (8a/b vs 8c).
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
The argumental contrast: the very same lexical noun receives ACC
when argumental and unmarked NOM when adverbial. The grammar does
not stipulate that 'summer' is ambiguous between an argumental
and an adverbial entry; both readings reduce to a single algorithm
parameterized by the isArgumental feature.
Replacing the trivial two_modalities_present (which only restated
the config) with a theorem that no single modality — pure Marantz
or pure Chomsky — can produce the Sakha derivational pattern. The
two-modality grammar is empirically required, not just stipulated.
Pure Marantz (Sakha pattern with NOM as unmarked default and no Agree-based case): all structural cases are configurational.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pure Chomsky: every structural case assigned by Agree with a
functional head. v-Agree marks the lowest CP-visible argumental
NP as ACC; T-Agree marks the highest as NOM; D-Agree marks DP-
internals as GEN; DAT is purely lexical/inherent
(.nonstructural). This is the standard
@cite{chomsky-2000}/@cite{chomsky-2001} configuration.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Pure Marantz produces the same surface NOM on the subject as Sakha
— but with source := .unmarked, not .agree. The structural
fingerprint of the modality differs even when the morphology
coincides.
Pure Chomsky has datMode := .nonstructural, so the algorithm
never derives DAT — Mongolian-style, every DAT must be lexical.
This contradicts the Sakha pattern where DAT is productive on
structural goals.
Pure Chomsky's v-Agree fires under accMode := .agreeV and marks
the theme ACC via Agree (not via the dependent rule). This makes
the source-distinction operative, not just the case-distinction:
Sakha derives ACC via .dependent, pure Chomsky via .agree.
The causative cascade is the canonical wedge against any
pure-Agree theory of structural case. Adding the lower theme to
a transitive-base causative changes the case on the causee
(ACC → DAT) — but no head's Agree relation has been altered,
only the count of NPs in max VP. Pure Chomsky predicts the
causee in (23b) surfaces with NOM-by-default, exactly because
its v-Agree probe targets the theme; the (4a)/dependent-DAT
rule cannot apply without datMode := .dependent.
The strong two-modality theorem: neither pure modality derives
the Sakha pattern. Pure Marantz fails the NOM-as-Agree fingerprint
(subject NOM source = .unmarked). Pure Chomsky fails on DAT —
not just on the ditransitive but, more sharply, on the causative
cascade where adding an NP changes another NP's case in a way no
Agree relation can mediate. The two-modality grammar is
required, not stipulated.
@cite{baker-vinokurova-2010} (5)/(86): D Agrees with the
possessor inside DP and values it GEN. The clausal cycles see the DP
as opaque — its possessor is filtered out of unmarkedVisible by
the inDP flag — and applyGenAgree runs as the DP-internal
counterpart to T-Agree. This is the second Agree-modality slot
(genMode := .agreeD) that distinguishes the Sakha grammar from a
purely Marantzian one.
A DP-internal possessor: opaque to clause-level case competition but valued GEN by D-Agree.
Equations
- Phenomena.Case.Studies.BakerVinokurova2010.possessor label = { label := label, lexicalCase := none, basePhase := Syntax.Case.CasePhase.cp, inDP := true }
Instances For
"Aisen's house [is in town]" — the matrix subject is a DP whose
possessor aisen is valued GEN by D-Agree. The possessor is
invisible to clausal probes; the head noun (house) is the
subject of T-Agree and surfaces NOM.
Equations
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
The GEN possessor is invisible to (4b)/ACC: in a transitive with a possessed object, the head noun is what receives ACC, not the possessor (which is busy being valued GEN inside its DP).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
The two Agree modalities (T → NOM, D → GEN) coexist with the two dependent modalities (DAT, ACC) in a single derivation — the strongest empirical demonstration of the four-slot parameterization at work.
The phased algorithm is total on every Sakha derivation in this
file: each input NP appears in the output with exactly one case.
Follows from assignCasesPhased_length in DependentCase.lean.