Bach, Brown & Marslen-Wilson (1986) #
@cite{bach-brown-marslen-wilson-1986}
Crossed and Nested Dependencies in German and Dutch: A Psycholinguistic Study. Language and Cognitive Processes, 1(4), 249–262.
Core Finding #
Dutch crossed verb-cluster dependencies (NP₁ NP₂ NP₃ V₁ V₂ V₃) are easier to process than German nested dependencies (NP₁ NP₂ NP₃ V₃ V₂ V₁) at two or more levels of embedding, in both comprehensibility ratings and comprehension accuracy. At one level of embedding (Level 2), German/Participle does not differ from Dutch, though German/Infinitive shows a significant baseline disadvantage across all levels. This confirms @cite{evers-1975}'s intuition that crossed dependencies are easier, with the first controlled experimental evidence.
Incremental Integration Model #
The paper argues qualitatively that crossed dependencies allow incremental
top-down integration while nested dependencies force bottom-up accumulation of
floating propositions. We formalize this via totalIntegrationCost: the
cumulative count of NPs awaiting matrix-connected integration during verb-cluster
processing. This metric is our formalization, not the paper's — they argue
informally about when partial interpretations become available.
The integration model is derived from the VerbClusterBinding permutation:
NP_i is matrix-integrated after k verbs iff max(σ(0), σ(i)) < k — all
verbs in the control chain from the bound verb to the matrix verb have been
heard. For identity (crossed), σ(0) = 0 so integration is min(k, n). For
reverse (nested), σ(0) = n−1 so nothing integrates until all verbs are heard.
The cost ratio nested/crossed is exactly 2 for all n, but the absolute difference n(n−1)/2 grows quadratically — consistent with the finding that the processing difference is undetectable at n=2 (gap = 1) but large at n=3 (gap = 3).
Dependency Length Invariance #
Crossed and nested patterns have identical total NP-verb dependency length (n²). This means the Bach et al. finding cannot be explained by dependency length minimization alone — the advantage of crossed dependencies is about when information becomes available for matrix integration, not about dependency distance.
Formal–Processing Dissociation #
Crossed dependencies require mildly context-sensitive power (@cite{shieber-1985}, @cite{bresnan-etal-1982}) while nested dependencies are context-free, yet crossed is psycholinguistically easier. This refutes models where parsing difficulty tracks the Chomsky hierarchy and provides evidence against push-down-store models of human parsing (@cite{evers-1975}).
Cumulative unintegrated NPs across verb positions 1..n.
Crossed: (n−1) + (n−2) + ··· + 0 = n(n−1)/2 Nested: n + n + ··· + n + 0 = n(n−1)
Equations
- BachBrownMarslenWilson1986.totalIntegrationCost σ = ∑ k ∈ Finset.range n, σ.unintegratedCount (k + 1)
Instances For
Level 2 (n=2): minimal gap (1 vs 2).
Level 3 (n=3): gap widens (3 vs 6).
Level 4 (n=4): gap widens further (6 vs 12).
The absolute cost gap grows with embedding depth.
Crossed is strictly cheaper for n ≥ 2.
Proof by element-wise comparison via Finset.sum_lt_sum: at each verb
position k ∈ {1,…,n}, unintegratedCount (identity n) ≤ unintegratedCount
(reverse n), with strict inequality at k = 1 (the first verb heard).
Total NP-verb dependency length across all n pairs.
Equations
- BachBrownMarslenWilson1986.totalNPVerbDist σ = ∑ i ∈ Finset.range n, if hi : i < n then Features.VerbClusterBinding.npVerbDist n σ ⟨i, hi⟩ else 0
Instances For
General case: both patterns yield total distance n².
Crossed dependencies are formally harder (mildly context-sensitive) but psycholinguistically easier — formal complexity ≠ processing complexity.
Two independent arguments against PDA parsing:
- Dutch is comprehensible at Level 2 despite requiring MCS power (a PDA cannot handle crossed deps at any depth)
- Dutch is easier than German at Level 3+ (a PDA predicts nested should be easier or equal)
Integration cost difference is NOT explained by dependency length.
Equations
- BachBrownMarslenWilson1986.instDecidableEqLangGroup x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Test sentence comprehensibility ratings × 100 (Table 1). Original scale: 1 = easy, 9 = hard. Levels 1–4 indexed 0–3.
Equations
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.dutch 0 = 114
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.dutch 1 = 234
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.dutch 2 = 542
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.dutch 3 = 766
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 109
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 277
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.germanInf 2 = 616
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.germanInf 3 = 826
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 124
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 263
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.germanPart 2 = 581
- BachBrownMarslenWilson1986.testRating BachBrownMarslenWilson1986.LangGroup.germanPart 3 = 766
Instances For
Paraphrase sentence ratings × 100 (Table 1, Levels 2–4 indexed 0–2). Paraphrases express the same propositions using right-branching structure, controlling for propositional complexity.
Equations
- BachBrownMarslenWilson1986.paraRating BachBrownMarslenWilson1986.LangGroup.dutch 0 = 211
- BachBrownMarslenWilson1986.paraRating BachBrownMarslenWilson1986.LangGroup.dutch 1 = 406
- BachBrownMarslenWilson1986.paraRating BachBrownMarslenWilson1986.LangGroup.dutch 2 = 594
- BachBrownMarslenWilson1986.paraRating BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 202
- BachBrownMarslenWilson1986.paraRating BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 413
- BachBrownMarslenWilson1986.paraRating BachBrownMarslenWilson1986.LangGroup.germanInf 2 = 589
- BachBrownMarslenWilson1986.paraRating BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 236
- BachBrownMarslenWilson1986.paraRating BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 385
- BachBrownMarslenWilson1986.paraRating BachBrownMarslenWilson1986.LangGroup.germanPart 2 = 562
Instances For
Comprehension accuracy × 100 for Test sentences (Table 3). Questions tested whether each subject NP was correctly associated with its predicate verb phrase. Levels 2–3 indexed 0–1.
Equations
- BachBrownMarslenWilson1986.testComprehension BachBrownMarslenWilson1986.LangGroup.dutch 0 = 168
- BachBrownMarslenWilson1986.testComprehension BachBrownMarslenWilson1986.LangGroup.dutch 1 = 117
- BachBrownMarslenWilson1986.testComprehension BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 173
- BachBrownMarslenWilson1986.testComprehension BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 89
- BachBrownMarslenWilson1986.testComprehension BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 158
- BachBrownMarslenWilson1986.testComprehension BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 79
Instances For
Comprehension accuracy × 100 by NP position at Level 3, Test (Table 4). NP1 = matrix subject (highest clause), NP3 = most deeply embedded.
Equations
- BachBrownMarslenWilson1986.comprehensionByNP BachBrownMarslenWilson1986.LangGroup.dutch 0 = 117
- BachBrownMarslenWilson1986.comprehensionByNP BachBrownMarslenWilson1986.LangGroup.dutch 1 = 83
- BachBrownMarslenWilson1986.comprehensionByNP BachBrownMarslenWilson1986.LangGroup.dutch 2 = 150
- BachBrownMarslenWilson1986.comprehensionByNP BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 88
- BachBrownMarslenWilson1986.comprehensionByNP BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 67
- BachBrownMarslenWilson1986.comprehensionByNP BachBrownMarslenWilson1986.LangGroup.germanInf 2 = 112
- BachBrownMarslenWilson1986.comprehensionByNP BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 102
- BachBrownMarslenWilson1986.comprehensionByNP BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 38
- BachBrownMarslenWilson1986.comprehensionByNP BachBrownMarslenWilson1986.LangGroup.germanPart 2 = 97
Instances For
Test−Paraphrase error rate difference × 100 by NP at Level 3 (Table 5). Higher = more syntactic disruption (Test harder relative to Paraphrase).
Equations
- BachBrownMarslenWilson1986.errorDiffByNP BachBrownMarslenWilson1986.LangGroup.dutch 0 = 11
- BachBrownMarslenWilson1986.errorDiffByNP BachBrownMarslenWilson1986.LangGroup.dutch 1 = 59
- BachBrownMarslenWilson1986.errorDiffByNP BachBrownMarslenWilson1986.LangGroup.dutch 2 = 0
- BachBrownMarslenWilson1986.errorDiffByNP BachBrownMarslenWilson1986.LangGroup.germanInf 0 = 32
- BachBrownMarslenWilson1986.errorDiffByNP BachBrownMarslenWilson1986.LangGroup.germanInf 1 = 91
- BachBrownMarslenWilson1986.errorDiffByNP BachBrownMarslenWilson1986.LangGroup.germanInf 2 = 41
- BachBrownMarslenWilson1986.errorDiffByNP BachBrownMarslenWilson1986.LangGroup.germanPart 0 = 31
- BachBrownMarslenWilson1986.errorDiffByNP BachBrownMarslenWilson1986.LangGroup.germanPart 1 = 67
- BachBrownMarslenWilson1986.errorDiffByNP BachBrownMarslenWilson1986.LangGroup.germanPart 2 = 36
Instances For
At Level 2, German/Participle does not differ from Dutch (spread = 29). German/Infinitive is slightly worse throughout (spread = 43). The paper reports a significant overall Ger/Inf disadvantage but no difference for Ger/Part vs Dutch at Level 2.
At Level 3, Dutch rates Test sentences as easier than both German groups.
At Level 3, Dutch comprehension accuracy exceeds both German groups.
The syntactic complexity effect (Test − Paraphrase) grows faster for both German groups than Dutch from Level 2 to Level 3, paralleling the model's prediction that the integration cost gap grows with depth.
NP2 (middle NP) is hardest for all three groups (Table 4, Test). This is an interference effect: NP2 is distinguished by neither primacy (NP1) nor recency (NP3), making it hardest to retrieve regardless of the dependency pattern.
Dutch advantage is largest for NP3 (most deeply embedded clause).
Dutch shows ZERO Test−Para error for NP3 (errorDiffByNP .dutch 2 = 0), while both German groups show substantial error (41, 36). The paper explains: in Dutch, NP3's verb (V₃) arrives last and integrates into an already-built matrix structure. In German, NP3's verb (V₃) arrives first — the proposition is immediately parseable but floats without a matrix root, so the information decays before it can be used.
A push-down automaton (PDA) parsing model predicts that difficulty should track the Chomsky hierarchy: context-free patterns (nested) should be easier than mildly-context-sensitive patterns (crossed), because PDAs can parse the former but not the latter (@cite{evers-1975}).
The data refutes this: crossed is empirically easier at n ≥ 3. This is the paper's central argument against stack-based parsing.
The model predicts crossed < nested, the data confirms it, and dependency length cannot explain the difference.