[VTVMZG16] — Scalar Diversity #
Theory-neutral empirical data and argumentation chain from [VTVMZG16].
Central Question #
Do all scalar expressions yield scalar implicatures at comparable rates? The "uniformity assumption" — implicit in decades of SI research focused on ⟨some, all⟩ and ⟨or, and⟩ — predicts yes.
Argumentative Structure #
Scalar diversity is real (Exps 1–2, §2): SI rates vary continuously from 4% (⟨content, happy⟩) to 100% (⟨cheap, free⟩) across 43 scales. Experiment 1 (N=25) uses neutral content (pronouns); Experiment 2 (N=29) uses non-neutral content (full NPs). Results correlate highly (r=.91), confirming robustness to sentential context.
Availability does not explain diversity (Exp 3, §4): Four operationalisations of scale availability all fail to predict SI rates:
- Association strength (modified cloze task, Exp 3, N=60): β=0.16, n.s.
- Grammatical class (open vs closed): β=−0.38, n.s.
- Relative word frequency (corpus log-ratio): β=−0.15, n.s.
- Semantic relatedness (LSA cosine): β=0.01, n.s.
Distinctness does explain diversity (Exp 4, §5): Two measures of how easy it is to distinguish scalemates both predict SI rates:
- Semantic distance (7-point rating, Exp 4, N=24): β=0.65, p=.018
- Boundedness (stronger term is endpoint): β=−1.87, p<.001
Combined model (§6, Table 5): Mixed model with all six predictors explains R²=0.52 of variance (0.22 fixed effects, 0.30 random). Of the fixed-effects variance, boundedness alone accounts for 10.8%, distance for 2.7%, and all four availability measures combined for <1%.
Remaining variance (§6): ~78% of variance is unexplained, suggesting item-specific statistical learning from language use — hearers track frequencies of upper-bounding inferences for individual scales and use Gricean reasoning to combine prior likelihoods with the current context.
Grammatical category of a scale. Van Tiel et al. distinguish open vs closed classes for availability hypothesis.
- adjective : GrammaticalClass
- adverb : GrammaticalClass
- auxiliaryVerb : GrammaticalClass
- mainVerb : GrammaticalClass
- quantifier : GrammaticalClass
Instances For
Equations
- VanTielEtAl2016.instDecidableEqGrammaticalClass x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Whether a scale is "open" or "closed" class. Closed class = smaller search space for alternatives (predicted more available).
Equations
Instances For
Complete data for each scale tested in [VTVMZG16].
Fields capture all predictors tested in the paper:
- SI rates from Experiments 1 and 2
- Availability measures (cloze, frequency, LSA)
- Distinctness measures (semantic distance, boundedness)
- weakerTerm : String
Weaker scalar term
- strongerTerm : String
Stronger scalar term
- category : GrammaticalClass
Grammatical category
- siRateExp1 : ℕ
SI rate in Experiment 1 (neutral content, %)
- siRateExp2 : ℕ
SI rate in Experiment 2 (non-neutral content, %)
- clozeNeutral : Option ℕ
Cloze task: % mentioning stronger term (Exp3, neutral, lenient)
- clozeNonNeutral : Option ℕ
Cloze task: % mentioning stronger term (Exp3, non-neutral, lenient)
- freqRatio : Option ℚ
Log ratio of weaker/stronger term frequencies
- lsaRelatedness : Option ℚ
LSA semantic relatedness (0-1)
- semanticDistance : ℚ
Mean semantic distance rating (1-7 scale, Exp4)
- bounded : Bool
Whether stronger term denotes an endpoint (bounded scale)
Instances For
Equations
- VanTielEtAl2016.instReprScaleDatum = { reprPrec := VanTielEtAl2016.instReprScaleDatum.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨cheap, free⟩ - highest SI rate (100%)
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨sometimes, always⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨some, all⟩ - the "workhorse" of SI research
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨possible, certain⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨may, will⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨difficult, impossible⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨rare, extinct⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨may, have to⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨warm, hot⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨few, none⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨low, depleted⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨hard, unsolvable⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨allowed, obligatory⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨try, succeed⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨palatable, delicious⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨memorable, unforgettable⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨like, love⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨good, perfect⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨good, excellent⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨cool, cold⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨hungry, starving⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨adequate, good⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨unsettling, horrific⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨dislike, loathe⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨believe, know⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨start, finish⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨participate, win⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨wary, scared⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨old, ancient⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨big, enormous⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨snug, tight⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨attractive, stunning⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨special, unique⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨pretty, beautiful⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨intelligent, brilliant⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨funny, hilarious⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨dark, black⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨small, tiny⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨ugly, hideous⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨silly, ridiculous⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨tired, exhausted⟩
Equations
- One or more equations did not get rendered due to their size.
Instances For
⟨content, happy⟩ - lowest SI rate (4%)
Equations
- One or more equations did not get rendered due to their size.
Instances For
All 43 scales tested in [VTVMZG16]
Equations
- One or more equations did not get rendered due to their size.
Instances For
Number of scales tested
Equations
Instances For
Bounded scales
Equations
- VanTielEtAl2016.boundedScales = List.filter (fun (x : VanTielEtAl2016.ScaleDatum) => x.bounded) VanTielEtAl2016.allScales
Instances For
Non-bounded scales
Equations
- VanTielEtAl2016.nonBoundedScales = List.filter (fun (x : VanTielEtAl2016.ScaleDatum) => !x.bounded) VanTielEtAl2016.allScales
Instances For
Number of bounded scales
Equations
Instances For
Number of non-bounded scales
Equations
Instances For
SI rates span 96 percentage points: 4% to 100%. ⟨content, happy⟩ is the floor, ⟨cheap, free⟩ the ceiling.
Bounded scales yield far more SIs than non-bounded scales.
Total SI rate (Exp 1) across 21 bounded scales: 1287%. Total SI rate (Exp 1) across 22 non-bounded scales: 465%. Even though bounded scales have fewer items, their total is nearly 3× higher. The paper reports mean bounded ≈ 62% vs mean non-bounded ≈ 25%.
⟨some, all⟩ — the "workhorse" of SI research — sits near the top at 96%, far above the mean. Generalizing from ⟨some, all⟩ to all scales is unjustified.
In this sample, every closed-class scale is also bounded. This confound partially explains the nonsignificant grammatical-class effect: closed-class scales look high-SI because they're all bounded, not because the search space for alternatives is smaller.
Experiments 1 and 2 agree directionally: no scale reverses from high to low or vice versa (defined as >50% in one experiment and <15% in the other).
Explaining diversity: distinctness, not availability #
The mixed model (Table 5) regressed SI rates on six predictors. Of these, only
the two distinctness measures are significant: semantic distance (β = 0.65,
SE = 0.27, Z = 2.36, p = .018, R² = .027) and boundedness (β = −1.87, SE = 0.40,
Z = −4.72, p < .001, R² = .108 — the negative sign reflects bounded = 1 against a
"no" = 1 dependent variable, i.e. bounded scales project more). The four
availability measures are all null: association strength (β = 0.16, p = .611),
grammatical class (β = −0.38, p = .606), relative word frequency (β = −0.15,
p = .461), and LSA relatedness (β = 0.01, p = .355). The full model explains
R² = .52 (.22 fixed); boundedness alone accounts for ≈ 10× the variance of all
availability measures combined. Effect sizes stay in prose; the qualitative
content is carried structurally by bounded_total_exceeds_nonbounded (the
dominant distinctness factor, read directly off the SI data) and
closed_class_subsumes_bounded (why the grammatical-class availability measure is
confounded out once boundedness is in the model).
Following [Soa82] and [Sau04], a scalar inference from φ[α] to ¬φ[β] is computed in two steps. The primary step yields that the speaker does not believe the stronger alternative (¬Bel_S φ[β]). A competence assumption — the speaker is opinionated about φ[β] (Bel_S φ[β] ∨ Bel_S ¬φ[β]) — upgrades this to the scalar inference Bel_S ¬φ[β]. Scalar diversity is then variation in whether the competence step fires: it is better warranted when the scalemates are distinct (bounded or semantically distant), so that the speaker is plausibly opinionated about the stronger term — which is exactly why distinctness, not availability, predicts the rates.
The two-stage (epistemic) model: the primary inference together with the competence assumption yields the scalar inference. The conclusion is derived from the premises, not stipulated.
Competence is load-bearing: the primary step alone leaves the stronger alternative epistemically open (the speaker may be agnostic), so no scalar inference follows. Cross-scale variation in this step is where diversity lives.