[TG19]: The Language of Generalization #

Psychological Review, 126(3), 395–436.

Core Insight #

Generics ("Robins lay eggs") use the SAME uncertain threshold semantics as gradable adjectives. The scale is prevalence rather than height/degree:

⟦gen⟧(p, θ) = 1 if prevalence p > threshold θ, and 0 otherwise

This IS positiveMeaning from Degree — the generic meaning is grounded in scalar adjective semantics by construction, not by bridge theorem.

Model #

Interpretation model (L0, Eq. 1): L(p, θ | u) ∝ δ_{⟦u⟧(p,θ)} · P(θ) · P(p)

Endorsement model (S1, Eq. 3): S(u | p) ∝ (∫_θ L(p, θ | u) dθ)^λ

The threshold θ is marginalized BEFORE exponentiation (matching the paper). With N discrete thresholds, the marginalized L0 is: L0(p | generic) ∝ P(p) · |{θ : p > θ}| = P(p) · p.toNat
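The counting step can be checked directly. The sketch that follows is illustrative Python, not the formalization's Lean code. It assumes prevalence bins indexed k = 0..20 and a threshold grid θ = 0..19 (a hypothetical choice, picked so that the top bin clears every threshold and the bottom bin clears none). The names generic_true and threshold_count are invented for the example.

```python
N_BINS = 21                      # bins k = 0..20 stand for 0%, 5%, ..., 100%
THRESHOLDS = range(N_BINS - 1)   # assumed threshold grid: θ = 0..19


def generic_true(k: int, theta: int) -> bool:
    """Threshold semantics: the generic holds iff prevalence exceeds θ."""
    return k > theta


def threshold_count(k: int) -> int:
    """|{θ : k > θ}|, the marginalized generic meaning at bin k."""
    return sum(1 for theta in THRESHOLDS if generic_true(k, theta))


# The count equals the bin index itself, which is what p.toNat records.
counts = [threshold_count(k) for k in range(N_BINS)]
```

Because the count is just the bin index, the threshold never needs to be represented explicitly once it has been summed out.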

This analytical marginalization eliminates the latent variable entirely. The model is the mathlib-PMF RSA pipeline ([FG12]). The literal listener L0gen prior u : PMF Prevalence normalises the marginalized meaning meaningE prior u p = P(p) · |{θ : ⟦u⟧(p,θ)}| (lifted to ℝ≥0∞), and the speaker S1gen prior p : PMF Utterance is RSA.S1Belief with α = 1 and zero cost. Each prediction is one application of S1Belief_apply_lt_iff_score_lt (the rsa simp set): the normalizing constant (partition function) cancels, leaving an L0 comparison that reduces to the cue validity test p.toNat > E[k | prior]. The prior expectation is the shared PMF.condExpect of the silent listener's posterior (expectedBin_eq_condExpect).
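The reduction to a cue validity test can be illustrated numerically. This is a minimal Python sketch, not the Lean pipeline. It assumes the silent utterance is true at every threshold (so its marginalized meaning is constant), takes α = 1 with zero cost, and uses hypothetical helper names (l0, endorsed, expected_bin) and a made-up prior.

```python
from fractions import Fraction


def l0(prior, count):
    """Literal listener: normalise prior[k] * count(k) over bins."""
    scores = [Fraction(w) * count(k) for k, w in enumerate(prior)]
    total = sum(scores)
    return [s / total for s in scores]


def endorsed(prior, k_ref):
    """S1(generic | k_ref) > S1(silent | k_ref) with α = 1 and zero cost.

    Both S1 values share one normaliser, so comparing L0 scores suffices.
    """
    l0_generic = l0(prior, lambda k: k)  # marginalized generic: count is k
    l0_silent = l0(prior, lambda k: 1)   # assumed: silence is always true
    return l0_generic[k_ref] > l0_silent[k_ref]


def expected_bin(prior):
    """E[k | prior], the silent listener's expected bin index."""
    return Fraction(sum(k * w for k, w in enumerate(prior)), sum(prior))


# A made-up, strictly positive prior over 21 bins, for illustration only.
toy_prior = [21 - k for k in range(21)]
boundary = expected_bin(toy_prior)
agree = all(endorsed(toy_prior, k) == (k > boundary) for k in range(21))
```

With exact fractions the two tests agree on every bin that has nonzero prior mass. A bin with zero prior mass gives both utterances an L0 score of zero, so the strict comparison fails there regardless of k.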

Parameters #

All parameters from the paper's code (analysis/model-simulations.Rmd, exampleParameters list, GitHub: mhtess/genlang-paper):

α = 2 in the paper (experimental fit: 2.47). We use α = 1, since the binary comparison S1(generic) > S1(silent) is α-invariant for α > 0.
Bins: the paper uses 98 bins (0.01–0.98); we use 21 bins (0%, 5%, ..., 100%) for exact rational arithmetic. Qualitative predictions are preserved.
Null component: Beta(1, 50)
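The α-invariance holds because x ↦ x^α is strictly increasing on the positive reals for any α > 0, so raising both L0 scores to the same power cannot flip their order. A small Python illustration for a two-utterance speaker follows; the function name s1_generic and the input scores are hypothetical, chosen only for the demonstration.

```python
def s1_generic(l0_generic: float, l0_silent: float, alpha: float) -> float:
    """Two-utterance softmax speaker with zero cost: P(generic | p)."""
    g = l0_generic ** alpha
    s = l0_silent ** alpha
    return g / (g + s)


# Illustrative L0 scores (not taken from the paper).
cases = [(0.30, 0.10), (0.02, 0.05)]
alphas = [1.0, 2.0, 2.47]

# For each case, the endorsement decision is the same at every α.
decisions = [[s1_generic(g, s, a) > 0.5 for a in alphas] for g, s in cases]
```

Larger α sharpens the endorsement probability toward 0 or 1 but leaves the side of 0.5 it falls on unchanged.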

Property	Stable Beta	φ (mixture weight)	Reference prevalence	Paper endorsement
bark	Beta(5,1)	0.4	95%	0.88
hasSpots	Beta(5,1)	0.7	10%	0.02
dontEatPeople	Beta(10,1)*	1.0	80%	0.41
laysEggs	Beta(10,10)	0.2	50%	0.95
isFemale	Beta(10,10)	1.0	50%	0.50
carriesMalaria	Beta(1,30)	0.1	10%	0.97

*Paper uses Beta(50,1); we use Beta(10,1) for tractable arithmetic (avoids k^49 terms). Both give the same qualitative prediction.

Prior Model #

Prevalence priors are mixtures of two Beta distributions (Figure 2): P(p) = φ · Beta_stable(p) / Z_s + (1-φ) · Beta_null(p) / Z_n

where φ is the probability a category has the stable causal mechanism, Beta_stable varies per property, and Beta_null = Beta(1,50) for all properties (representing categories lacking the property mechanism).

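Each component is NORMALIZED before mixing (matching the WebPPL code, which uses categorical to normalize each component independently). We achieve this without ℚ division by multiplying the mixture through by Z_s · Z_n: P(p) ∝ φ · BW_s(p) · Z_n + (1-φ) · BW_n(p) · Z_s, where BW_s(p) and BW_n(p) are the unnormalized weights of the stable and null components (Beta_stable(p) and Beta_null(p) above) and Z_s, Z_n are their sums over bins.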
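The division-free form can be sketched in Python with plain integers. The assumptions here are mine, not the source's: bin k stands for prevalence k/20, the unnormalized Beta(a, b) weight is scaled to the integer k^(a-1) · (20-k)^(b-1), and the endpoint convention 0^0 = 1 is used. The helper names beta_weights and mixture_weights are hypothetical.

```python
from fractions import Fraction

N = 20  # top bin index; bin k stands for prevalence k / 20


def beta_weights(a: int, b: int) -> list[int]:
    """Unnormalized Beta(a, b) weight per bin, scaled to integers.

    Python evaluates 0 ** 0 as 1, so Beta(a, 1) keeps mass at k = N.
    """
    return [k ** (a - 1) * (N - k) ** (b - 1) for k in range(N + 1)]


def mixture_weights(a: int, b: int, phi: Fraction, null=(1, 50)) -> list[int]:
    """Integer weights proportional to the normalized-component mixture."""
    bw_s, bw_n = beta_weights(a, b), beta_weights(*null)
    z_s, z_n = sum(bw_s), sum(bw_n)
    num, den = phi.numerator, phi.denominator
    # φ·BW_s·Z_n + (1-φ)·BW_n·Z_s, with φ = num/den cleared of denominators
    return [num * s * z_n + (den - num) * n * z_s for s, n in zip(bw_s, bw_n)]


phi = Fraction(2, 5)
weights = mixture_weights(5, 1, phi)   # the "bark" row: Beta(5,1), φ = 0.4
```

Dividing each weight by sum(weights) recovers the mixture of independently normalized components, so the integer form changes only the overall scale.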

Verified Predictions #

#	Finding	Prior	p_ref	Theorem
1	"Dogs bark" endorsed	bark	95%	`bark_endorsed`
2	"Kangaroos have spots" NOT endorsed	hasSpots	10%	`spots_not_endorsed`
3	"Sharks don't eat people" NOT endorsed	dontEatPeople	80%	`dontEatPeople_not_endorsed`
4	"Robins lay eggs" endorsed despite 50%	laysEggs	50%	`laysEggs_endorsed`
5	"Robins are female" borderline at 50%	isFemale	50%	`isFemale_borderline`
6	"Mosquitos carry malaria" endorsed at 10%	carriesMalaria	10%	`malaria_endorsed`
7	Max prevalence satisfies all thresholds	—	—	`generic_top_true`
8	Zero prevalence fails all thresholds	—	—	`generic_zero_false`
9	Only rareWeak endorsed at 20%	all four causal	20%	`causal_20pct_pattern`
10	3/4 causal conditions endorsed at 70%	all four causal	70%	`causal_70pct_pattern`
11	Endorsement ⟺ exceeds E[k | prior]	—	—
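Findings 1 to 6 can be reproduced outside Lean with a short Python check. This is a sketch under stated assumptions, not the verified proofs: bin k stands for prevalence k/20, Beta weights are discretized as k^(a-1) · (20-k)^(b-1) with 0^0 = 1, endorsement is the test k_ref > E[k | prior], and "borderline" is read as k_ref = E[k | prior]. The parameters are copied from the table in the Parameters section; all function names are hypothetical.

```python
from fractions import Fraction

N = 20  # bins k = 0..20 stand for 0%, 5%, ..., 100%


def beta_weights(a, b):
    """Unnormalized, integer-scaled Beta(a, b) weight for each bin."""
    return [k ** (a - 1) * (N - k) ** (b - 1) for k in range(N + 1)]


def prior(a, b, phi, null=(1, 50)):
    """Mixture of independently normalized stable and null components."""
    bw_s, bw_n = beta_weights(a, b), beta_weights(*null)
    z_s, z_n = sum(bw_s), sum(bw_n)
    return [phi * Fraction(s, z_s) + (1 - phi) * Fraction(n, z_n)
            for s, n in zip(bw_s, bw_n)]


def expected_bin(p):
    """E[k | prior] for a normalized prior p."""
    return sum(k * w for k, w in enumerate(p))


# property: (stable a, stable b, φ, reference bin = reference prevalence · 20)
ROWS = {
    "bark": (5, 1, Fraction(2, 5), 19),
    "hasSpots": (5, 1, Fraction(7, 10), 2),
    "dontEatPeople": (10, 1, Fraction(1), 16),
    "laysEggs": (10, 10, Fraction(1, 5), 10),
    "isFemale": (10, 10, Fraction(1), 10),
    "carriesMalaria": (1, 30, Fraction(1, 10), 2),
}

verdicts = {}
for name, (a, b, phi, k_ref) in ROWS.items():
    boundary = expected_bin(prior(a, b, phi))
    if k_ref > boundary:
        verdicts[name] = "endorsed"
    elif k_ref == boundary:
        verdicts[name] = "borderline"
    else:
        verdicts[name] = "not endorsed"
```

Under these assumptions the verdicts line up with findings 1 to 6: the isFemale prior is symmetric, so its expected bin is exactly 10 and the 50% reference sits on the boundary.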


Mixture-of-Betas infrastructure #

Endorsement model (Eq. 3) #

Analytical endorsement condition #

The endorsement boundary is the silent listener's expected prevalence #

Symmetric priors put the endorsement boundary at the centre #

Case Study 2: Habitual Language #

Case Study 3: Causal Language #

Cue Validity and Endorsement #

Unified Architecture #