[FG12] reference game (Measure/Kernel-native) #
"Predicting Pragmatic Reasoning in Language Games", Science 336, 998.
The Rational Speech Act model of the paper, on the Measure/Kernel-native analytic
foundation of Pragmatics/RSA/Gibbs.lean. The informative speaker (the paper's
eq. 2) is a MeasureTheory.Measure.tilted Gibbs measure: Measure.count
restricted to the applicable utterances W(r), tilted by the surprisal utility
score w = log (1 / |w|). The closed form is exactly
S₁(w | r) ∝ |w|⁻¹ over w ∈ W(r),
[FG12]'s eq. (2). The architectural content — the speaker is a Gibbs
measure, monotone in utility, and the rational optimizer of expected-utility-minus-
KL (RSA.Gibbs.speaker_isGreatest) — is the substrate; here it is instantiated at
the paper's stimulus (Fig. 1A).
The pragmatic listener (eq. 1) is the Bayesian posterior of the speaker against the
salience prior, RSA.Gibbs.listener; its pragmatic inferences are driven by the
speaker asymmetries proved below (narrowing, unique reference). Empirical fit
(speaker r = 0.98, listener r = 0.99) is reported in the paper, not as a
theorem here.
Main statements #
prefers_informative— the speaker prefers the uniquely-identifyingcircleover the ambiguousbluefor the target (Fig. 1A);prefers_informative_alphashows this holds at every rationalityα > 0, andfully_rational_picks_circlethat theα → ∞speaker concentrates all its mass oncircle(consumingRSA.Gibbs.speakerAlphaand its zero-temperature limit).size_principle— generally, the speaker prefers the smaller-extension applicable utterance.narrowing_blue/narrowing_square— pragmatic narrowing: the speaker is less likely to use an ambiguous word at a referent that has a uniquely-identifying alternative; this asymmetry is what lets the listener narrow.unique_green/unique_circle— unique reference: a uniquely-applying word gets zero mass where it does not apply, so the listener recovers the referent.
Stimulus (Fig. 1A) #
Three objects and four feature-words. Two features (green, circle) are uniquely
identifying; two (blue, square) are ambiguous between two objects each.
Equations
- FrankGoodman2012.instDecidableEqObject x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- FrankGoodman2012.instReprObject = { reprPrec := FrankGoodman2012.instReprObject.repr }
Equations
Equations
- FrankGoodman2012.instDecidableEqFeature x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- FrankGoodman2012.instReprFeature = { reprPrec := FrankGoodman2012.instReprFeature.repr }
Equations
Equations
The denotation matrix: which feature applies to which object.
Equations
- FrankGoodman2012.appliesTo FrankGoodman2012.Feature.blue FrankGoodman2012.Object.blueSquare = true
- FrankGoodman2012.appliesTo FrankGoodman2012.Feature.blue FrankGoodman2012.Object.blueCircle = true
- FrankGoodman2012.appliesTo FrankGoodman2012.Feature.green FrankGoodman2012.Object.greenSquare = true
- FrankGoodman2012.appliesTo FrankGoodman2012.Feature.square FrankGoodman2012.Object.blueSquare = true
- FrankGoodman2012.appliesTo FrankGoodman2012.Feature.square FrankGoodman2012.Object.greenSquare = true
- FrankGoodman2012.appliesTo FrankGoodman2012.Feature.circle FrankGoodman2012.Object.blueCircle = true
- FrankGoodman2012.appliesTo x✝¹ x✝ = false
Instances For
The applicable utterances at a referent — [FG12]'s W(r), the
support over which the speaker normalizes.
Equations
- FrankGoodman2012.applicable r = {w : FrankGoodman2012.Feature | FrankGoodman2012.appliesTo w r = true}
Instances For
The extension size |w| — the number of objects the feature applies to.
Equations
- FrankGoodman2012.numApplies w = {o : FrankGoodman2012.Object | FrankGoodman2012.appliesTo w o = true}.card
Instances For
Informativity utility (rationality α = 1, no cost): the surprisal
score w = log (1 / |w|) = - log |w|. Tilting by this realizes eq. (2)'s
|w|⁻¹.
Equations
- FrankGoodman2012.score w = -Real.log ↑(FrankGoodman2012.numApplies w)
Instances For
The informative speaker (eq. 2): the Gibbs measure tilting Measure.count
restricted to the applicable utterances W(r) by the surprisal score.
Equations
- FrankGoodman2012.speakerAt r = RSA.Gibbs.speaker (MeasureTheory.Measure.count.restrict ↑(FrankGoodman2012.applicable r)) FrankGoodman2012.score
Instances For
Speaker API at this stimulus #
These wrappers carry by decide defaults for the applicability side-conditions, so
the concrete predictions below never spell out w ∈ applicable r proofs.
At an applicable utterance, the speaker mass is the softmax over W(r).
A non-applicable utterance gets zero speaker mass.
Speaker preference at a referent reduces to the surprisal comparison.
α-speaker preference at a referent (α > 0) reduces to the surprisal comparison.
Numerical bookkeeping #
Predictions #
The speaker prefers the uniquely-identifying description (Fig. 1A): for the
target (blueCircle), circle (which uniquely identifies it) gets strictly more
mass than the ambiguous blue. Reduces via speaker_countRestrict_lt_iff_score_lt
to the surprisal comparison score blue < score circle.
Robustness to rationality: the informativeness preference holds at every
rationality level α > 0, not just the canonical α = 1 (prefers_informative).
A consumer of the α-generalized speaker RSA.Gibbs.speakerAlpha.
The fully-rational speaker is deterministic (α → ∞): at the target, the
speaker concentrates all its mass on the uniquely-identifying circle. The
zero-temperature limit of prefers_informative, via
RSA.Gibbs.speakerAlpha_countRestrict_tendsto_one_of_isMax.
Size principle: among applicable utterances, the speaker prefers the one
with the smaller extension (lower numApplies).
Pragmatic narrowing for "blue": the speaker assigns less mass to "blue" at
blueCircle (where "circle" uniquely identifies, raising the partition) than at
blueSquare (where the only alternative "square" is equally ambiguous). The
numerators are equal; the comparison is the partition comparison 3/2 > 1 — which
is what lets a listener hearing "blue" narrow toward blueSquare.
Pragmatic narrowing for "square": symmetrically, "square" is less likely at
greenSquare (where "green" uniquely identifies) than at blueSquare.
Unique reference for "green": "green" applies only to greenSquare, so it
gets zero mass at blueSquare and positive mass at greenSquare — the listener
hearing "green" identifies greenSquare.
Unique reference for "circle": "circle" applies only to blueCircle.