RSA reference game — parametric substrate #
The classic Rational Speech Act reference game from @cite{frank-goodman-2012} as a substrate, parametric in the meaning matrix and world prior. Every paper using a "speaker normalises inverse-extension- size weights" pattern inherits the architectural theorems below.
The architectural pitch #
WebPPL/Python implementations re-derive the size principle and pragmatic narrowing per stimulus. Here both are proved once, parametric in the vocabulary, the meaning matrix, and the world prior. Specific empirical stimuli become one-liner instances.
Main definitions #
RSA.ReferenceGame.IsCovering meaning— the structural assumption: every utterance has a non-empty extension AND every world is named by some utterance. Captured as a typeclass to avoid threading two hypotheses.RSA.ReferenceGame.L0— literal listener (Eq. S3): uniform on extension.RSA.ReferenceGame.S1— pragmatic speaker (Eq. S4 with α=1, no cost):PMF.normalizeofL0weights — the Tenenbaum-Griffiths size principle.RSA.ReferenceGame.L1— pragmatic listener:PMF.posterioragainst any full-support world prior.
Main statements #
S1_apply_of_appliesTo— the size principle as Eq. S4:S1 w u = (1/|⟦u⟧|) / Σ_{u'} L0 u' wwhenuapplies tow.S1_lt_of_partition_gt— the pragmatic-narrowing theorem: whenuapplies to two worlds, the speaker prefers the world whose alternative-set has SMALLER partition function.L1_gt_iff_S1_lt— narrowing transfers from S1 to L1 (uniform prior).L1_eq_pure_of_unique_extension— when an utterance applies to a unique world, the listener concentrates entirely there (full mass).
Concrete instantiation #
Phenomena/Reference/Studies/FrankGoodman2012.lean instantiates these
theorems at the paper's 3-object × 4-feature stimulus. Each of the 4 paper
findings is a one-liner corollary.
§1. The covering hypothesis #
A meaning matrix is covering if every utterance has a non-empty
extension AND every world is named by some utterance. Minimal data needed
to set up an RSA reference game on the model class. Captured as a typeclass
so consumers state [IsCovering meaning] once and inherit both facts.
- extension_nonempty (u : Word) : (extensionOf meaning u).Nonempty
Every utterance applies to at least one object.
- covering (w : Object) : ∃ (u : Word), meaning u w = true
Every object is named by at least one utterance.
Instances
§2. Model definitions #
Extension of an utterance: the Finset of objects to which it applies.
Equations
- RSA.ReferenceGame.extension meaning u = RSA.extensionOf meaning u
Instances For
Literal listener (Eq. S3): uniform on the utterance's extension.
Equations
- RSA.ReferenceGame.L0 meaning u = RSA.L0OfBoolMeaning meaning u ⋯
Instances For
The L0 sum over utterances at any world is non-zero (covering).
The L0 sum is finite.
Pragmatic speaker (Eq. S4 with α=1, no cost): PMF.normalize of
the literal-listener weights. The normalisation factor cancels the per-
utterance constant from the Boltzmann derivation; what remains is the
size principle — probability inversely proportional to extension size.
Equations
- RSA.ReferenceGame.S1 meaning w = PMF.normalize (fun (u : Word) => (RSA.ReferenceGame.L0 meaning u) w) ⋯ ⋯
Instances For
The marginal Σ_w worldPrior w · S1 w u is non-zero whenever the
prior puts positive mass on some world to which u applies — the
substrate condition for PMF.posterior.
Pragmatic listener: posterior of S1 against any world prior.
Equations
- RSA.ReferenceGame.L1 meaning worldPrior u h_marg = PMF.posterior (RSA.ReferenceGame.S1 meaning) worldPrior u h_marg
Instances For
§3. The partition function #
The "partition function" Σ_u L0 u w is the size-principle weight of all
applicable utterances at world w. It is the denominator of the speaker's
softmax (Eq. S4) and the load-bearing object for pragmatic narrowing.
The structural theorem partition_eq_filter_sum rewrites it as a sum over
the applicable-utterance Finset — the form most consumers want.
The partition function at world w: the size-principle weight summed
over all utterances. Equals Σ_{u : meaning u w} (1/|⟦u⟧|) (theorem
partition_eq_filter_sum below).
Equations
- RSA.ReferenceGame.partition meaning w = ∑' (u : Word), (RSA.ReferenceGame.L0 meaning u) w
Instances For
The partition is non-zero (covering).
The partition is finite.
Partition as filter sum: the partition function equals the sum of
inverse extension cardinalities over applicable utterances. The
"non-applicable" terms vanish (their L0 value is 0); what remains is
the size-principle weight of each alternative.
This is the form most consumers use to compute concrete partition values.
§4. Apply formulas (Eqs. S3, S4) #
Size principle (Eq. S4): for an utterance applying to w, the
speaker probability is (1/|⟦u⟧|) divided by the sum of 1/|⟦u'⟧| over
all utterances u' applying to w — the partition function.
The speaker assigns zero probability to utterances that don't apply.
§4. Pragmatic narrowing — the architectural theorem #
Pragmatic-narrowing theorem (parametric): when u applies to two
worlds w and w', the speaker prefers u at w' over w iff the
partition function at w exceeds that at w'.
Architectural reading: a larger partition at a world means more competition
from informative alternatives, so the speaker is less likely to choose u
at that world; the listener therefore narrows toward the world with fewer
informative competitors. This is the load-bearing structural theorem of
@cite{frank-goodman-2012} — every paper finding is an instance.
Direct via PMF.normalize_lt_of_apply_eq_of_sum_lt: same numerator
(L0 u w = L0 u w' = (extension u).card⁻¹), comparing partitions.
Cross-base companion: when u applies to only one of the two worlds,
the speaker prefers the applying world (the other gets zero mass).
§5. L1 narrowing under uniform prior #
L1 narrowing follows S1 narrowing (uniform world prior): the posterior at uniform prior reverses the speaker's preference, so L1 favours the world S1 favours.
§6. Unique reference #
Unique-reference theorem: when utterance u applies to a unique
world w_unique (no other world satisfies meaning u), the L1 posterior
concentrates entirely on w_unique — full mass 1.
Architectural reading: a uniquely-identifying utterance leaves the listener
with no Bayesian uncertainty. Direct via
PMF.posterior_eq_one_of_singleton_score_support.