[HGG21]: the division of labor in communication #
A resource-rational extension of RSA for perspective-taking in the
[KLB03] director–matcher reference game. Perspective-taking is costly,
so each agent allocates effort via a mixture weight w ∈ [0,1], and the optimal
effort depends on the partner's expected effort.
Two PMF reference games formalize the task, built on the canonical operators
(RSA.L0OfBoolMeaning, RSA.S1Belief, PMF.posterior):
- egocentric (
egoL0/egoS1/egoL1) — three visible objects, shape alone identifies the target. - asymmetric (
asymL0/asymS1/asymL1) — a hidden object behind an occlusion, whose feature-match profile is the latent variable (latentPrior), marginalized viaRSA.marginalizeKernel; the speaker hedges with more specific utterances.
The mixture model and resource-rational optimization sit outside the RSA loop, in ℝ, grounded in the PMF literal listener.
Main declarations #
egoL1,asymL1— the two pragmatic-listener posteriors.MontaguGrounding.grounding_ego_meaning— the literal semantics is intersective predicate modification (Modifier.intersective).asym_S1_prefers_specificity_when_shape_matches— under asymmetry, the speaker prefers a more specific utterance when the hidden object shares a feature.mixUtility,rrUtilityFull— log-space mixture utility and resource-rational utility over the perspective-taking weight.no_cost_prefers_full_pt,high_cost_penalizes_full_pt— full perspective-taking is worth its cost only when the cost is low.
Empirical anchors #
Experiment 1 (83 dyads, 2×2 occlusion × distractor): speakers used more words under occlusion (+1.3 words) and under a same-shape distractor (+0.6 words). Experiment 2 (116 dyads, a [KLB03] replication): scripted directors elicited ~51% critical errors vs. ~20% for naive directors, listeners adapted from 43% to 30% errors over four critical trials, and informativity predicted accuracy (ρ = −0.81). The eight critical items are the [KLB03] materials (the paper's Table 1). Effect sizes and model fits: [HGG21].
The RSA model #
The egocentric game (egoL0/egoS1/egoL1) is over three visible objects with
a belief-based α = 2 speaker. The asymmetric game adds a hidden object whose
feature-match profile is the latent variable, each feature matching the target
independently with probability 1/4. Utterance semantics are intersective
predicate modification (see MontaguGrounding).
Finite types #
The 3 visible objects in the example display.
target: shape=0, color=0, texture=0 d1: shape=1, color=0, texture=0 (shares color+texture with target) d2: shape=2, color=1, texture=1 (differs on all features)
Instances For
Equations
- HawkinsGweonGoodman2021.instDecidableEqVisObj x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Equations
Equations
- One or more equations did not get rendered due to their size.
Equations
- HawkinsGweonGoodman2021.instDecidableEqAsymObj x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Equations
Equations
- One or more equations did not get rendered due to their size.
Equations
- HawkinsGweonGoodman2021.instDecidableEqUtt x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- HawkinsGweonGoodman2021.instReprUtt = { reprPrec := HawkinsGweonGoodman2021.instReprUtt.repr }
Equations
Equations
- One or more equations did not get rendered due to their size.
Utterance cost: number of features mentioned
Equations
- HawkinsGweonGoodman2021.Utt.null.cost = 0
- HawkinsGweonGoodman2021.Utt.s.cost = 1
- HawkinsGweonGoodman2021.Utt.c.cost = 1
- HawkinsGweonGoodman2021.Utt.t.cost = 1
- HawkinsGweonGoodman2021.Utt.sc.cost = 2
- HawkinsGweonGoodman2021.Utt.st.cost = 2
- HawkinsGweonGoodman2021.Utt.ct.cost = 2
- HawkinsGweonGoodman2021.Utt.sct.cost = 3
Instances For
Literal semantics #
Does utterance apply to an entity with given feature-match profile? For each feature the utterance mentions, the entity must match the target.
Instances For
Egocentric literal meaning: does utterance apply to visible object? Target matches on all features. d1 differs only on shape. d2 differs on all.
Equations
- HawkinsGweonGoodman2021.egoMeaning u HawkinsGweonGoodman2021.VisObj.target = true
- HawkinsGweonGoodman2021.egoMeaning u HawkinsGweonGoodman2021.VisObj.d1 = u.applies false true true
- HawkinsGweonGoodman2021.egoMeaning u HawkinsGweonGoodman2021.VisObj.d2 = u.applies false false false
Instances For
Asymmetric literal meaning: includes hidden object behind occlusion.
The hidden object's match profile is the latent variable l = (matchShape, matchColor, matchTexture).
Each feature independently matches target with P = 1/4.
Equations
- HawkinsGweonGoodman2021.asymMeaning l u HawkinsGweonGoodman2021.AsymObj.target = true
- HawkinsGweonGoodman2021.asymMeaning l u HawkinsGweonGoodman2021.AsymObj.d1 = u.applies false true true
- HawkinsGweonGoodman2021.asymMeaning l u HawkinsGweonGoodman2021.AsymObj.d2 = u.applies false false false
- HawkinsGweonGoodman2021.asymMeaning l u HawkinsGweonGoodman2021.AsymObj.hidden = u.applies l.1 l.2.1 l.2.2
Instances For
Model #
Both configurations use the canonical PMF reference-game operators
(RSA.L0OfBoolMeaning, RSA.S1Belief, PMF.posterior), as in
Studies/TesslerFranke2020PMF: L0 is
uniform on an utterance's extension, S1 is the belief-based speaker with
α = 2 and no cost, and L1 is the Bayesian posterior under a uniform world
prior. The asymmetric model marginalizes the hidden-object profile via
RSA.marginalizeKernel.
Egocentric model #
Every utterance applies to the target (it matches on all features), so each extension is non-empty.
Belief-based speaker: S1(u | w) ∝ L0(w | u)^2, no cost (α = 2).
Equations
- HawkinsGweonGoodman2021.egoS1 w = RSA.S1Belief HawkinsGweonGoodman2021.egoL0 (fun (x : HawkinsGweonGoodman2021.Utt) => 1) 2 w ⋯ ⋯
Instances For
Uniform world prior over the three visible objects.
Equations
- HawkinsGweonGoodman2021.egoWorldPrior = PMF.uniformOfFintype HawkinsGweonGoodman2021.VisObj
Instances For
An utterance applying only to the target makes the listener certain: the posterior puts full mass on the target.
Asymmetric model #
The hidden object's feature-match profile is the latent variable Profile,
with prior weight 1 for a match and 3 for a non-match on each feature (each
feature matches the target with probability 1/4). The speaker is conditioned
on the latent; the listener marginalizes it via RSA.marginalizeKernel.
Whether the hidden object matches the target on (shape, color, texture).
Equations
- HawkinsGweonGoodman2021.Profile = (Bool × Bool × Bool)
Instances For
Literal listener under hidden profile l.
Equations
Instances For
Speaker conditioned on the hidden-object profile.
Equations
- HawkinsGweonGoodman2021.asymS1 l w = RSA.S1Belief (HawkinsGweonGoodman2021.asymL0 l) (fun (x : HawkinsGweonGoodman2021.Utt) => 1) 2 w ⋯ ⋯
Instances For
Latent prior weight: 1 per matching feature, 3 per non-match.
Equations
- HawkinsGweonGoodman2021.profileWeight l = ((if l.1 = true then 1 else 3) * if l.2.1 = true then 1 else 3) * if l.2.2 = true then 1 else 3
Instances For
Prior over hidden-object profiles.
Equations
Instances For
Listener's marginal speaker: hidden profile integrated out.
Equations
Instances For
Uniform world prior over the four objects (three visible + hidden).
Equations
- HawkinsGweonGoodman2021.asymWorldPrior = PMF.uniformOfFintype HawkinsGweonGoodman2021.AsymObj
Instances For
Pragmatic listener: posterior of the latent-marginalized speaker.
Equations
Instances For
Compositional grounding #
The literal semantics is intersective predicate modification [HK98]:
each mentioned feature is an intersective adjective — Modifier.intersective
applied to a feature property — and grounding_ego_meaning shows egoMeaning
is exactly their iterated conjunction. This grounds the Bool RSA meaning in
the project-canonical Prop-valued modifier rather than a local copy.
Shape adjective: holds of the target (the only shape-0 object).
Equations
Instances For
Color adjective: holds of the objects sharing the target's color (target, d1).
Equations
Instances For
Texture adjective: holds of the objects sharing the target's texture (target, d1).
Equations
Instances For
Compositional utterance denotation: each mentioned feature is an intersective
adjective (Modifier.intersective), composed over the trivial base property.
Equations
- One or more equations did not get rendered due to their size.
- HawkinsGweonGoodman2021.MontaguGrounding.compositionalMeaning HawkinsGweonGoodman2021.Utt.null = fun (x : HawkinsGweonGoodman2021.VisObj) => True
Instances For
Grounding: the RSA literal meaning egoMeaning holds exactly when the
intersective predicate modification of the mentioned feature adjectives does.
Predictions #
The egocentric model captures the no-occlusion case; the asymmetric model captures occlusion. Predictions are structural PMF proofs over the canonical operators (no interval reflection).
Egocentric predictions #
Shape-only uniquely identifies the target among visible objects: it applies to no other visible object, so the listener concentrates on the target.
The listener is equally confident about the target whether hearing shape-only or the full description: both apply only to the target among visible objects, so each makes the listener certain.
The speaker is indifferent between shape-only and full description at the
target: both apply only to the target, so both have L0 = 1 and equal score.
Asymmetric predictions #
Paper prediction: when the hidden object matches the target's shape but
not its color or texture, the speaker prefers the full description over
shape-only — shape-only fails to distinguish the target from the hidden object
(L0 = 1/2), while the full description succeeds (L0 = 1).
When the hidden object matches no features, the speaker is indifferent:
both shape-only and the full description apply only to the target (L0 = 1).
Paper prediction: under asymmetry, the full description yields a higher listener posterior for the target than shape-only — the hidden object can match individual features, so a more specific utterance is more reliably informative.
TODO: this compares two latent-marginalized posteriors at different
conditioning utterances (L1 .sct vs L1 .s), with distinct normalizing
constants. It reduces to a finite ℝ≥0∞ comparison over the eight hidden
profiles but is beyond hand-discharge; it awaits the planned pmf_score_compare
tactic (cf. Studies/TesslerFranke2020PMF).
Shape+color also beats shape-only: each additional feature narrows the set of possible hidden distractors.
TODO: same shape as asym_full_desc_better_reference (cross-utterance
latent-marginalized posterior comparison); awaits pmf_score_compare.
Resource-rational extensions #
The mixture model and resource-rational optimization sit outside the standard
RSA loop. They are defined in ℝ, grounded in the PMF literal listener (egoL0,
asymL0, via .toReal) and the hidden-profile prior latentPrior.
The mixture operates in log-space (over utilities, not probabilities): the
mixture speaker uses a weighted geometric mean of L0 values,
exp(w_S · E[log L0^asym] + (1 − w_S) · log L0^ego). The model uses α = 2 and
a uniform cost that cancels in the S1 normalization. See
[HGG21] for the asymmetric/egocentric speaker utilities,
the mixture, and the resource-rational utility.
L0 success rates #
Egocentric L0 success rate: the literal listener's target probability given
u, read off egoL0.
Equations
Instances For
Asymmetric L0 success rate: the literal listener's target probability
averaged over hidden profiles, weighted by latentPrior.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Log-space mixture utilities #
Expected log-L0 under the asymmetric model (the asymmetric-utility
component): E_l[log P_L0(target | u, l)]. By Jensen's inequality this is
≤ log (asymInfR u).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Mixture speaker utility (Eq. 5): U^mix(u; w_S) = w_S · E_h[log P_L0^asym(target|u,h)] + (1−w_S) · log P_L0^ego(target|u) Uniform cost (0.03) omitted: it cancels in S1 normalization.
Equations
- HawkinsGweonGoodman2021.mixUtility u wS = wS * HawkinsGweonGoodman2021.asymLogInfR u + (1 - wS) * Real.log (HawkinsGweonGoodman2021.egoInfR u)
Instances For
Mixture S1 score: P_S1^mix(u | target, w_S) ∝ exp(α · U^mix(u; w_S)). Paper Eq. 1 with the mixture utility from Eq. 5.
Equations
- HawkinsGweonGoodman2021.mixS1Score u wS α = Real.exp (α * HawkinsGweonGoodman2021.mixUtility u wS)
Instances For
Full resource-rational model #
The full model marginalizes over listener perspective-taking weight w_L.
The simplified model (Eqs 2–5) treats w_L as fixed at 1. The full model
(Eqs 7–9) has the speaker consider a range of listener weights, and the
resource-rational analysis (Eq. 10) measures accuracy averaged over w_L.
**Mixture L0** (Eq. 8): P_{L_0}^{mix}(target|u, l, w_L) =
w_L · P_{L_0}^{asym}(target|u, l) + (1−w_L) · P_{L_0}^{ego}(target|u).
At w_L = 0, the listener ignores hidden objects. At w_L = 1, the listener
accounts for all potential hidden distractors.
**Marginalized S1** (Eq. 9): the speaker's utility integrates over w_L,
discretized to 5 grid points {0, 1/4, 1/2, 3/4, 1} with uniform weight.
**Accuracy** (Eq. 10): since listener accuracy is linear in w_L,
E_{uniform w_L}[accuracy] = (egoInfR + asymInfR) / 2.
Mixture L0 accuracy: probability the mixture listener at weight w_L
correctly identifies the target, given hidden object profile l.
Equations
- HawkinsGweonGoodman2021.mixL0Target u l wL = wL * ((HawkinsGweonGoodman2021.asymL0 l u) HawkinsGweonGoodman2021.AsymObj.target).toReal + (1 - wL) * HawkinsGweonGoodman2021.egoInfR u
Instances For
Asymmetric speaker utility at a specific listener weight:
U^asym(u; w_L) = Σ_l P(l) · log P_L0^mix(target | u, l, w_L).
Equations
- HawkinsGweonGoodman2021.asymUtilityAtWL u wL = ∑ l : HawkinsGweonGoodman2021.Profile, (HawkinsGweonGoodman2021.latentPrior l).toReal * Real.log (HawkinsGweonGoodman2021.mixL0Target u l wL)
Instances For
Mixed speaker utility at specific (w_S, w_L) (Eq. 8).
Equations
- HawkinsGweonGoodman2021.mixUtilityFull u wS wL = wS * HawkinsGweonGoodman2021.asymUtilityAtWL u wL + (1 - wS) * Real.log (HawkinsGweonGoodman2021.egoInfR u)
Instances For
W_L-marginalized speaker utility (Eq. 9 inside the exp). Discretized: 5 uniform grid points at w_L ∈ {0, 1/4, 1/2, 3/4, 1}.
Equations
- HawkinsGweonGoodman2021.mixUtilityMarg u wS = 1 / 5 * ∑ k : Fin 5, HawkinsGweonGoodman2021.mixUtilityFull u wS (↑↑k / 4)
Instances For
Full S1 score with w_L marginalization (Eq. 9).
Equations
- HawkinsGweonGoodman2021.mixS1ScoreFull u wS α = Real.exp (α * HawkinsGweonGoodman2021.mixUtilityMarg u wS)
Instances For
Listener accuracy averaged over uniform w_L (for Eq. 10). Since accuracy(u, w_L) = w_L·asymInfR(u) + (1−w_L)·egoInfR(u) is linear in w_L, the expectation under uniform P(w_L) is the midpoint.
Equations
Instances For
Full expected accuracy (Eq. 10) with w_L marginalization. Uses the w_L-marginalized S1 for speaker production and the w_L-averaged listener accuracy for evaluation.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Full resource-rational utility (Eqs 10–11). U_RR(w_S) = ExpAccuracy_full(w_S) − β · w_S
Equations
- HawkinsGweonGoodman2021.rrUtilityFull wS α β = HawkinsGweonGoodman2021.expectedAccuracyFull wS α - β * wS
Instances For
Structural properties #
At w_S = 0, the simplified mixture utility reduces to egocentric log-L0.
At w_S = 1, the simplified mixture utility reduces to asymmetric expected log-L0.
Resource-rational predictions #
These three are transcendental ℝ inequalities over Real.exp/Real.log of the
L0 success rates. The retired interval-reflection tactic discharged them with its
numeric/interval backend; the PMF migration has no equivalent, so they are
stated with sorry per CLAUDE.md "prefer sorry over weakening". They reduce
to finite real arithmetic and await real-analysis/interval lemmas (or a
pmf_score_compare-style numeric tactic).
Paper prediction (β = 0): when perspective-taking is free, full PT (w_S = 1) achieves higher expected accuracy than no PT (w_S = 0) — the asymmetric speaker produces more specific utterances, improving listener accuracy.
TODO: transcendental ℝ inequality (exp/log of L0 rates); awaits numeric tactic.
Paper prediction (high β): when perspective-taking is costly, the cost
term β · w_S dominates, making w_S = 0 preferable to w_S = 1.
TODO: transcendental ℝ inequality (exp/log of L0 rates); awaits numeric tactic.
Interior-optimum limitation: the paper's central result is that at
moderate cost (β = 0.2) an intermediate weight w*_S ≈ 0.36 outperforms both
extremes. This 3+1-object reference game is too simple to produce that effect:
shape alone identifies the target among visible objects (egoInfR .s = 1), so
the egocentric baseline accuracy is near-ceiling and the marginal gain from
perspective-taking is far below the β = 0.2 cost. So instead no-PT beats full-PT
for all tested β ≥ 1/50.
TODO: transcendental ℝ inequality (exp/log of L0 rates); awaits numeric tactic.
Listener belief adaptation #
Listener's belief about speaker's perspective-taking weight. Over time, listeners update their expectation of w_S based on observed utterance informativity.
- wS_expectation : ℝ
- observations : ℕ
Instances For
Initial uniform belief: E[w_S] = 1/2
Equations
- HawkinsGweonGoodman2021.initialBeliefs = { wS_expectation := 1 / 2, observations := 0 }
Instances For
Update beliefs after observing utterance informativity. Short/uninformative utterances → lower w_S estimate; long/informative utterances → higher w_S estimate.
Equations
- One or more equations did not get rendered due to their size.
Instances For
After seeing short utterances, listener expects lower w_S
Equations
- One or more equations did not get rendered due to their size.
Instances For
Paper prediction ([HGG21] §2.4.1): Listeners infer low speaker effort from under-informative utterances.
Optimal listener weight: compensate for low speaker effort. When the speaker uses low w_S, the listener should increase their own perspective-taking to compensate.
Equations
- HawkinsGweonGoodman2021.optimalListenerWeight speakerWS β = min 1 (max 0 (1 - speakerWS + β))
Instances For
Paper prediction ([HGG21] §2.4.1): Listener increases effort when speaker decreases theirs.