[And21]: conversation update for RSA #

[Luc59] [Sta02] [FG12]

Tell me everything you know (SCiL 2021, 244–253): multi-turn conversation in RSA. The common ground is a probability distribution over worlds substituted for the RSA world prior (Figure 4), updated each turn by convex combination with the pragmatic-listener posterior at a learning rate, with weighted, thresholded, and difference sampling for cooperative observation selection.

Main results #

updateCG_matches_linear_learning: the update rule is [Luc59]'s linear learning rule — multi-turn conversation is iterated learning over distributions.
lr_one_excludes_false_worlds / graded_update_keeps_false_world: set-intersection update is the lr = 1 degenerate limit; the graded common ground is non-monotonic by design (fn. 7).
Turn-1 and turn-2 predictions over the MutualFriends worlds (individuals typed by major × location), including turn2_breaks_symmetry: an updated common ground changes what the same utterance conveys.
toBToMSharedUpdate: Shared := PMF W instantiates BToMModel.sharedUpdate — BToM discourse dynamics with a distributional shared state.

Implementation notes #

The Figure-4 chain is exact ℚ≥0, parameterized by common-ground weights: each agent (l0Score/s1Score/l1Score/s2Score) is one PMF.normalizeScores application over the agent below it, the distributions are PMF.ofScores, and every prediction closes by the ofScores comparison family with one kernel certificate.

Distributional common ground #

PMF W wraps PMF W: [Sta02]'s context set with graded plausibility summing to one (§3.2), so entropy — Anderson's success criterion — and KL divergence are available on the carrier. toContextSet projects to the positive-mass worlds (PMF.support), PMF.uniformOfFintype is the empty common ground; ofWeights renormalizes non-negative weights (fn. 3). Unlike the classical context set, worlds can regain probability (fn. 7); intersection update survives only at lr = 1.

The common ground as a distribution #

Anderson's distributional common ground is a PMF W ([And21] §3.2): graded plausibility summing to one, with PMF.entropy — the success criterion — and PMF.toRealFn (the ℝ-valued masses every RSA consumer reads) already on the carrier. The classical context set is the support.

[And21]: conversation update for RSA #

Main results #

Implementation notes #

Distributional common ground #

The common ground as a distribution #

MutualFriends Domain #

Distributional Common Ground (re-exported from substrate) #

CommonGround Update #

Conversation State #

Observation Sampling #

BToM Shared-State Update #

The Figure-5 beliefs #

The Figure-4 model on ℚ≥0 scores #

The Figure-4 chain #

The score chain #

The prior-transparency mechanism #

Turn-1 predictions #

Turn 2 (Post-Update Prior) #

Turn-2 predictions #

The CG-adapted speaker #

Parametric RSA and Conversation Step #

Qualitative information-sharing properties #

Bridge to Classical CommonGround Update #

Exact Numerical Predictions (turn 1) #

Exact turn-1 values #

Approximate CommonGround Model (§5.2, Figure 6) #

Approximate common ground #

Belief Update Model (§6, Figure 8) #

Noncommittal Speaker Problem (§7.1) #

Redundancy and Difference Sampling (§7.2) #