Documentation

Linglib.Studies.FrankGoodman2012

[FG12] reference game (Measure/Kernel-native) #

[FG12]

"Predicting Pragmatic Reasoning in Language Games", Science 336, 998.

The Rational Speech Act model of the paper, on the Measure/Kernel-native analytic foundation of Pragmatics/RSA/Gibbs.lean. The informative speaker (the paper's eq. 2) is a MeasureTheory.Measure.tilted Gibbs measure: Measure.count restricted to the applicable utterances W(r), tilted by the surprisal utility score w = log (1 / |w|). The closed form is exactly

S₁(w | r) ∝ |w|⁻¹ over w ∈ W(r),

[FG12]'s eq. (2). The architectural content — the speaker is a Gibbs measure, monotone in utility, and the rational optimizer of expected-utility-minus- KL (RSA.Gibbs.speaker_isGreatest) — is the substrate; here it is instantiated at the paper's stimulus (Fig. 1A).

The pragmatic listener (eq. 1) is the Bayesian posterior of the speaker against the salience prior, RSA.Gibbs.listener; its pragmatic inferences are driven by the speaker asymmetries proved below (narrowing, unique reference). Empirical fit (speaker r = 0.98, listener r = 0.99) is reported in the paper, not as a theorem here.

Main statements #

Stimulus (Fig. 1A) #

Three objects and four feature-words. Two features (green, circle) are uniquely identifying; two (blue, square) are ambiguous between two objects each.

The three objects in the reference context.

Instances For
    @[implicit_reducible]
    Equations
    @[implicit_reducible]
    Equations
    • One or more equations did not get rendered due to their size.
    def FrankGoodman2012.instReprObject.repr :
    ObjectStd.Format
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      The four feature-word utterances.

      Instances For
        @[implicit_reducible]
        Equations
        @[implicit_reducible]
        Equations
        • One or more equations did not get rendered due to their size.
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For

          The applicable utterances at a referent — [FG12]'s W(r), the support over which the speaker normalizes.

          Equations
          Instances For

            The extension size |w| — the number of objects the feature applies to.

            Equations
            Instances For
              noncomputable def FrankGoodman2012.score (w : Feature) :

              Informativity utility (rationality α = 1, no cost): the surprisal score w = log (1 / |w|) = - log |w|. Tilting by this realizes eq. (2)'s |w|⁻¹.

              Equations
              Instances For
                noncomputable def FrankGoodman2012.speakerAt (r : Object) :
                MeasureTheory.Measure Feature

                The informative speaker (eq. 2): the Gibbs measure tilting Measure.count restricted to the applicable utterances W(r) by the surprisal score.

                Equations
                Instances For

                  Speaker API at this stimulus #

                  These wrappers carry by decide defaults for the applicability side-conditions, so the concrete predictions below never spell out w ∈ applicable r proofs.

                  theorem FrankGoodman2012.speakerAt_apply (r : Object) (w : Feature) (h : w applicable r := by decide) :
                  (speakerAt r) {w} = ENNReal.ofReal (Real.exp (score w) / xapplicable r, Real.exp (score x))

                  At an applicable utterance, the speaker mass is the softmax over W(r).

                  theorem FrankGoodman2012.speakerAt_apply_zero (r : Object) (w : Feature) (h : wapplicable r := by decide) :
                  (speakerAt r) {w} = 0

                  A non-applicable utterance gets zero speaker mass.

                  theorem FrankGoodman2012.speakerAt_lt_iff (r : Object) (w₁ w₂ : Feature) (h₁ : w₁ applicable r := by decide) (h₂ : w₂ applicable r := by decide) :
                  (speakerAt r) {w₁} < (speakerAt r) {w₂} score w₁ < score w₂

                  Speaker preference at a referent reduces to the surprisal comparison.

                  theorem FrankGoodman2012.speakerAtAlpha_lt_iff (r : Object) {α : } ( : 0 < α) (w₁ w₂ : Feature) (h₁ : w₁ applicable r := by decide) (h₂ : w₂ applicable r := by decide) :
                  (RSA.Gibbs.speakerAlpha (MeasureTheory.Measure.count.restrict (applicable r)) α score) {w₁} < (RSA.Gibbs.speakerAlpha (MeasureTheory.Measure.count.restrict (applicable r)) α score) {w₂} score w₁ < score w₂

                  α-speaker preference at a referent (α > 0) reduces to the surprisal comparison.

                  Numerical bookkeeping #

                  Predictions #

                  The speaker prefers the uniquely-identifying description (Fig. 1A): for the target (blueCircle), circle (which uniquely identifies it) gets strictly more mass than the ambiguous blue. Reduces via speaker_countRestrict_lt_iff_score_lt to the surprisal comparison score blue < score circle.

                  theorem FrankGoodman2012.prefers_informative_alpha {α : } ( : 0 < α) :
                  (RSA.Gibbs.speakerAlpha (MeasureTheory.Measure.count.restrict (applicable Object.blueCircle)) α score) {Feature.blue} < (RSA.Gibbs.speakerAlpha (MeasureTheory.Measure.count.restrict (applicable Object.blueCircle)) α score) {Feature.circle}

                  Robustness to rationality: the informativeness preference holds at every rationality level α > 0, not just the canonical α = 1 (prefers_informative). A consumer of the α-generalized speaker RSA.Gibbs.speakerAlpha.

                  theorem FrankGoodman2012.fully_rational_picks_circle :
                  Filter.Tendsto (fun (α : ) => (RSA.Gibbs.speakerAlpha (MeasureTheory.Measure.count.restrict (applicable Object.blueCircle)) α score) {Feature.circle}) Filter.atTop (nhds 1)

                  The fully-rational speaker is deterministic (α → ∞): at the target, the speaker concentrates all its mass on the uniquely-identifying circle. The zero-temperature limit of prefers_informative, via RSA.Gibbs.speakerAlpha_countRestrict_tendsto_one_of_isMax.

                  theorem FrankGoodman2012.size_principle (r : Object) (w₁ w₂ : Feature) (h₁ : w₁ applicable r) (h₂ : w₂ applicable r) (h : numApplies w₂ < numApplies w₁) :
                  (speakerAt r) {w₁} < (speakerAt r) {w₂}

                  Size principle: among applicable utterances, the speaker prefers the one with the smaller extension (lower numApplies).

                  Pragmatic narrowing for "blue": the speaker assigns less mass to "blue" at blueCircle (where "circle" uniquely identifies, raising the partition) than at blueSquare (where the only alternative "square" is equally ambiguous). The numerators are equal; the comparison is the partition comparison 3/2 > 1 — which is what lets a listener hearing "blue" narrow toward blueSquare.

                  Pragmatic narrowing for "square": symmetrically, "square" is less likely at greenSquare (where "green" uniquely identifies) than at blueSquare.

                  Unique reference for "green": "green" applies only to greenSquare, so it gets zero mass at blueSquare and positive mass at greenSquare — the listener hearing "green" identifies greenSquare.