Documentation

Linglib.Studies.BergenGoodman2015

[BG15]: the strategic use of noise #

Topics in Cognitive Science 7(2), 336–350. RSA over a noisy channel P_N(u_p | u_i): the listener reasons about which intended utterance the perceived one came from (eq. 6), the speaker about which utterances survive noise (eqs. 7–8).

Main results #

Implementation notes #

Ellipsis is exact ℚ≥0: each meaning has a unique truthful full sentence, so the eq. 7 softmax is degenerate and eq. 8 reduces to the channel row (s1nQ); listeners are PMF.ofScores. Prosody is transcendental — the eq. 7 utilities are channel-weighted geometric means of literal posteriors (xAtom, yAtom) — and fully parametric in ε: the mechanism theorem xAtom_lt_yAtom places the atoms strictly on either side of the unstressed posterior by the two-factor GM bounds in Pragmatics/RSA/Atoms.lean, and the headline reduces to that ordering plus algebra. No magnitude certificates.

Ellipsis (§3) #

Three meanings, seven utterances (full sentences plus fragments); only full sentences have literal meaning, and per-word deletion (rate δ) turns them into fragments.

Meanings: who went to the movies.

Instances For
    @[implicit_reducible]
    Equations
    @[implicit_reducible]
    Equations
    • One or more equations did not get rendered due to their size.
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      Utterances: full sentences and fragments. Fragments arise from noise deleting words from full sentences.

      Instances For
        @[implicit_reducible]
        Equations
        @[implicit_reducible]
        Equations
        • One or more equations did not get rendered due to their size.
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For
          noncomputable def BergenGoodman2015.EllipsisModel.noiseChannel (δ : ) :
          UtteranceUtterance

          Deletion channel: a full sentence survives with probability 1 − δ or loses its predicate with probability δ (the path relevant to "Bob").

          Equations
          Instances For

            The ellipsis chain #

            ℚ noise rate (1%; Fig. 1 shows robustness across 10⁻⁵–10⁻¹).

            Equations
            Instances For

              ℚ noisy literal-listener score (eq. 6 numerator).

              Equations
              • One or more equations did not get rendered due to their size.
              Instances For

                The noise-folded pragmatic speaker (eqs. 7–8). Each meaning has exactly one truthful full sentence, so eq. 7's softmax has singleton support and the intended-speaker distribution is degenerate at fullOf m; eq. 8's fold then reduces to the channel row of fullOf m.

                Equations
                Instances For

                  Pragmatic listener over meanings (eq. 8, uniform meaning prior).

                  Equations
                  Instances For
                    noncomputable def BergenGoodman2015.EllipsisModel.noisyMeaning (δ : ) (u_p : Utterance) (m : Meaning) :

                    Noisy L0 meaning (Eq. 6 numerator).

                    meaning(u_p, m) = Σ_{u_i} ⟦u_i⟧(m) · P(u_i) · P_N(u_p | u_i)

                    The listener considers all intended utterances u_i with meaning m, weighted by how likely noise would produce the perceived u_p.

                    Equations
                    • One or more equations did not get rendered due to their size.
                    Instances For

                      Ellipsis Predictions #

                      Hearing the fragment "Bob" — no literal meaning — L0 infers "Bob went": the only full sentence producing "Bob" by deletion.

                      Parametric robustness (Fig. 1, left panel) #

                      The noisy meaning at "bob" is δ for bobWent and 0 for all others.

                      Only "Bob went to the movies" can produce "Bob" via noise deletion, and only with meaning bobWent. Therefore L0("bob") = δ/δ = 1.

                      Fragment interpretation at every noise rate δ > 0 — "this reasoning will work even if the noise rate is arbitrarily close to 0, so long as it is positive": "Bob" only arises from "Bob went", so L0 gives it probability δ/δ = 1.

                      Prosody (§4) #

                      Stress halves the noise rate on the stressed word (§4.1's ε/n at n = 2). An exhaustive-knowledge speaker must protect "Bob" from mishearing, a non-exhaustive one need not — so the listener reads stress as exhaustivity.

                      Meanings: who went to the movies (exhaustive vs non-exhaustive).

                      Instances For
                        @[implicit_reducible]
                        Equations
                        @[implicit_reducible]
                        Equations
                        • One or more equations did not get rendered due to their size.
                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For

                          Utterances: with and without prosodic stress (CAPS = stress).

                          Instances For
                            @[implicit_reducible]
                            Equations
                            @[implicit_reducible]
                            Equations
                            • One or more equations did not get rendered due to their size.
                            Equations
                            • One or more equations did not get rendered due to their size.
                            Instances For

                              Literal meaning: lower-bound semantics. "Alice went" is true if Alice went (regardless of others).

                              Equations
                              Instances For
                                noncomputable def BergenGoodman2015.ProsodyModel.noiseChannel (ε : ) :
                                UtteranceUtterance

                                Noise channel with prosody.

                                The confusion is between subjects: "Alice" ↔ "Bob".

                                • No stress: ε chance of subject confusion
                                • With stress: ε/2 chance (stress reduces noise)
                                Equations
                                Instances For
                                  noncomputable def BergenGoodman2015.ProsodyModel.noisyMeaning (ε : ) (u_p : Utterance) (m : Meaning) :

                                  Noisy L0 meaning (Eq. 6 numerator) for the prosody model.

                                  Equations
                                  • One or more equations did not get rendered due to their size.
                                  Instances For
                                    noncomputable def BergenGoodman2015.ProsodyModel.l0 (ε : ) (u_p : Utterance) (m : Meaning) :

                                    Literal listener over meanings given the perceived utterance (eq. 6, uniform priors).

                                    Equations
                                    • One or more equations did not get rendered due to their size.
                                    Instances For
                                      noncomputable def BergenGoodman2015.ProsodyModel.xAtom (ε : ) :

                                      Unstressed speaker-utility atom (eq. 7): the channel-weighted geometric mean of the literal posteriors for uttering "Bob went".

                                      Equations
                                      • One or more equations did not get rendered due to their size.
                                      Instances For
                                        noncomputable def BergenGoodman2015.ProsodyModel.yAtom (ε : ) :

                                        Stressed speaker-utility atom (eq. 7) for "BOB went".

                                        Equations
                                        • One or more equations did not get rendered due to their size.
                                        Instances For
                                          theorem BergenGoodman2015.ProsodyModel.xAtom_lt_yAtom {ε : } (hε0 : 0 < ε) ( : ε < 2 / 3) :
                                          xAtom ε < yAtom ε

                                          The paper's mechanism, at every noise rate ε < 2/3: the stressed atom strictly exceeds the unstressed one. Stress both concentrates the channel on the informative percept and sharpens that percept's posterior, so the weighted geometric means separate across the unstressed posterior (1 − ε/2)/(2 + ε) — no magnitude computation involved.

                                          theorem BergenGoodman2015.ProsodyModel.xAtom_pos {ε : } (hε0 : 0 < ε) (hε1 : ε < 1) :
                                          0 < xAtom ε
                                          theorem BergenGoodman2015.ProsodyModel.yAtom_pos {ε : } (hε0 : 0 < ε) (hε1 : ε < 1) :
                                          0 < yAtom ε
                                          theorem BergenGoodman2015.ProsodyModel.xAtom_eq_exp {ε : } (hε0 : 0 < ε) (hε1 : ε < 1) :
                                          xAtom ε = Real.exp ((1 - ε) * Real.log (l0 ε Utterance.bobWent Meaning.onlyBob) + ε * Real.log (l0 ε Utterance.aliceWent Meaning.onlyBob))

                                          The atoms are the paper's exponentiated eq.-7 utilities.

                                          noncomputable def BergenGoodman2015.ProsodyModel.l1Score (ε : ) (u_p : Utterance) (m : Meaning) :

                                          Pragmatic-listener score (eq. 8): speaker-normalized atoms folded through the channel.

                                          Equations
                                          • One or more equations did not get rendered due to their size.
                                          Instances For
                                            noncomputable def BergenGoodman2015.ProsodyModel.l1PMF (ε : ) (u_p : Utterance) :

                                            Pragmatic listener (eq. 8).

                                            Equations
                                            Instances For

                                              Stress increases the exhaustive interpretation, at every noise rate ε < 2/3: "BOB went" is strictly more likely than "Bob went" to mean only Bob went (§4). By posterior dominance (Finset.div_sum_lt_div_sum), the comparison is cell-by-cell odds dominance: trivial at onlyAlice (the stressed row is zero there) and onlyBob, and the atom ordering xAtom < yAtom — the paper's mechanism — at both.

                                              Utterance adapter: a row's stress feature as an utterance.

                                              Equations
                                              • One or more equations did not get rendered due to their size.
                                              Instances For

                                                The model's L1 assigns the exhaustive meaning more probability at the stressed row's utterance than at the unstressed row's, matching the rows' recorded reading contrast.

                                                Stress widens the channel's correct-vs-confused gap by exactly ε (1 − ε stressed versus 1 − 2ε unstressed) — the channel-level face of the mechanism, at every noise rate.