Documentation

Linglib.Studies.SchlotterbeckWang2023

[SW23] — incremental RSA for adjective ordering #

[CGGP19] [DHG+20] [WD21]

Schlotterbeck, F. & Wang, H. (2023). An incremental RSA model for adjective ordering preferences in referential visual context. SCiL 6, 121–132.

Scope. This file formalizes the symmetric-PoE sanity-check slice of the paper, not its main asymmetric model: the order-independence of the incremental listener under symmetric per-class semantics, plus discrimination-driven ordering preferences at the speaker level as chain-rule trajectory products (per-step normalized; the paper's S1^inc normalizes once globally, and the two agree at the literal layer). Not formalized: asymmetric per-class semantics, the P_Lang grammaticality filter, and the size-first utterance-prior bias. The paper's α is the utterance-level softmax temperature; its β (word-level) is 1 in all reported simulations, matching the exponent-free chain here.

Main results #

Implementation notes #

The chain is exact ℚ≥0: lex is the Bernoulli-channel form of [DHG+20]'s continuous semantics (reliability if the word truly applies, the complementary noise floor otherwise), prefix meanings are List products, l0Score/s1Score normalize via PMF.normalizeScores, and the PMF speaker is PMF.ofScores. Trajectories are Fin-indexed products of speaker values, so ordering predictions close by PMF.prod_ofScores_lt with one kernel certificate each.

Referents in the reference game.

Instances For
    @[implicit_reducible]
    Equations
    @[implicit_reducible]
    Equations
    • One or more equations did not get rendered due to their size.
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      Words available to the speaker: size adjectives, color adjectives, noun.

      Instances For
        @[implicit_reducible]
        Equations
        @[implicit_reducible]
        Equations
        • One or more equations did not get rendered due to their size.
        def SchlotterbeckWang2023.instReprWord.repr :
        WordStd.Format
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For
          def SchlotterbeckWang2023.rel (sRel cRel : ℚ≥0) :
          Wordℚ≥0

          Per-class perceptual reliability: size words use sRel, color words use cRel, the noun applies universally.

          Equations
          Instances For
            def SchlotterbeckWang2023.lex (sRel cRel : ℚ≥0) (w : Word) (r : Referent) :
            ℚ≥0

            Noisy word meaning: reliability if the word truly applies, the complementary noise floor otherwise — the Bernoulli-channel form of [DHG+20]'s continuous semantics.

            Equations
            Instances For
              def SchlotterbeckWang2023.meaning (lex : WordReferentℚ≥0) (us : List Word) (r : Referent) :
              ℚ≥0

              PoE prefix meaning: the product of per-word noisy meanings.

              Equations
              Instances For
                theorem SchlotterbeckWang2023.meaning_perm (lex : WordReferentℚ≥0) {us vs : List Word} (h : us.Perm vs) (r : Referent) :
                meaning lex us r = meaning lex vs r

                The listener-level sanity check: the PoE prefix meaning is order-independent, for every lexicon and reliability setting.

                Scenes #

                Scene A lexicon: size more reliable (99/100 vs 95/100).

                Equations
                Instances For

                  Scene B lexicon: color more reliable (95/100 vs 80/100).

                  Equations
                  Instances For

                    The incremental chain #

                    def SchlotterbeckWang2023.l0Score (lex : WordReferentℚ≥0) (scene : ReferentBool) (ctx : List Word) (u : Word) :
                    Referentℚ≥0

                    Literal listener over scene referents at each prefix extension.

                    Equations
                    Instances For
                      def SchlotterbeckWang2023.s1Score (lex : WordReferentℚ≥0) (scene : ReferentBool) (ctx : List Word) (r : Referent) :
                      Wordℚ≥0

                      Word-level speaker (β = 1, no cost): renormalized literal posteriors.

                      Equations
                      Instances For
                        noncomputable def SchlotterbeckWang2023.s1 (lex : WordReferentℚ≥0) (scene : ReferentBool) (ctx : List Word) (r : Referent) :
                        PMF Word

                        Word-by-word speaker at context ctx.

                        Equations
                        Instances For
                          noncomputable def SchlotterbeckWang2023.s1Traj (lex : WordReferentℚ≥0) (scene : ReferentBool) (r : Referent) (us : List Word) :
                          ENNReal

                          Trajectory probability of an utterance: the chain-rule product of per-step speaker values along its prefixes.

                          Equations
                          Instances For
                            def SchlotterbeckWang2023.s1TrajScore (lex : WordReferentℚ≥0) (scene : ReferentBool) (r : Referent) (us : List Word) :
                            ℚ≥0

                            ℚ≥0 shadow of s1Traj.

                            Equations
                            Instances For
                              theorem SchlotterbeckWang2023.s1Traj_lt {lex₁ lex₂ : WordReferentℚ≥0} {sc₁ sc₂ : ReferentBool} {r₁ r₂ : Referent} {us vs : List Word} (h : s1TrajScore lex₁ sc₁ r₁ us < s1TrajScore lex₂ sc₂ r₂ vs) :
                              s1Traj lex₁ sc₁ r₁ us < s1Traj lex₂ sc₂ r₂ vs

                              Predictions #

                              When size has the higher discriminatory power (Scene A), the incremental speaker prefers the size-first ordering.

                              Under equal discrimination with color more reliable (Scene B), the preference flips to color-first.

                              The ordering preference tracks the scene's discriminatory structure, not a fixed ordering rule.

                              In Scene A, "big" is the more informative first word for the target.

                              In Scene B, "blue" is the more informative first word.

                              After both adjectives, the target carries the highest meaning among Scene A members.

                              After both adjectives, the target carries the highest meaning among Scene B members.

                              theorem SchlotterbeckWang2023.lex_as_noiseChannel {sRel cRel : ℚ≥0} (hs : sRel 1) (hc : cRel 1) (w : Word) (r : Referent) :
                              (lex sRel cRel w r) = RSA.Noise.noiseChannel (↑(rel sRel cRel w)) (1 - (rel sRel cRel w)) (if wordApplies w r = true then 1 else 0)

                              lex is the unified noise channel of RSA.Noise at onMatch = reliability, onMismatch = its complement — the [DHG+20] parameterization.