Documentation

Linglib.Studies.BumfordRett2021

[BR21] — Rationalizing Evaluativity #

[BR21] [LG17] [BLG16] [Ret15] [Bar02b]

Proceedings of Sinn und Bedeutung 25, pp. 187–204.

Key Claims #

  1. Evaluativity is gradient: Degree constructions differ in the strength of their evaluative inferences. The model produces a strict ranking: positive > equative > comparative.

  2. Comparison-class uncertainty: Worlds are two-dimensional — subject height × comparison class center. The listener jointly infers height and CC statistics, extending [LG17]'s threshold RSA with [Bar02b]'s insight that CC uncertainty is informative.

  3. Lexical uncertainty for markedness: Antonym-sensitive evaluativity (marked equatives like "as short as" are evaluative; unmarked "as tall as" are not) emerges from [BLG16]'s lexical uncertainty, where marked forms have higher production cost.

  4. Evaluativity metric: E[ht − μ] — the expected deviation of the subject's height from the CC center — measures evaluativity strength.

Model Architecture (§2.1) #

Discretization #

The paper uses heights 1–17, CC centers 5–14, |ht − μ| ≤ 4 (SD = 2). We scale to heights 1–9, CC centers 3–7, |ht − μ| ≤ 2 (SD = 1), preserving the qualitative predictions with a 45-world grid (25 valid worlds after the Gaussian truncation).

Verified Predictions (Table 1) #

  1. Positive construction (Simulation 1): both antonyms evaluative
  2. Exact equative (Simulation 2): marked antonym evaluative, unmarked weakly evaluative
  3. ≥ Equative (Simulation 3): marked evaluative, unmarked barely evaluative
  4. Comparative (Simulation 4): neither antonym evaluative — evaluative world does NOT win over non-evaluative world, unlike equatives
  5. Antonym asymmetry: marked equative produces stronger evaluative inference
  6. Cross-construction contrast: equative marked IS evaluative but comparative marked is NOT, confirming that partial antonymic competition (not just cost) drives evaluativity

Connection to [Ret15] #

[Ret15] derives evaluativity categorically via Q/R-implicature (formalized in Pragmatics/Implicature/Evaluativity.lean). This RSA model derives the same predictions gradiently: evaluativity strength varies continuously across constructions. Both accounts agree on the antonym-sensitive pattern (equative marked > equative unmarked).

The RSA model adds two things the Neo-Gricean account lacks:

@[reducible, inline]

A world is a pair (height index, CC center index).

Height index i ∈ Fin 9 represents height i + 1 (range 1–9). CC center index j ∈ Fin 5 represents center j + 3 (range 3–7). Valid worlds satisfy |height − center| ≤ 2 (enforced via prior).

Equations
Instances For

    Height value (1–9) from world indices.

    Equations
    Instances For

      CC center value (3–7) from world indices.

      Equations
      Instances For

        Deviation of height from CC center: ht − μ.

        Equations
        Instances For

          Gaussian-weighted prior over valid worlds.

          CC center is uniform; height weight decreases with distance from center. Approximates N(μ, 1) truncated at |ht − μ| ≤ 2. Weights: d=0 → 10, d=1 → 6, d=2 → 1, d>2 → 0 (invalid world).

          Equations
          Instances For

            Utterance type: unmarked (positive-polar), marked (negative-polar), or null.

            For the positive construction: unmarked = "tall", marked = "short". For the exact equative: unmarked = "as tall as K", marked = "as short as K". Cost asymmetry (marked = 2, unmarked = 1) drives antonym-sensitive evaluativity via [BLG16]'s lexical uncertainty.

            Instances For
              Equations
              • One or more equations did not get rendered due to their size.
              Instances For
                @[implicit_reducible]
                Equations
                @[implicit_reducible]
                Equations
                • One or more equations did not get rendered due to their size.
                @[reducible, inline]

                Threshold offset σ ∈ {−2, −1, 0, 1, 2}.

                Determines how far above the CC center a person must be to count as "tall." Index s ∈ Fin 5 represents σ = s − 2. Higher σ means a more exclusive threshold.

                Equations
                Instances For

                  Integer offset value: index s ↦ σ = s − 2.

                  Equations
                  Instances For
                    noncomputable def BumfordRett2021.worldPriorR (w : EvalWorld) :

                    World prior as ℝ.

                    Equations
                    Instances For

                      Utterance cost as a natural exponent: exp(−α·C(u)) = e^(costN u) at the paper's α = 4, where e := exp(−4).

                      Equations
                      Instances For

                        Companion architecture on PMF, parameterized by the cost-factor base e (= exp(−4) at the paper's α = 4; only the speaker depends on e):

                        L₀(w | u, σ) ∝ ⟦u⟧(σ,w) · P(w)               (`meaningE`, `L0v`)
                        S₁(u | w, σ) ∝ L₀(w | u, σ)⁴ · e^C(u)         (`Sk`)
                        L₁(w, σ | u) ∝ S₁(u | w, σ) · P(w) · P(σ)     (`listener`, `PMF.posterior`)
                        

                        The prior is baked into the graded L₀ kernel (eq 10, L₀ ∝ P(w)·⟦u⟧(w)); null is licensed everywhere, so the speaker normaliser vanishes only at invalid (zero-prior) worlds, which carry joint weight 0 and are handled by a dite wrapper. Statements are e-generic over 0 < e < 1; exp(−4) is instantiated only in bridging corollaries.

                        def BumfordRett2021.meaningE (sem : UtteranceSigmaEvalWorldBool) (σ : Sigma) (u : Utterance) (w : EvalWorld) :
                        ENNReal

                        Prior-weighted meaning ⟦u⟧(σ,w) · P(w), lifted to ℝ≥0∞.

                        Equations
                        Instances For
                          noncomputable def BumfordRett2021.L0v (sem : UtteranceSigmaEvalWorldBool) (σ : Sigma) (u : Utterance) (w : EvalWorld) :
                          ENNReal

                          Literal-listener value L₀(w | u, σ) = ⟦u⟧(σ,w)·P(w) / D (well-defined and 0 on empty extensions, since 0 · ⊤ = 0 in ℝ≥0∞).

                          Equations
                          Instances For
                            noncomputable def BumfordRett2021.spkW (sem : UtteranceSigmaEvalWorldBool) (e : ) (s : EvalWorld × Sigma) (u : Utterance) :
                            ENNReal

                            Speaker weight L₀(w|u,σ)⁴ · e^C(u).

                            Equations
                            Instances For
                              noncomputable def BumfordRett2021.Sk (sem : UtteranceSigmaEvalWorldBool) (e : ) (s : EvalWorld × Sigma) :

                              Speaker S₁(· | w, σ) : PMF Utterance, dite-guarded so it is total even at invalid worlds (where every weight vanishes; those carry joint prior 0).

                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For
                                def BumfordRett2021.jointW (s : EvalWorld × Sigma) :
                                ENNReal

                                Unnormalised joint prior P(w) · P(σ) (uniform latent absorbed).

                                Equations
                                Instances For
                                  def BumfordRett2021.mkW (h : Fin 9) (m : Fin 5) :

                                  Concise world constructor: mkW h m = (Fin h, Fin m).

                                  Equations
                                  Instances For
                                    noncomputable def BumfordRett2021.jointK :
                                    PMF (EvalWorld × Sigma)

                                    Listener's joint prior over world × σ.

                                    Equations
                                    Instances For
                                      theorem BumfordRett2021.marg_ne_zero {sem : UtteranceSigmaEvalWorldBool} {e : } (he0 : 0 < e) {u : Utterance} {w0 : EvalWorld} {σ0 : Sigma} (hval : worldPrior w0 0) (hnull : sem Utterance.null σ0 w0 = true) (hlic : sem u σ0 w0 = true) :
                                      PMF.marginal (Sk sem e) jointK u 0

                                      Single-witness discharge of the listener's marginal positivity: a valid world w0 licensed for u at some σ0 (with null also licensed there).

                                      Structural speaker/listener monotonicity #

                                      Evaluativity is proved structurally, with no normaliser computation: the per-latent speaker order follows from licensing-set inclusion between two equal-prior worlds. Two equal-prior worlds with the same licensing bit for u have identical speaker numerators; a wider licensing set only enlarges the denominator. Hence a world that is licensed for fewer alternatives (its licensing set is contained in the other's) puts more mass on the observed u. Only 0 < e is used (for strict positivity); nothing needs e < 1.

                                      Positive construction meaning (eq 12):

                                      • unmarked ("tall"): ht_w(j) ≥ μ_w + σ
                                      • marked ("short"): ht_w(j) ≤ μ_w + σ
                                      • null: true
                                      Equations
                                      Instances For
                                        noncomputable def BumfordRett2021.posL1 (u : Utterance) {e : } (he0 : 0 < e) :
                                        PMF (EvalWorld × Sigma)

                                        Positive-construction pragmatic listener L₁(· | u) at cost base e.

                                        Equations
                                        Instances For

                                          Prediction 1: "Tall" shifts height above CC mean #

                                          Hearing "Jane is tall" (unmarked) shifts L₁'s posterior toward worlds where the subject's height exceeds the CC center. At fixed CC center μ = 5 (index 2), height 6 (index 5, dev = +1) becomes more likely than height 4 (index 3, dev = −1). Both worlds have equal prior (6), so the asymmetry is entirely due to pragmatic reasoning. The paper reports E[ht − μ] = 2.08 at L₁.

                                          Prediction 2: "Short" shifts height below CC mean #

                                          Mirror image: hearing "Jane is short" (marked) shifts posterior toward worlds where height is below the CC center. E[ht − μ] = −3.18 at L₁. The marked form is even more evaluative than the unmarked form, because the extra cost signals that the speaker's reason for speaking must be particularly strong.

                                          theorem BumfordRett2021.pos_short_evaluative (e : ) (he0 : 0 < e) :

                                          Keisha's height k, fixed and known to both speaker and listener. In our scaled model: k = 5 (height index 4).

                                          Equations
                                          Instances For
                                            def BumfordRett2021.eqMeaning (u : Utterance) (σ : Sigma) (w : EvalWorld) :
                                            Bool

                                            Exact equative meaning (eq 14):

                                            • unmarked ("as tall as K"): ht = k ∧ k ≥ μ + σ
                                            • marked ("as short as K"): ht = k ∧ k ≤ μ + σ
                                            • null: true

                                            The equative fixes height to k. The informative content is about where k sits relative to the CC center: does the threshold σ classify k as "tall-enough" (unmarked) or "short-enough" (marked)?

                                            Equations
                                            Instances For
                                              noncomputable def BumfordRett2021.eqL1 (u : Utterance) {e : } (he0 : 0 < e) :
                                              PMF (EvalWorld × Sigma)

                                              Exact-equative pragmatic listener L₁(· | u) at cost base e.

                                              Equations
                                              Instances For

                                                Prediction 3: Marked equative is evaluative #

                                                Hearing "Jane is as short as Keisha" (marked) shifts L₁'s posterior toward high-μ worlds where k = 5 is below the CC center — i.e., Keisha's height is atypically low. The paper reports E[k − μ] = −1.06 at L₁.

                                                The mechanism: the speaker paid extra cost (C = 2) for the marked form. By [BLG16]'s logic, L₁ infers the speaker must have had a strong reason — the marked form is distinctively informative in worlds where k is atypically low.

                                                theorem BumfordRett2021.eq_marked_evaluative (e : ) (he0 : 0 < e) :

                                                Prediction 4: Unmarked equative is weakly evaluative #

                                                Hearing "Jane is as tall as Keisha" (unmarked) produces a weak evaluative inference: the posterior slightly favors worlds where k is above the CC center (μ < k). The paper reports E[k − μ] = 0.84 at L₁, much weaker than the marked form's −1.06.

                                                This asymmetry — marked evaluative, unmarked weakly/not evaluative — is the antonym-sensitive pattern that [Ret15] identifies categorically and this model derives gradiently.

                                                ≥ Equative (Minimum-Standard) Semantics #

                                                The "at least as tall as" equative uses ≥ instead of = for height. Unlike the exact equative, the unmarked and marked forms are NOT synonymous: "at least as tall as K" and "at least as short as K" have different truth conditions. Antonymic competition is therefore partial, predicting evaluativity intermediate between the exact equative (full synonymy) and the comparative (no overlap).

                                                ≥ equative meaning (eq 16):

                                                • unmarked ("at least as tall as K"): ht ≥ k ∧ k ≥ μ + σ
                                                • marked ("at least as short as K"): ht ≤ k ∧ k ≤ μ + σ
                                                • null: true
                                                Equations
                                                Instances For
                                                  noncomputable def BumfordRett2021.geqL1 (u : Utterance) {e : } (he0 : 0 < e) :
                                                  PMF (EvalWorld × Sigma)

                                                  ≥-equative pragmatic listener L₁(· | u) at cost base e.

                                                  Equations
                                                  Instances For

                                                    Prediction 5: Marked ≥ equative is evaluative #

                                                    Hearing "Jane is at least as short as Keisha" (marked) shifts L₁'s posterior toward worlds where k is below the CC center. The paper reports E[k − μ] = −1.52 at L₁.

                                                    theorem BumfordRett2021.geq_marked_evaluative (e : ) (he0 : 0 < e) :

                                                    Prediction 6: Unmarked ≥ equative is barely evaluative #

                                                    Hearing "Jane is at least as tall as Keisha" (unmarked) barely shifts the posterior. The paper reports E[k − μ] = 0.11 at L₁ — the weakest evaluative effect of any construction.

                                                    Comparative Semantics #

                                                    The comparative uses strict inequality for height (ht > k / ht < k). Unlike the equatives, the comparative forms have NO semantic overlap: "taller than K" and "shorter than K" are not even close to synonymous. With no antonymic competition and high informativity (the comparative provides clear information worth the cost), there is no pressure for evaluative inference.

                                                    The paper predicts E[k − μ] = −0.74 (unmarked) and −0.44 (marked) at L₁ — both close to zero. The listener guesses Keisha is slightly below the CC mean, but this is a generic probabilistic consequence of learning relative height, not an evaluative inference.

                                                    Comparative meaning (eq 18):

                                                    • unmarked ("taller than K"): ht > k ∧ k ≥ μ + σ
                                                    • marked ("shorter than K"): ht < k ∧ k ≤ μ + σ
                                                    • null: true
                                                    Equations
                                                    Instances For
                                                      noncomputable def BumfordRett2021.compL1 (u : Utterance) {e : } (he0 : 0 < e) :
                                                      PMF (EvalWorld × Sigma)

                                                      Comparative pragmatic listener L₁(· | u) at cost base e.

                                                      Equations
                                                      Instances For

                                                        Prediction 7: Comparative marked does not strongly shift k #

                                                        Unlike the equative, the marked comparative does not push the listener toward worlds where k is far from the CC center. At equal prior (dev = ±1), the world with k near the mean is preferred over the world with k well above the mean. The paper reports E[k − μ] = −0.44 at L₁ — a very weak effect, unlike the equative's −1.06.

                                                        theorem BumfordRett2021.comp_marked_weak (e : ) (he0 : 0 < e) :

                                                        Prediction 8: Comparative unmarked is counter-evaluative #

                                                        Hearing "Jane is taller than Keisha" (unmarked) does NOT make the listener infer that Keisha is tall. In fact, the paper reports E[k − μ] = −0.74, slightly negative: Keisha is inferred to be slightly below the CC mean. This is because knowing Jane exceeds Keisha's height leaves more room for Keisha to be below average.

                                                        theorem BumfordRett2021.comp_unmarked_counter_evaluative (e : ) (he0 : 0 < e) (he_lo : 1 / 100 e) :

                                                        Counter-evaluative comparative — a prior-magnitude effect, not a licensing one. Unlike the seven Tier-A predictions (which hold for every cost base e ∈ (0,1) via evaluative_of_incl's bare 0 < e), here the speaker distribution depends on a world only through its licensing set (the prior cancels inside Sk), so the 10:1 world prior of mkW 5 3 (k at the CC mean) over mkW 5 1 (k above it) is the sole asymmetry and it dominates.

                                                        The prior dominates only when markedness costs are not extreme. The sharp threshold is e ≥ (D_unm(1)/D_null)⁴ = (25/120)⁴ ≈ 0.0019: for e below it, the cost factor e^C so heavily discounts the informative "taller than" utterance in the high-threshold worlds that the informativity cost dominates the prior mass and the inequality flips. We therefore assume 1/100 ≤ e (comfortably above the threshold, and met by the paper's e = exp(−4) ≈ 0.018; see comp_unmarked_counter_evaluative_exp).

                                                        The counter-evaluative comparative at the paper's cost base e = exp(−4). The hypothesis 1/100 ≤ exp(−4) reduces to exp 4 ≤ 100, and exp 4 = (exp 1)⁴ < 2.7182818286⁴ ≈ 54.6 < 100.

                                                        Gradient evaluativity ranking #

                                                        The paper's central prediction (Table 1) is a strict ranking of evaluativity strength across constructions:

                                                        Constructionunmarked E[ht−μ]marked E[ht−μ]Evaluative?
                                                        Positive2.08−3.18✓ ✓
                                                        = Equative0.84−1.06✗ ✓
                                                        ≥ Equative0.11−1.52✗ ✓
                                                        Comparative−0.74−0.44✗ ✗

                                                        This ranking emerges from two factors:

                                                        1. Informativity: The positive construction has an open degree argument (threshold is entirely unknown), making it maximally vague. Equatives fix height to k, reducing uncertainty. Comparatives provide strict ordering, leaving little room for evaluative inference.
                                                        2. Cost asymmetry: The marked form's extra cost (C = 2 vs 1) forces L₁ to seek explanations for the speaker's costly choice, driving evaluative inferences in worlds where the marked form is distinctively informative (i.e., atypical worlds).

                                                        The theorems above verify the key qualitative pattern across all four constructions:

                                                        RSA ↔ Neo-Gricean Agreement #

                                                        [Ret15]'s Neo-Gricean account (formalized in Pragmatics/Implicature/Evaluativity.lean) classifies constructions categorically using AdjectivalConstruction and Polarity:

                                                        This RSA model derives the same pattern gradiently: each L₁ prediction above confirms a qualitative directional prediction that matches the categorical classification. The RSA model adds:

                                                        1. Graded predictions — evaluativity has a continuous strength, not just ±
                                                        2. Unified mechanism — rational communication replaces separate Q/R principles
                                                        3. ≥ equative predictions — partial overlap produces intermediate evaluativity, a novel prediction the categorical account does not make
                                                        @[reducible, inline]

                                                        Construction labels for each simulation, connecting to the AdjectivalConstruction type from Semantics/Degree/Gradability/Construction.lean.

                                                        Equations
                                                        Instances For

                                                          Cross-theory agreement: the RSA model and [Ret15]'s Neo-Gricean account agree on the full evaluativity paradigm.

                                                          • Positive: Neo-Gricean says evaluative for both polarities (Q-implicature). RSA confirms: both "tall" and "short" shift the posterior away from the CC mean.
                                                          • Equative: Neo-Gricean says evaluative for negative only (Manner implicature). RSA confirms: marked form shifts strongly, unmarked weakly.
                                                          • Comparative: Neo-Gricean says never evaluative (no applicable implicature). RSA confirms: neither polarity shifts strongly.

                                                          This theorem connects two independent formalizations — the categorical deriveEvaluativity function and the RSA L1 predictions — proving they make compatible predictions despite using entirely different mechanisms.

                                                          Relationship to [LG17] #

                                                          LassiterGoodman2017PMF.lean formalizes the threshold RSA model for the positive construction only (1D world = height, latent = threshold). This file extends that model to 2D worlds (height × CC center) and adds cost-driven antonym competition. The positive construction predictions here subsume LassiterGoodman2017's: both show that hearing "tall"/"short" shifts the height posterior.

                                                          Architecture #

                                                          The model runs on the mathlib-PMF RSA pipeline (RSA.Canonical.L1), with the latent threshold offset σ as the joint-listener's second coordinate:

                                                          Evaluativity (Tier A) is proved structurally from licensing-set inclusion and needs only 0 < e; the counter-evaluative comparative is the sole prior-magnitude case and needs 1/100 ≤ e.