Documentation

Linglib.Studies.ScontrasTonhauser2025

[ST25]: projection without lexical presupposition #

An RSA model of know under negation (SuB 29, pp. 1431–1448): the pragmatic listener jointly infers the world state and the speaker's private assumptions (eq. 7), the speaker scores QUD-projected informativity (eq. 6, α = 10, fn. 12), and the literal listener answers the QUD within the assumption set (eq. 5). Projection of the complement C emerges with no lexically-specified constraint on the common ground. Domain: 6 utterances × 4 worlds (BEL × C) × 15 belief states × 2 QUDs; literal semantics from Semantics.Attitudes.Factivity (know = factivePos, think = nonFactivePos).

Main results #

Implementation notes #

α = 10 is a natural power, so the chain is exact ℚ≥0: the speaker is RSA.Score.s1 over the QUD-aggregated literal listener with degenerate cells falling back to cPos (one declaration covering both faces), the listener is PMF.ofScores, and every prediction is one kernel-verified event-mass comparison.

Utterance cost (§3.1: complex = 2 × simple) is omitted: exp(−αC) is transcendental, and the omission reverses prediction (2a)'s world-marginal direction (the paper derives know > think with cost, Figure 7a); predictions (2b) and (2c) are robust to it. Prior parameters follow §3.1: P(C) ∈ {2/3, 1/3}, P(BEL | C) = 1/2, uniform over the 15 assumption sets.

TODO #

Model the cost term (log-rational costs keep ℚ exactness) and derive prediction (2a)'s world-marginal direction. Prove the structural equivalence with [QGL16] (fn. 10: "private assumptions" vs "common ground" is interpretive, not computational) — see the matching TODO in QingGoodmanLassiter2016.lean.

Worlds and utterances #

World state: (BEL, C) where BEL = Cole believes C, C = complement is true. Flat inductive for tactic enumerability.

Instances For
    @[implicit_reducible]
    Equations
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For
      @[implicit_reducible]
      Equations
      • One or more equations did not get rendered due to their size.

      Attitude verb utterances about Cole's mental state, plus bare complement assertions.

      Instances For
        @[implicit_reducible]
        Equations
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For
          @[implicit_reducible]
          Equations
          • One or more equations did not get rendered due to their size.

          Literal truth conditions (from Factivity) #

          Speaker belief states (the 15 non-empty subsets of W) #

          Speaker's private assumptions: all 15 non-empty subsets of W. Section 3 follows [QGL16]: A ranges over all non-empty subsets of the world space.

          Instances For
            @[implicit_reducible]
            Equations
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For
              @[implicit_reducible]
              Equations
              • One or more equations did not get rendered due to their size.

              Membership in belief state. Boolean operations on WorldState fields reduce cleanly under kernel evaluation.

              Equations
              Instances For

                Priors #

                World prior sums to 1 whenever P(C) is a probability.

                The score chain #

                def ScontrasTonhauser2025.l0Score (pC : ℚ≥0) (bs : BeliefState) (u : Utterance) :
                WorldStateℚ≥0

                Literal listener within the assumption set (eq. 5): prior conditioned on worlds where u is true, empty extensions normalizing to zero rows.

                Equations
                • One or more equations did not get rendered due to their size.
                Instances For

                  QUD aggregation of the literal listener (eq. 5).

                  Equations
                  • One or more equations did not get rendered due to their size.
                  Instances For

                    Speaker (eq. 6 at α = 10, zero cost): RSA.Score.s1 over the QUD-aggregated literal listener; degenerate cells fall back to cPos — one declaration covering both faces.

                    Equations
                    • One or more equations did not get rendered due to their size.
                    Instances For

                      Listener world score (eq. 7; uniform assumption prior).

                      Equations
                      Instances For
                        noncomputable def ScontrasTonhauser2025.l1World (qud : Semantics.Attitudes.Factivity.QUD) (pC : ℚ≥0) (u : Utterance) :

                        Pragmatic listener over worlds (eq. 7).

                        Equations
                        Instances For
                          noncomputable def ScontrasTonhauser2025.l1CEvent (qud : Semantics.Attitudes.Factivity.QUD) (pC : ℚ≥0) (u : Utterance) :
                          ENNReal

                          The listener's C-marginal.

                          Equations
                          Instances For

                            Listener predictions #

                            Under the BEL? QUD the C-marginal equals the prior, for every utterance: S1 depends on the world only through its BEL-cell, so the C-dimension washes out of the world posterior. Projection is invisible at the world marginal.

                            Under the C? QUD, knowNeg is evidence against C — ¬(BEL∧C) is compatible with C-false worlds — so thinkNeg preserves P(C) better.

                            Prediction 2a: utterance effect (know > think) #

                            The paper derives 2a with utterance cost (Figure 7a); the cost-free model does not — under BEL? both marginals sit at the prior, and under C? the direction reverses (c_qud_thinkNeg_higher). Fn. 11's A-marginal measure (P(A ⊧ C | u)) may capture the effect without cost; see the module TODO.

                            Prediction 2c (QUD effect): under BEL? the C-marginal stays at the prior (bel_qud_marginal_eq_prior_high), while under C? the literal semantics of "doesn't know C" lowers it.

                            Structural properties #

                            "know" entails C (from factivePos_entails_c).

                            "think" does NOT entail C.

                            "know" entails BEL (from factivePos_entails_bel).

                            Know entails think (factivity is strictly stronger than belief).

                            knowNeg is compatible with strictly more worlds than thinkNeg (3 vs 2), making it the weaker (less informative) negation.

                            Exactly 3 of 15 belief states assume C: onlyW11, onlyW01, cTrue.

                            Experimental data #

                            Effect size from a linear mixed-effects model. p values are upper bounds (paper reports "< .001").

                            • β :
                            • se :
                            • t :
                            • p :
                            Instances For
                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For

                                Exp 1: Utterance effect (know > think).

                                Equations
                                Instances For

                                  Exp 1: Prior effect (higher > lower).

                                  Equations
                                  Instances For

                                    Exp 1: QUD effect (NOT significant — manipulation too weak).

                                    Equations
                                    Instances For

                                      Exp 2: Utterance effect (know > think).

                                      Equations
                                      Instances For

                                        Exp 2: QUD effect (significant with stronger manipulation).

                                        Equations
                                        Instances For
                                          Instances For
                                            @[implicit_reducible]
                                            Equations
                                            Equations
                                            • One or more equations did not get rendered due to their size.
                                            Instances For

                                              Which experiments support each hypothesis. Exp 1: (2a) yes, (2b) yes, (2c) no (QUD not significant). Exp 2: (2a) yes, (2b) not tested, (2c) yes.

                                              Equations
                                              Instances For

                                                BToM bridge #

                                                Characteristic function: does the speaker assume C?

                                                Equations
                                                Instances For

                                                  Belief states that assume C have indicator 1.

                                                  Belief states that don't assume C have indicator 0.

                                                  Model–data connection #

                                                  Connection to [DT21] #

                                                  The prior effect found by S&T 2025 (β = 0.16 > 0) replicates the positive prior effect of [DT21] (β = 0.14 categorical, β = 0.28 individual): higher prior probability of the complement content leads to stronger projection. [DT21] makes this structural — any prior-sensitive (monotone) account predicts the modulation (DegenTonhauser2021.sensitive_predicts_modulation) — and the RSA model's prediction_2b provides the same explanation here.

                                                  Compositional filtering vs BToM #

                                                  Compares the BToM model against compositional filtering ([Hei83]): presuppositions project through connectives via local context computation; the filtering presupposition of "if A then B_p" is "A → p".

                                                  The key empirical argument: the BToM model predicts that even non-factive "think" — which has NO presupposition to filter — exhibits projection effects, because L1 can still infer what the speaker takes for granted. Compositional filtering predicts a trivial presupposition for non-presuppositional items.

                                                  The filtering prediction for "if A then know-C": the presupposition of the consequent (= C) is filtered by the antecedent. Result: conditional presupposes "A → C".

                                                  Equations
                                                  Instances For

                                                    The filtering prediction for "if A then think-C": "think" has no presupposition, so filtering produces a trivial result.

                                                    Equations
                                                    Instances For
                                                      theorem ScontrasTonhauser2025.filtering_know_nontrivial {W : Type u_1} (a c : WProp) (h : ∃ (w : W), a w ¬c w) :
                                                      ∃ (w : W), ¬(filteringPrediction_know a c).presup w

                                                      Filtering predicts non-trivial presupposition for "know": The presupposition of "if A then know-C" is ¬A ∨ C (= A → C), which is NOT tautological.

                                                      Filtering predicts TRIVIAL presupposition for "think": The presupposition of "if A then think-C" is always true, regardless of A, because "think" contributes no presupposition.

                                                      BToM predicts projection effects for ANY verb in conditional environments, because projection arises from pragmatic reasoning about the speaker's private assumptions, not from lexical presupposition.

                                                      The key mechanism: when A and C are related (correlated in the prior), the listener infers that a speaker who utters "if A, X Vs C" likely takes C for granted — regardless of whether V is factive.

                                                      • factive_projects : Bool

                                                        Whether projection is predicted for factive verbs in conditionals.

                                                      • nonFactive_projects : Bool

                                                        Whether projection is predicted for non-factive verbs in conditionals.

                                                      • relatedness_modulates : Bool

                                                        Whether relatedness modulates projection strength.

                                                      Instances For

                                                        BToM predictions: both factive and non-factive show conditional presupposition, modulated by relatedness.

                                                        Equations
                                                        Instances For

                                                          Filtering predictions: only factive shows conditional presupposition, with no role for relatedness (it's purely structural).

                                                          Equations
                                                          Instances For

                                                            Strict subsumption: BToM predicts everything filtering predicts (factive conditional presupposition) plus more (non-factive conditional presupposition, relatedness modulation).

                                                            The critical divergence: For non-factive "think" in conditionals, filtering predicts trivial presupposition (no projection), while BToM predicts non-trivial projection modulated by relatedness.

                                                            This is the [ST25] argument: if projection were due to compositional filtering alone, non-presuppositional items like "think" should show no effect. But BToM predicts projection effects even for "think", because L1 infers the speaker's private assumptions regardless of the verb's factivity status.

                                                            theorem ScontrasTonhauser2025.filtering_is_limiting_case {W : Type u_1} (a c : WProp) (h_entails : ∀ (w : W), a wc w) (w : W) :

                                                            Filtering is a special case: When relatedness is maximal (A entails C), BToM's projection prediction converges to the filtering prediction. Filtering captures the structural component; BToM adds the probabilistic modulation.