Linglib.Theories.Pragmatics.RSA.Speaker.CombinedUtility

RSA.CombinedUtility.normalizeWeights3 wA wB wC = if (wA + wB + wC == 0) = true then (0, 0, 0) else (wA / (wA + wB + wC), wB / (wA + wB + wC), wC / (wA + wB + wC))

Instances For

source

def RSA.CombinedUtility.goalOrientedUtility (uEpi uGoal β : ℚ) :

ℚ

Goal-oriented speaker utility: U_epi + β · U_goal.

This parameterization naturally models argumentative/persuasive speakers:

@cite{barnett-griffiths-hawkins-2022}: U_goal = ln P_L0(w*|u), β controls persuasive bias
@cite{cummins-franke-2021}: U_goal = argStr(u, G), β → ∞ for pure argStr speaker

Equivalent to combinedWeighted(1, β, U_epi, U_goal). The parameter β controls the cooperativity spectrum:

β = 0: fully cooperative (standard RSA)
0 < β < ∞: partially argumentative
β → ∞: purely argumentative

Equations

RSA.CombinedUtility.goalOrientedUtility uEpi uGoal β = uEpi + β * uGoal

Instances For

source

theorem RSA.CombinedUtility.goalOriented_eq_combinedWeighted (uEpi uGoal β : ℚ) :

goalOrientedUtility uEpi uGoal β = combinedWeighted 1 β uEpi uGoal

Goal-oriented utility = combinedWeighted(1, β,...)

source

theorem RSA.CombinedUtility.goalOriented_cooperative (uEpi uGoal : ℚ) :

goalOrientedUtility uEpi uGoal 0 = uEpi

At β=0, goal-oriented utility reduces to pure epistemic (cooperative RSA)

source

theorem RSA.CombinedUtility.goalOriented_mono_beta (uEpi uGoal β₁ β₂ : ℚ) (hβ : β₁ < β₂) (hGoal : 0 < uGoal) :

goalOrientedUtility uEpi uGoal β₁ < goalOrientedUtility uEpi uGoal β₂

Higher β increases utility of goal-supporting utterances (U_goal > 0)

source

theorem RSA.CombinedUtility.goalOriented_antimono_beta_neg (uEpi uGoal β₁ β₂ : ℚ) (hβ : β₁ < β₂) (hGoal : uGoal < 0) :

goalOrientedUtility uEpi uGoal β₂ < goalOrientedUtility uEpi uGoal β₁

Negative U_goal DECREASES utility as β increases — the speaker is penalized for utterances that argue AGAINST the goal.

source

def RSA.CombinedUtility.betaToLam (β : ℚ) :

ℚ

Convert additive bias parameter β ∈ [0,∞) to convex weight λ ∈ [0,1).

β/(1+β) maps [0,∞) → [0,1): β=0 ↦ 0, β=1 ↦ 1/2, β→∞ ↦ 1.

This bridges goalOrientedUtility (additive: U + β·V) and combined (convex: (1-λ)·U + λ·V).

Equations

RSA.CombinedUtility.betaToLam β = β / (1 + β)

Instances For

source

def RSA.CombinedUtility.lamToBeta (lam : ℚ) :

ℚ

Convert convex weight λ ∈ [0,1) back to additive bias parameter β.

λ/(1-λ) maps [0,1) → [0,∞): λ=0 ↦ 0, λ=1/2 ↦ 1.

Equations

RSA.CombinedUtility.lamToBeta lam = lam / (1 - lam)

Instances For

source

theorem RSA.CombinedUtility.betaToLam_lamToBeta_inv (lam : ℚ) (hlam0 : 0 ≤ lam) (hlam1 : lam < 1) :

betaToLam (lamToBeta lam) = lam

Round-trip: betaToLam (lamToBeta λ) = λ for λ ∈ [0,1).

source

theorem RSA.CombinedUtility.lamToBeta_betaToLam_inv (β : ℚ) (hβ : 0 ≤ β) :

lamToBeta (betaToLam β) = β

Round-trip: lamToBeta (betaToLam β) = β for β ≥ 0.

source

theorem RSA.CombinedUtility.goalOriented_eq_scaled_combined (uEpi uGoal β : ℚ) (hβ : 0 ≤ β) :

goalOrientedUtility uEpi uGoal β = (1 + β) * combined (betaToLam β) uEpi uGoal

The key bridge: goalOrientedUtility = (1+β) · combined(β/(1+β),...).

U_epi + β·U_goal = (1+β) · ((1 - β/(1+β))·U_epi + β/(1+β)·U_goal)

Scaling by (1+β) > 0 preserves utterance rankings, so the additive and convex forms are strategically equivalent.

source

theorem RSA.CombinedUtility.goalOriented_same_ranking (uEpi uGoal uEpi' uGoal' β : ℚ) (hβ : 0 ≤ β) (hord : goalOrientedUtility uEpi uGoal β > goalOrientedUtility uEpi' uGoal' β) :

combined (betaToLam β) uEpi uGoal > combined (betaToLam β) uEpi' uGoal'

Utterance ranking equivalence: for β ≥ 0, goalOrientedUtility and combined rank any two utility pairs the same way (scaling by (1+β) > 0 preserves ordering).

If U_epi + β·U_goal > U_epi' + β·U_goal', then combined(β/(1+β), U_epi, U_goal) > combined(β/(1+β), U_epi', U_goal').