Bidirectional Optimality Theory #
@cite{blutner-2000} @cite{horn-1984} @cite{atlas-levinson-1981}
Bidirectional OT integrates production and comprehension optimality into a single framework. The key insight is that the Gricean maxims decompose into two opposing optimization pressures:
Q-principle (production optimality): for a given meaning τ, the optimal form A is the one that produces ⟨A, τ⟩ most harmonically. Blocking mechanism: suppresses interpretations that can be expressed more economically by an alternative form. Corresponds to the first part of Grice's Quantity maxim ("be as informative as required") and the force of diversification / auditor's economy (@cite{atlas-levinson-1981}, @cite{horn-1984}).
I-principle (comprehension optimality): for a given form A, the optimal meaning τ is the one that produces ⟨A, τ⟩ most harmonically. Interpretation mechanism: selects the most coherent/stereotypical reading. Corresponds to the second part of Grice's Quantity maxim ("do not make your contribution more informative than required"), Relation, Manner, and the force of unification / speaker's economy (@cite{atlas-levinson-1981}, @cite{horn-1984}'s R-principle).
These correspond to the speaker/hearer optimization layers in RSA (@cite{frank-goodman-2012}) and IBR (@cite{franke-2011}).
Strong vs Weak BiOT #
@cite{blutner-2000} defines two versions:
Strong (eq. 9): Q and I are independent — ⟨A, τ⟩ is optimal iff it survives BOTH Q and I evaluated against the full candidate set. Solution concept: Nash Equilibrium. Substrate:
strongOptimal(Finset, computable) +IsStrongOptimal(Set-valued Prop).Weak (eq. 14): Q and I mutually constrain each other — competition under Q is restricted to I-surviving pairs, and vice versa. Solution concept: greatest fixed point of the blocking operator. Substrate:
superoptimal(Finset, computable) +superoptimalSet(Set-valued, anchored in mathlib'sOrderHom.gfp).
The weak version derives @cite{horn-1984}'s division of pragmatic labour: unmarked forms pair with unmarked (stereotypical) meanings, and marked forms pair with marked (marked) meanings. The strong version incorrectly predicts that marked forms are blocked in ALL their interpretations.
The structural meta-theorem strong ⊂ weak (@cite{blutner-2000} p. 12)
lives in Core/Constraint/Evaluation/Superoptimal.lean as
isStrongOptimal_imp_mem_superoptimalSet (Set-valued, coinductive
proof against OrderHom.gfp) and its Finset corollary
strongOptimal_subset_superoptimal.
Connection to RSA and IBR #
All three frameworks — BiOT, IBR, RSA — perform alternating speaker/hearer optimization. They differ in the optimization criterion:
| Framework | Comparison | Solution |
|---|---|---|
| Strong BiOT | lexicographic (OT) | Nash equilibrium |
| Weak BiOT | lexicographic (OT) | greatest fixed point |
| IBR | argmax (set-level) | iterated best response |
| RSA | softmax (probabilistic) | Bayesian posterior |
The relationship: RSA α→∞ approximates IBR, and IBR fixed points
correspond to weak-BiOT super-optimal pairs for scalar games
(see IBR/ScalarGames.lean).
These are List-Bool versions of the per-pair Q and I checks, used by
consumer studies (e.g. Blutner2000.lean's accommodation_blocked_is_Q)
that want to check Q or I in isolation rather than the conjunction. The
relationship to the substrate's Set-valued Blocks predicate:
`¬ Blocks profile S p ↔ (no Q-blocker in S for p) ∧ (no I-blocker in S for p)`
where Q-blocker = same meaning, different form; I-blocker = same form,
different meaning. So `satisfiesQ` and `satisfiesI` together pin down the
`¬ Blocks` condition that defines `IsStrongOptimal` and (via the gfp)
`superoptimalSet`.
Q-principle (production optimality, @cite{blutner-2000} eq. 9(Q)): ⟨A, τ⟩ satisfies Q iff no other pair with the same meaning τ but a different form A' is strictly more harmonic.
Operationally: the speaker chose A because it is the best way to express τ. If a better form A' existed, A would be blocked. This is the blocking mechanism — it suppresses interpretations that can be triggered more economically by an alternative form (@cite{horn-1984}'s Q-based implicature).
Equations
- Pragmatics.Bidirectional.satisfiesQ gen profile p = gen.all fun (q : F × M) => decide (q.2 = p.2 → q.1 ≠ p.1 → ¬Core.Constraint.Evaluation.LexLT (profile q) (profile p))
Instances For
I-principle (comprehension optimality, @cite{blutner-2000} eq. 9(I)): ⟨A, τ⟩ satisfies I iff no other pair with the same form A but a different meaning τ' is strictly more harmonic.
Operationally: the hearer chose τ because it is the best interpretation of A. If a better meaning τ' existed, τ would be dispreferred. This is the interpretation mechanism — it selects the most coherent/stereotypical reading (@cite{horn-1984}'s R-based inference).
Equations
- Pragmatics.Bidirectional.satisfiesI gen profile p = gen.all fun (q : F × M) => decide (q.1 = p.1 → q.2 ≠ p.2 → ¬Core.Constraint.Evaluation.LexLT (profile q) (profile p))
Instances For
@cite{horn-1984}'s division of pragmatic labour: "unmarked forms tend to be used for unmarked situations and marked forms for marked situations." This emerges naturally from weak BiOT but NOT from strong BiOT. The example below formalizes @cite{blutner-2000}'s tableau (15).
Classic examples:
- kill (unmarked) / cause to die (marked): kill ↔ stereotypical
causation (direct), cause to die ↔ non-stereotypical (indirect)
- pork (unmarked) / pig-meat (marked, blocked): pork ↔ meat reading,
pig ↔ animal reading — *pig-meat is totally blocked because pork
exists
- him (unmarked) / himself (marked): himself ↔ coreferential,
him ↔ disjoint reference
Forms in Horn's division example.
Instances For
Equations
- Pragmatics.Bidirectional.instDecidableEqHornForm x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Meanings in Horn's division example.
- stereotypical : HornMeaning
- nonStereotypical : HornMeaning
Instances For
Equations
- Pragmatics.Bidirectional.instDecidableEqHornMeaning x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
All form-meaning pairs (forms are semantically equivalent).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Constraint profile: [F-violations, C-violations]. F penalizes marked forms; C penalizes non-stereotypical meanings.
Equations
- Pragmatics.Bidirectional.hornProfile (Pragmatics.Bidirectional.HornForm.unmarked, Pragmatics.Bidirectional.HornMeaning.stereotypical) = [0, 0]
- Pragmatics.Bidirectional.hornProfile (Pragmatics.Bidirectional.HornForm.unmarked, Pragmatics.Bidirectional.HornMeaning.nonStereotypical) = [0, 1]
- Pragmatics.Bidirectional.hornProfile (Pragmatics.Bidirectional.HornForm.marked, Pragmatics.Bidirectional.HornMeaning.stereotypical) = [1, 0]
- Pragmatics.Bidirectional.hornProfile (Pragmatics.Bidirectional.HornForm.marked, Pragmatics.Bidirectional.HornMeaning.nonStereotypical) = [1, 1]
Instances For
Weak BiOT derives Horn's division: unmarked ↔ stereotypical, marked ↔ non-stereotypical. Both pairs survive.
Strong BiOT blocks the marked form entirely — only the unmarked pair survives. This is empirically wrong: marked forms DO get used (for marked meanings).
The weak version admits strictly MORE pairs than the strong version for this case. This is the key empirical argument for weak BiOT.
When an unmarked form has a SPECIALIZED meaning (not semantically equivalent to the marked form), the marked form can be totally blocked — it loses ALL its interpretations.
Example: pork (= pig-meat) blocks *pig in the meat reading.
Unlike Horn's division, here the unmarked form doesn't compete
for the non-stereotypical meaning because Gen doesn't pair them.
Forms for total blocking: a listed (lexicalized) form and a derived (productive) form.
Instances For
Equations
- Pragmatics.Bidirectional.instDecidableEqLexForm x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Equations
- Pragmatics.Bidirectional.instDecidableEqLexMeaning x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Gen for total blocking: the listed form only covers the specialized meaning; the derived form covers both meanings.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Profile: listed form is less marked (0 F-violations); specialized meaning is less marked (0 C-violations).
Equations
- Pragmatics.Bidirectional.totalBlockProfile (Pragmatics.Bidirectional.LexForm.listed, Pragmatics.Bidirectional.LexMeaning.specialized) = [0, 0]
- Pragmatics.Bidirectional.totalBlockProfile (Pragmatics.Bidirectional.LexForm.derived, Pragmatics.Bidirectional.LexMeaning.specialized) = [1, 0]
- Pragmatics.Bidirectional.totalBlockProfile (Pragmatics.Bidirectional.LexForm.derived, Pragmatics.Bidirectional.LexMeaning.general) = [1, 1]
- Pragmatics.Bidirectional.totalBlockProfile (Pragmatics.Bidirectional.LexForm.listed, Pragmatics.Bidirectional.LexMeaning.general) = [0, 1]
Instances For
Total blocking: the derived form loses the specialized meaning (blocked by the listed form) but retains the general meaning. Result: listed ↔ specialized, derived ↔ general.
Strong BiOT over-blocks: the derived form loses ALL interpretations because (.derived, .specialized) is I-better than (.derived, .general) under the same form. This makes strong BiOT too aggressive for partial blocking — only the listed form survives.
Weak BiOT correctly handles partial blocking — the derived form keeps the general meaning because its I-competitor (.derived, .specialized) was removed by Q-blocking from (.listed, .specialized).
The structural meta-theorem strongOptimal pairs profile ⊆ superoptimal pairs profile (@cite{blutner-2000} p. 12) is proved coinductively in the
substrate at Core/Constraint/Evaluation/Superoptimal.lean via
isStrongOptimal_imp_mem_superoptimalSet (Set-valued, against mathlib's
OrderHom.gfp) and strongOptimal_subset_superoptimal (Finset
corollary using the bridge theorem). The applications below are one-line
discharges of the convergence hypothesis via by decide.
Substrate's strong ⊂ weak applied to the Horn example.
Strict inclusion in the Horn example: weak admits the marked-pair that strong eliminates.
Substrate's strong ⊂ weak applied to total blocking.