@cite{beaver-2004}: The Optimization of Discourse Anaphora #

Beaver reformulates @cite{grosz-joshi-weinstein-1995} Centering (and the @cite{brennan-friedman-pollard-1987} algorithmic version) as an Optimality Theory system over six ranked constraints (his §3.2 p. 14):

AGREE > DISJOINT > PRO-TOP > FAM-DEF > COHERE > ALIGN

His main theorem (his §3.4 (20), proven Appendix A): given a sentence in which the only definite expressions are proper nouns and pronouns, if either COT or BFP uniquely predicts an interpretation involving fully anaphoric resolution, then both do, and they resolve anaphors identically.

@cite{poesio-stevenson-eugenio-hitzeman-2004} §3.1 fn 12 endorse this OT reformulation as the right framework for "unpacking" Centering's constraints into ranked-OT terms.

Design: deep reuse of Centering primitives #

Per the audit's mathlib-style recommendation, three of Beaver's six constraints are LITERAL RESTATEMENTS of existing Centering primitives (Beaver acknowledges this in his §3.2-3.3 prose):

Beaver constraint	Beaver's source	Our definition
PRO-TOP	"essentially the effect of Centering's Rule 1" (his p. 15-16)	via `Rule1Gordon` from `Centering/Rule1.lean` (NOT `Rule1GJW95`)
COHERE	"the conditions used in BFP to specify transition types" (his p. 17)	via `cb` (topic stability test)
ALIGN	same	via `cb` + `cp` (topic in preferred-center position)
AGREE	binding theory (his p. 14)	NEW (substrate-gap-flagged: `Realization` lacks number/gender features)
DISJOINT	Principle B (his p. 15)	NEW (substrate-gap-flagged: `Utterance` lacks predicate-argument structure)
FAM-DEF	familiarity (Heim 1982; his p. 16)	NEW (substrate-gap-flagged: `Realization` lacks definiteness)

The 3 reused constraints make Theorem (20)'s equivalence with BFP structural on those clauses (it follows from the constraints being literal restatements). The 3 novel constraints fire as user-supplied boolean flags on the candidate type — substrate gaps that future commits can fill (number/gender features; predicate-argument structure; definiteness marking).

This is the PMF-vs-Measure mathlib pattern applied: Centering and OT are sibling substrates with their own vocabularies; the bridge lives in this paper-anchored study file rather than as a substrate-level identity. Where Beaver explicitly equates his constraints with Centering primitives, we define them via those primitives (deep reuse). Where Beaver introduces independent content (AGREE/DISJOINT/ FAM-DEF), we keep them independent.

Scope #

This file mechanizes:

The 6 COT constraints with the design above.
Beaver's tableau examples (5b), (12c) — predicting BFP-equivalent resolutions.
Theorem (20) witnesses on those examples (the COT-optimal candidate matches the BFP-survivor).
Beaver §4.1's PRO-TOP demotion: a re-ranked constraint hierarchy (PRO-TOP below FAM-DEF) gives Beaver (2c) "Mary likes tennis. She plays Jim. He used to play doubles with Mary." the bound reading where BFP overpredicts. The PSDH §3.1 fn 12 critique mechanized.

Out of scope (Beaver's §5 extensions):

Production / bidirectional grammar (his §5.1-5.3)
Multi-sentence text optimization (his §5.4 with examples (24)/(25))
Accented pronoun interpretation (his §6)
Beaver's full Appendix A proof of Theorem (20) — we mechanize per-example witnesses, not the general schematic proof.