@cite{poesio-stevenson-eugenio-hitzeman-2004}: Centering as a Parametric Theory #

Poesio, Stevenson, Di Eugenio & Hitzeman (2004), "Centering: A Parametric Theory and Its Instantiations." Computational Linguistics 30(3): 309–363. URL https://aclanthology.org/J04-3003/.

PSDH's headline contribution: Centering is not a single theory but a family of theories indexed by parameters. The competing definitions of the CB, the utterance unit, the "previous utterance," realization, the CF filter, ranking, the "pronoun" filter, and segmentation that have been proposed in the literature since @cite{grosz-joshi-weinstein-1995} are settings of these parameters, and different settings make different empirical claims true.

The 8 parameter axes (PSDH §3.4 p. 326) #

| Parameter | Substrate location | Variants formalized in linglib |
| --- | --- | --- |
| `CBdef` (which definition of CB) | `Centering/Basic.lean` `cb` | Constraint 3 (the canonical Cb-via-highest-Cf-realized definition) |
| `uttdef` (sentence vs finite clause vs verbed clause) | `Centering/Defs.lean` `Utterance` | sentence-level only (no clause-decomposition substrate) |
| `previous utterance` (Kameyama vs Suri-McCoy adjunct treatment) | not formalized | – |
| `realization` (direct vs indirect/bridging) | `Realizes` typeclass | direct only (`utteranceRealizes`); bridging not formalized |
| `CF-filter` (1st/2nd person, predicative NPs) | not formalized | – |
| `rank` (CfRanker choice) | `CfRankerOf E R` typeclass | `GrammaticalRole` (Kameyama 1986), `StrubeHahnInfoStatus` (Strube-Hahn 1999, projecting from `Features.GivennessStatus` per the post-Krifka substrate); `LinearOrder` (Rambow 1993) deferred because of a substrate gap (`Realization` lacks a position field) |
| `prodef` (which "pronouns" count for Rule 1) | `Pronominalizes` typeclass | utterance's `isPronoun` flag |
| `segmentation` | not formalized | – |

The 4 axes the substrate plugs into (`CBdef` through `cb`, and `realization`, `prodef`, and `rank` through the `Realizes`, `Pronominalizes`, and `CfRankerOf` typeclasses) are the parametric story in Lean form: different instances produce different `cb`/`cbAll`/Rule 1 satisfaction predictions on the same data. The other 4 axes (`uttdef`, `previous utterance`, `CF-filter`, `segmentation`) are corpus-operational choices such as sentence-segmentation rules and NP-filter rules; we mention them in prose rather than expose them as Lean parameters, per the audit's "formalize the type-changing axes, not the bookkeeping ones" recommendation.
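
As a rough illustration of how a ranking typeclass can change the predicted CB on the same entities, here is a minimal, self-contained sketch in core Lean 4. Every name in it (`Ranker`, `Role`, `InfoStatus`, `cbSketch`, and the toy data) is an assumption made for this example; none of it is linglib's actual `CfRankerOf`, `Utterance`, or `cb`.

```lean
-- Hypothetical sketch: `Ranker`, `Role`, `InfoStatus`, and `cbSketch` are
-- illustrative names, not linglib definitions.

/-- Assumed ranking interface: a larger score means a more salient entity. -/
class Ranker (R : Type) where
  score : R → Nat

inductive Role where
  | subj | obj | other

inductive InfoStatus where
  | hearerOld | mediated | hearerNew

instance : Ranker Role where
  score r := match r with
    | Role.subj => 2
    | Role.obj => 1
    | Role.other => 0

instance : Ranker InfoStatus where
  score s := match s with
    | InfoStatus.hearerOld => 2
    | InfoStatus.mediated => 1
    | InfoStatus.hearerNew => 0

/-- Highest-scoring entity of the previous utterance that the current one
realizes, under whichever `Ranker` instance the annotation type selects. -/
def cbSketch {R : Type} [Ranker R]
    (prev : List (String × R)) (cur : List String) : Option String :=
  -- Keep only the previous-utterance entities that the current one realizes.
  let realized := prev.filter (fun p => cur.contains p.1)
  -- Best score among the realized entities (0 if there are none).
  let best := realized.foldl (fun acc p => max acc (Ranker.score p.2)) 0
  -- First realized entity that reaches the best score.
  (realized.find? (fun p => Ranker.score p.2 == best)).map (fun p => p.1)

-- Toy data: the same two entities annotated along two different axes.
def byRole : List (String × Role) :=
  [("john", Role.subj), ("mary", Role.obj)]

def byStatus : List (String × InfoStatus) :=
  [("john", InfoStatus.hearerNew), ("mary", InfoStatus.hearerOld)]

#eval cbSketch byRole ["john", "mary"]    -- some "john"
#eval cbSketch byStatus ["john", "mary"]  -- some "mary"
```

The point of the sketch is only that the function body never changes: swapping the annotation type swaps the instance, and the predicted CB changes with it.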

What this file mechanizes #

PSDH §4.1.1 example (10): the corner-cupboard / Branicki utterance pair, where partial GF ranking yields two CBs because two NPs tie at the highest grammatical-function rank among the realized entities. This is the load-bearing example for `cbAll`: `cb` returns just the first of the tied entities, whereas `cbAll` returns the complete tied-at-top set, exposing the violation of weak Constraint 1.
Sidner1983 partial-GF witness (not a structural bridge, per the audit): PSDH §5.3.4 (p. 358) say that two-CB-under-partial-GF configurations are "reminiscent of" the examples that led @cite{sidner-1979} to argue for two foci. Our witness theorem (`psdh_two_cb_witnesses_sidner_two_foci`) establishes that the PSDH (10) configuration and a constructed Sidner-side encoding both exhibit "two-ness": two CBs on one side, two distinct foci on the other. The structural translation function `centeringToSidner`, which would derive the Sidner state from the Centering data, is deferred (see §5 future work).
PSDH §5.2.2 entity-coherence dissociability witness: PSDH §5.2.2 (p. 353) argue that entity coherence (Centering's domain) and relational coherence (Hobbs/Kehler/RST) are dissociable, in that entity coherence can be absent while a discourse remains locally coherent through relational connections. PSDH (23) (the Product A pharmaceutical leaflet, p. 354) is the canonical example: every adjacent utterance pair has `cb = none` (a NULL transition under any vanilla instantiation), yet the discourse is coherent via instructional connectives ("If you have any questions ... ask your doctor"). Our witness theorem establishes the "every transition NULL" property for this discourse; the relational-coherence side stays in prose because the bridge in `Coherence.lean` does not model instructional or temporal connectives.
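
The difference between a first-of-the-ties CB and a full tied set, and the "every transition NULL" property, can be sketched in a few lines of core Lean 4. This is a toy model under stated assumptions: ranks are plain `Nat` values (equal ranks stand in for a partial ranking), and `cbAllSketch`, `cbFirst`, `allNull`, and the data are hypothetical names and values, not the file's `cb`/`cbAll` or its encodings of PSDH (10) and (23).

```lean
-- Hypothetical sketch: entities carry a numeric rank (larger = more salient);
-- two entities with the same rank model a tie under a partial ranking.

/-- All realized entities tied at the best rank. -/
def cbAllSketch (prev : List (String × Nat)) (cur : List String) : List String :=
  let realized := prev.filter (fun p => cur.contains p.1)
  let best := realized.foldl (fun acc p => max acc p.2) 0
  (realized.filter (fun p => p.2 == best)).map (fun p => p.1)

/-- Only the first of the tied entities, or `none` if nothing is realized. -/
def cbFirst (prev : List (String × Nat)) (cur : List String) : Option String :=
  (cbAllSketch prev cur).head?

/-- True when no adjacent utterance pair in the discourse has a CB. -/
def allNull (d : List (List (String × Nat))) : Bool :=
  (d.zip d.tail).all
    (fun (u, v) => (cbFirst u (v.map (fun p => p.1))).isNone)

-- Toy previous utterance: "a" and "b" tie at the top rank.
def prevU : List (String × Nat) := [("a", 1), ("b", 1), ("c", 0)]

#eval cbAllSketch prevU ["a", "b"]  -- ["a", "b"]
#eval cbFirst prevU ["a", "b"]      -- some "a"
#eval cbFirst prevU ["d"]           -- none

-- Toy discourse in which no utterance shares an entity with its predecessor.
def disjointDiscourse : List (List (String × Nat)) :=
  [[("x", 1)], [("y", 1)], [("z", 1)]]

#eval allNull disjointDiscourse     -- true
```

A result list of length two from `cbAllSketch` is the toy analogue of a two-CB configuration, while `cbFirst` hides the tie by construction.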

What this file deliberately does NOT formalize #

PSDH's GNOME corpus statistics (Tables 1-15). We do not have access to the corpus, and encoding the reported percentages as opaque `Nat × Nat` data would add code without deriving content. They are cited in prose and deferred to a possible future commit.
The full BFP 87 4-way Transition as a substrate primitive. The 4-way private structure `BFPTransition` lives below as a study-file-local definition, per the audit's "extract on second consumer" discipline. It will be promoted to `Centering/Transition.lean` when a second consumer (Walker 1989, Brennan 1995, etc.) lands.
The `CenteringConfig` bundled record. Per the mathlib audit, PSDH's "8 parameters" map onto typeclass instances plus variant predicates, not onto a `CenteringConfig` structure. The fact that "GJW95 = [CfRanker GrammaticalRole] + Rule1GJW95 + pairRank" is documented in prose, not as a structure-typed value. (Mathlib precedent: `Mathlib.Analysis.Calculus` has `FDeriv`/`Deriv`/`HasDerivAt`/`HasFDerivAt`/`lineDeriv` as separate definitions with cross-implications, not a `DerivativeConfig` bundling them.)
The OT bridge to `Core.Constraint.OT.Tableau.optimal` per Beaver 2004. PSDH §3.1 fn 12 endorse Beaver's OT reformulation of Centering, but the bridge theorem belongs in `Phenomena/Reference/Studies/Beaver2004.lean` (queued as a separate commit), per mathlib's `PMF` vs `Measure` precedent.
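
For orientation, the four-way transition classification referred to above is usually stated in terms of two tests: whether the CB is unchanged from the previous utterance, and whether the CB equals the most highly ranked forward-looking center (the CP). A minimal core Lean 4 sketch follows. `Transition4` and `classify` are hypothetical names and do not reproduce the file's private `BFPTransition`; treating an undefined previous CB as "unchanged" is an assumption borrowed from the common textbook statement of the classification.

```lean
-- Hypothetical sketch of a four-way transition classifier.
inductive Transition4 where
  | cont | retain | smoothShift | roughShift
  deriving Repr, DecidableEq

/-- Classify a transition from the previous CB (if any), the current CB,
and the current CP. Assumption: an undefined previous CB counts as
"CB unchanged". -/
def classify (cbPrev : Option String) (cbCur cpCur : String) : Transition4 :=
  let sameCb := cbPrev.isNone || cbPrev == some cbCur
  let cbIsCp := cbCur == cpCur
  match sameCb, cbIsCp with
  | true, true => Transition4.cont          -- CB kept, and it is the CP
  | true, false => Transition4.retain       -- CB kept, but the CP differs
  | false, true => Transition4.smoothShift  -- CB changed, new CB is the CP
  | false, false => Transition4.roughShift  -- CB changed, and the CP differs

#eval classify (some "a") "a" "a"  -- selects the `cont` case
#eval classify (some "a") "a" "b"  -- selects the `retain` case
#eval classify (some "a") "b" "b"  -- selects the `smoothShift` case
#eval classify (some "a") "b" "c"  -- selects the `roughShift` case
```

The sketch requires a defined current CB; a discourse step with no CB at all (the NULL case discussed earlier) falls outside these four constructors.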

Throughout, examples use `String` entities for readability and `Utterance String GrammaticalRole` from the substrate. The information-status (IS) ranker of Strube-Hahn is illustrated separately.

§5.1 Two totalizers for PSDH (10): Strube-Hahn vs Beaver #

§5.1.1 The structural underpinning #

Items deferred from this commit #