Erk & Herbelot 2024 — How to Marry a Star #

@cite{erk-herbelot-2024}

Erk, K. & Herbelot, A. (2024). How to Marry a Star: Probabilistic Constraints for Meaning in Context. Journal of Semantics 40(4), 549–583.

Status: Phase 3 (paper-faithful instantiation) #

This file instantiates the SDS substrate from Theories/Semantics/Probabilistic/SDS/{GraphicalModel,JointPosterior}.lean on the paper's running examples — currently the bat-in-player sentence (paper §5.1, Figure 5, Table 1). Phase 4 will add the astronomer-married- star sentence (Table 2).

The previous version of this file used the legacy SDSConstraintSystem flat-substrate (a Product-of-Experts caricature that collapsed the paper's directed graphical model to two functions over the concept space). Replaced because the caricature could not reproduce Tables 1–2 nor the qualitative α-monotonicity result paper §5.2 advertises. The new version uses the paper-faithful multi-node graphical model.

Numerical reproduction #

Closed-form derivation in our framework (see derivations in theorem docstrings) gives:

α	P(BAT-STICK \| obs)	P(BAT-ANIMAL \| obs)
0.5	3/4 = 0.75	1/4 = 0.25
0.1	11/12 ≈ 0.917	1/12 ≈ 0.083

Paper Table 1 (p. 571, WebPPL Monte Carlo, 2000 samples):

α	p(stick)	p(animal)
0.5	0.82	0.18
0.1	0.96	0.04

After detailed re-read of paper §4.1, §4.2, and §5.1 graphical-model descriptions (PDF pp. 13-25), the ~7pp discrepancy at α=0.5 is NOT explained by:

HOLD-AGENT selectional choice: PLAYER is observed at the player node, so selectional(hold-agent, PLAYER) is a constant factor in all configurations and cancels in normalization. Any non-zero spec — uniform or otherwise — gives the same posterior at the bat node, given the observation.
Bernoulli role-existence nodes (paper §4.1, p. 563): these are typically = 1 for mandatory roles (paper: "the sleeper always needs to be realized in a sleeping event, that is, P(SLEEP-THEME | SLEEP) = 1"). HOLD-AGENT and HOLD-THEME for "hold" are both presumably mandatory; even if not, observing Agent/Theme conditions pins the Bernoulli to "yes" with constant likelihood factor across configs.
Other-verb sem.role nodes (paper §4.1: "PAINT-AGENT and PAINT-THEME, both with zero probability of occurring as roles of SLEEP"): contribute multiplicative factor 1 to all configurations, wash out.
Verb concept-node lacking a role contribution (paper p. 569: verb concept (5) "is conditionally dependent on node (3)" only, not on any role node): my model places a uniform-PMF placeholder verb_self at the verb node. Since c_verb = HOLD is observed, the uniform-PMF contribution is a constant factor across configs.
Soft vs hard role-Bernoullis with non-unit P(role | verb): doesn't change posterior given observation pins the Bernoulli.

The most plausible remaining explanations:

Monte Carlo noise in paper's 2000-sample WebPPL simulation. SE for p=0.82 with N=2000 is ≈ 0.009, so the 95% CI on paper's true probability is roughly [0.80, 0.84]. Our 0.75 is 8 SDs below 0.82 — within MC noise only if paper's underlying probability is much closer to 0.78 than 0.82.
A graphical-model structural element we haven't identified, such as additional dependencies between concept nodes and the scenario-mix node that aren't visible in the figures we read.
Paper's WebPPL implementation specifics (rejection-sampling bias, etc.) that effectively change the implied posterior.

The qualitative direction — lower α → more BAT-STICK (the BASEBALL-favored sense) — matches the paper. Both rows of Table 1 show the same direction in our closed-form derivation.

The numbers above are what the closed-form joint posterior of our graphical model evaluates to; we do not back-solve parameters to match the paper's WebPPL output (per the user-locked decision in the 0.230.298 redo: "compute the true closed-form joint posterior — don't back-solve, don't intervals").

Provenance for paper-cited values #

9-concept inventory: paper p. 569 ("BALL, BAT-ANIMAL, HOLD, BAT-STICK, CANDLE, CAT, PLAYER, STONE, VAMPIRE")
BASEBALL scenario distribution: paper p. 569 ("equal probability to the concepts BALL, BAT-STICK, HOLD, PLAYER, and STONE, and zero probability to all other concepts")
GOTHIC scenario distribution: paper p. 569 ("equal probability to the concepts BAT-ANIMAL, CANDLE, CAT, HOLD, and VAMPIRE, and zero probability otherwise")
HOLD-THEME selectional: paper p. 569 ("P(c | HOLD-THEME) = {0 for c=HOLD; 0.125 else}")
HOLD-AGENT selectional: NOT specified by paper; we assume uniform 1/8 over non-HOLD concepts (analogous to HOLD-THEME)
VERB-SELF selectional (placeholder for the no-role hold concept-node): uniform 1/9 over all concepts. Contributes a constant factor that washes out in normalization, so doesn't affect the posterior.

α	Our framework P(STAR-PERSON)	Paper Table 2
1/2	285/337 ≈ 0.846	0.82
1/10	19/31 ≈ 0.613	0.57

Erk & Herbelot 2024 — How to Marry a Star #

Status: Phase 3 (paper-faithful instantiation) #

Numerical reproduction #

Provenance for paper-cited values #

Closed-form derivation #

h_supp discharge: structural blocker lemmas #

α-monotonicity (paper §5.1, p. 571) #

Paper §5.2: "an astronomer married a star" #

Closed-form derivation for the astronomer-married-star posterior #

`h_supp` discharge: structural blocker lemmas #