Hale (2001): a probabilistic Earley parser as a psycholinguistic model #

[Hal01] (NAACL 2001) proposes that cognitive load is the total probability of the structural analyses disconfirmed by the input so far (§4): for a consistent grammar, the effort for a prefix is one minus its prefix probability, and word-by-word reading time is proportional to log (αₙ₋₁ / αₙ) — the word's surprisal — where α is [Sto95]'s prefix probability, computed by a probabilistic Earley parser that is strong-competence, frequency-sensitive, and eager (Principles 1–3). The parser is a total-parallelism theory (§3): garden-pathing needs no reanalysis — it happens exactly at words where the disconfirmed analyses comprise most of the probability mass (§6.1: at "fell" in the horse raced past the barn fell, grammar (1) puts the pre-word prefix probability more than ten times the post-word one), and the subject/object relative asymmetry falls out the same way (§7).

Main definitions #

surprisal — the word-level linking hypothesis, in the paper's log-of-ratio form.
disconfirmed — the prefix-level load: total mass of disconfirmed analyses.

Main results #

surprisal_eq_neg_log_nextProb — the ratio form is the conditional-word- probability form (the identity [Lev08] later grounds in relative entropy).
surprisal_eq_log_one_add, log_le_surprisal — surprisal grows with the ratio of disconfirmed to surviving mass: the paper's account of garden-pathing, stated for any generative process.

Implementation notes #

The demonstrations' numerics (grammars (1)–(3), Figs. 3–8) require Stolcke's Earley chart over recursive PCFGs; they are cited in prose above and their formalization awaits an Earley substrate. The theorems here are the grammar-independent content of the linking hypothesis.

source

noncomputable def Hale2001.surprisal {T : Type u_1} {W : Type u_2} (P : PMF T) (str : T → List W) (ws : List W) (w : W) :

ℝ

Word-level cognitive load (§4): the log of the ratio of the prefix probability before the word to the prefix probability after it.

Equations

Hale2001.surprisal P str ws w = Real.log ((Processing.Expectation.prefixMass P str ws).toReal / (Processing.Expectation.prefixMass P str (ws ++ [w])).toReal)

Instances For

source

noncomputable def Hale2001.disconfirmed {T : Type u_1} {W : Type u_2} (P : PMF T) (str : T → List W) (ws : List W) :

ENNReal

Prefix-level cognitive load (§4): the total probability of the analyses the prefix has disconfirmed.

Equations

Hale2001.disconfirmed P str ws = 1 - Processing.Expectation.prefixMass P str ws

Instances For

source

@[simp]

theorem Hale2001.disconfirmed_nil {T : Type u_1} {W : Type u_2} (P : PMF T) (str : T → List W) :

disconfirmed P str [] = 0

No analysis has been disconfirmed before any input is seen.

source

theorem Hale2001.surprisal_eq_neg_log_nextProb {T : Type u_1} {W : Type u_2} (P : PMF T) (str : T → List W) (ws : List W) (w : W) :

surprisal P str ws w = -Real.log (Processing.Expectation.nextProb P str ws w).toReal

The paper's ratio form of surprisal is the negative log conditional probability of the word — the form [Lev08] derives as the relative entropy of the belief update.

source

theorem Hale2001.surprisal_eq_log_one_add {T : Type u_1} {W : Type u_2} (P : PMF T) (str : T → List W) (ws : List W) (w : W) (ha : 0 < (Processing.Expectation.prefixMass P str (ws ++ [w])).toReal) :

surprisal P str ws w = Real.log (1 + ((Processing.Expectation.prefixMass P str ws).toReal - (Processing.Expectation.prefixMass P str (ws ++ [w])).toReal) / (Processing.Expectation.prefixMass P str (ws ++ [w])).toReal)

Surprisal in disconfirmation form: the log of one plus the ratio of the mass disconfirmed at the word to the mass surviving it.

source

theorem Hale2001.log_le_surprisal {T : Type u_1} {W : Type u_2} (P : PMF T) (str : T → List W) (ws : List W) (w : W) {k : ℝ} (hk : 0 ≤ k) (ha : 0 < (Processing.Expectation.prefixMass P str (ws ++ [w])).toReal) (hdis : k * (Processing.Expectation.prefixMass P str (ws ++ [w])).toReal ≤ (Processing.Expectation.prefixMass P str ws).toReal - (Processing.Expectation.prefixMass P str (ws ++ [w])).toReal) :

Real.log (1 + k) ≤ surprisal P str ws w

Garden-pathing (§6.1): if the mass disconfirmed at a word is at least k times the surviving mass, the word's surprisal is at least log (1 + k) — difficulty spikes exactly where the disconfirmable analyses comprise a great amount of probability.

Documentation

Linglib.Studies.Hale2001

Hale (2001): a probabilistic Earley parser as a psycholinguistic model #

Main definitions #

Main results #

Implementation notes #