Lossy-Context Surprisal: Bridge to Classical Surprisal #
@cite{futrell-gibson-levy-2020}
MemoryProcess (in Basic.lean) is the abstract type underlying the
lossy-context surprisal model of @cite{futrell-gibson-levy-2020}
("Lossy-Context Surprisal: An Information-Theoretic Model of Memory
Effects in Sentence Processing", Cog Sci 44, e12814).
This file proves the paper's §3.5.1 reduction: when the memory encoder
loses no information (a Dirac at some history-summarising function
f), expected surprisal collapses to classical surprisal under the
language model induced by predict ∘ f.
In the paper's framing (§3.5.1), surprisal theory becomes a definitional
special case of lossy-context surprisal: the encoder M giving a lossless
representation of context (a Dirac at the true context, or any
deterministic summary) recovers Shannon's −log p(w | c).
Main definitions #
MemoryProcess.IsLosslessFor— the process exactly realises an LMLangModel.virtualLM— the LM induced by composingpredictwith a history-summarising function
Main theorem #
expectedSurprisal_eq_surprisal_of_lossless— losslessmpoflmyieldsmp.expectedSurprisal = lm.surprisal
A memory process is lossless for a language model lm if some
deterministic history-summary f makes the encoder a Dirac at f c
and the predictor's distribution at f c equal to lm.next c.
(@cite{futrell-gibson-levy-2020} §3.5.1: this is the "perfect memory" regime in which lossy-context surprisal collapses to classical surprisal.)
Equations
- mp.IsLosslessFor lm = ∃ (f : List Voc → Mem), mp.IsDirac f ∧ ∀ (c : List Voc), mp.predict (f c) = lm.next c
Instances For
Lossless reduction (§3.5.1). A memory process that is lossless
for lm produces exactly the classical surprisal of lm. Lossy-context
surprisal generalises surprisal — it does not replace it.
Reading: when no information is lost in encoding, the integral over
memory states in expectedSurprisal (Eq. 3) degenerates to a single
deterministic prediction, recovering Shannon's -log p(w | c).
The "virtual" language model induced by a deterministic
history-summary f and a memory predictor predict.
Instances For
A memory process is lossless for its own virtual LM whenever the
encoder is a Dirac at f. This is the "construction-side" complement
of expectedSurprisal_eq_surprisal_of_lossless: any Dirac encoder
does realise some LM, namely virtualLM.
Lossless ⇒ classical surprisal of the virtual LM. Any Dirac
memory process realises classical surprisal under its induced
language model. This is the reduction in its purely structural form,
without an external lm parameter.