Kullback-Leibler divergence of a conditional measure #

The conditioning identity: for a probability measure μ and an event s of nonzero mass, klDiv (μ[|s]) μ = −log μ(s) — Bayesian update by pure restriction costs exactly the information content of the event.

[UPSTREAM] candidate for Mathlib/InformationTheory/KullbackLeibler/ (placement and namespace mirror ChainRule.lean, the directory's other file combining klDiv with probability constructions).

source

theorem InformationTheory.klDiv_cond_self {Ω : Type u_1} [MeasurableSpace Ω] (μ : MeasureTheory.Measure Ω) [MeasureTheory.IsProbabilityMeasure μ] {s : Set Ω} (hs : MeasurableSet s) (hs0 : μ s ≠ 0) :

klDiv μ[|s] μ = ENNReal.ofReal (-Real.log (μ s).toReal)

The Kullback-Leibler divergence of the conditional measure μ[|s] from μ is the information content of the event: −log μ(s). The core of [Lev08]'s equivalence of relative-entropy difficulty and surprisal.

Documentation

Linglib.Core.InformationTheory.KullbackLeibler.Cond

Kullback-Leibler divergence of a conditional measure #