[DHH+26]: the Value-of-Information clarify-or-commit model #

A PMF-level formalisation of the decision-theoretic Value of Information (VoI) framework for adaptive human–agent communication. An agent holds a belief b : PMF Θ over latent user intents and may either commit to an action now or clarify by asking a question before committing. The paper operationalises the choice through the classical Value of Information: ask a question only when its expected improvement in the downstream decision outweighs the communication cost.

A question is modelled as an answer kernel κ : Θ → PMF Y — the paper's p(y ∣ q, θ), the distribution of answer y were θ the true intent. Its answer marginal p(y ∣ q, b) and the updated belief b_y are the project's PMF.marginal κ b and PMF.posterior κ b y; weightedPosteriorValue_eq identifies the (total) per-answer term used here with p(y) · V(b_y).

Main definitions #

EU U b a — expected utility of committing to action a under belief b.
V U b — value of acting now, ⨆ a, EU U b a.
Vpost U b κ — expected value after asking κ, ∑' y, p(y) · V(b_y).
VoI U b κ — value of information, Vpost U b κ - V U b.
NetVoI c U b κ — VoI net of the per-question cost c.
worthAsking c U b κ — the clarify-or-commit decision, c < VoI U b κ.

Main statements #

V_le_Vpost — information never has negative value: V U b ≤ Vpost U b κ.
V_add_VoI — VoI is the honest increment: V U b + VoI U b κ = Vpost U b κ.
VoI_smul — VoI is positive-homogeneous in the utility (stakes scale).
worthAsking_mono_stakes — holding belief, question, and cost fixed, raising the stakes (scaling the utility) keeps a question worth asking, so the commit-without-asking region shrinks as stakes rise. This is the ceteris-paribus mechanism behind the paper's Mixed-Stakes prediction: scaling utility scales VoI, so a question clearing a fixed cost at low stakes (U = 1, guessing an animal) clears it at high stakes (U = 10, diagnosing a disease). medical_worth_asking_of_animal is the named instance. The model isolates the utility-scaling mechanism; it does not encode the experiments' differing candidate-set sizes or answer models.

Implementation notes #

Utilities are ℝ≥0∞-valued so the model lives natively on PMF. VoI uses truncated subtraction, but V_le_Vpost makes the gap genuine rather than clipped to 0. Homogeneity needs only s ≠ ∞ (via ENNReal.mul_sub); no finiteness of V/Vpost is assumed, so the core results hold for arbitrary intent, action, and answer types.

The worth-asking region is the strict c < VoI U b κ (equivalently 0 < NetVoI), matching the paper's "commit when max_q NetVoI ≤ 0" rule. The cross-question argmax selection of the policy is out of scope: the results here concern the per-question clarify-or-commit decision.

This PMF/ℝ≥0∞ formulation parallels the ℝ-valued expected-information-gain substrate Core.Agent.ExperimentDesign.eig (with value function V U): V_le_Vpost is the PMF analogue of ExperimentDesign.eig_nonneg_of_convex and of TsvilodubEtAl2026.evpi_nonneg.

ClarifyRule is the shared clarify-or-commit decision-rule contract: both this paper and [TMS+26] decide clarification from a net value-of-information signal, this paper through the sharp threshold sharpRule (worthAsking_iff_sharpRule), Tsvilodub et al. through a logistic gate (TsvilodubEtAl2026.softGateRule).

Todo #

Discharge the claim that EVPI (TsvilodubEtAl2026.evpi) is the upper bound on VoI for any question into a theorem worthAsking c U b κ → c < EVPI.
Relate VoI / V_le_Vpost to Core.Agent.ExperimentDesign.eig / eig_nonneg_of_convex (bridging the ℝ≥0∞-on-PMF and ℝ-on-Fintype carriers) so the two statements of "information has nonnegative value" become one fact.

[DHH+26]: the Value-of-Information clarify-or-commit model #

Main definitions #

Main statements #

Implementation notes #

Todo #

Expected utility and the value of acting now #

The value of a question #

Information never has negative value #

Stakes: positive-homogeneity of the value of information #

Uninformative questions carry no value #

A worked Mixed-Stakes 20 Questions instance #

The decision rule: a sharp threshold #