Semiring View of Deterministic Decoders #

When the noise kernel is dirac (no noise), score aggregation in a constraint system is itself an algebraic operation: the additive operation in a tropical or max-plus semiring. This is the bridge from the RUM/decoder picture to the parsing-theoretic semiring picture (@cite{goodman-1999}; @cite{eisner-2002}; @cite{mohri-2002}).

Decoder	Underlying algebraic structure
`argminDecoder`	min-plus tropical semiring `Tropical R`
`argmaxDecoder`	max-plus semiring (= tropical over `OrderDual R`)
`softmaxDecoder α`	log-sum-exp ("warped") semiring on `ℝ`

In each row, the additive identity of the semiring is the "winner takes all" choice operation across alternatives, and the multiplicative identity is the per-candidate score-combination operation. The decoder's job is then to identify which candidates realise the additive identity.

What this file proves #

The bridge to mathlib's Tropical R (a min-plus semiring): the tropical sum of trop-tagged scores is trop of their inf. Specialised to a Finset of candidates, this says that choosing the additive identity in the tropical semiring picks out exactly the score that argminDecoder distributes its mass over — making the algebraic and operational views of OT coincide.

The max-plus dual follows by replacing R with OrderDual R: HG's argmax is then the same algebraic operation in the dual semiring.

The dequantized bridge #

§ 4 connects lseFinset (the warped semiring's additive operator at finite temperature, in Dequantization/LogSumExp/Basic.lean) to argmax via the α → ∞ limit (lseFinset_tendsto_sup' in Dequantization/LogSumExp/Limit.lean): a candidate is an argmax winner exactly when its score equals lim_{α → ∞} lseFinset α cands score. This is the semiring-level statement of MaxEnt → HG: as the inverse temperature sweeps to its limit, the soft "log-sum-exp aggregator" deforms to the hard "max aggregator", and "this candidate's score equals the aggregate" is exactly the argmax condition.

source

theorem Core.Constraint.trop_sum_eq_inf_score {Cand : Type u_1} {R : Type u_2} [LinearOrder R] [OrderTop R] (cands : Finset Cand) (score : Cand → R) :

Tropical.untrop (∑ c ∈ cands, Tropical.trop (score c)) = cands.inf score

The tropical sum of scores equals the inf of the underlying values.

Aggregating "or" alternatives in Tropical R (where + is min) computes the minimum. This is the algebraic statement underlying argminDecoder: the OT winner is the candidate realising the tropical sum.

Direct restatement of mathlib's Finset.untrop_sum' for clarity in the constraint-system context.

source

theorem Core.Constraint.argmin_winner_iff_trop_sum {Cand : Type u_1} {R : Type u_2} [LinearOrder R] [OrderTop R] {cands : Finset Cand} {score : Cand → R} {c : Cand} (hc : c ∈ cands) :

(∀ c' ∈ cands, score c ≤ score c') ↔ score c = Tropical.untrop (∑ c' ∈ cands, Tropical.trop (score c'))

A candidate c ∈ cands is an argminDecoder winner exactly when its score equals the tropical-semiring sum of all candidate scores.

This is the operational ↔ algebraic bridge for OT: the decoder's notion of "winner" coincides with the candidate realising the additive identity of the tropical semiring.

Max-plus semiring for HG #

The max-plus semiring on R is the tropical semiring on OrderDual R: addition is max (= min in the dual order), multiplication is the underlying +. For real-valued harmony scores in HG, the relevant structure is max-plus on WithBot ℝ (with -∞ as the additive identity).

The argmax decoder is the max-plus analogue of argminDecoder: argmaxDecoder picks the candidate(s) realising the max-plus sum of all candidate scores, i.e., the maximum harmony.

Stating this via the existing tropical machinery requires going through OrderDual plus WithTop instances; for our purposes the concrete argminDecoder ↔ trop_sum bridge above is the main statement, and the HG case is its dual.

source

theorem Core.Constraint.argmax_winner_iff_lse_max_limit {Cand : Type u_1} {cands : Finset Cand} (hne : cands.Nonempty) {score : Cand → ℝ} {c : Cand} (hc : c ∈ cands) :

(∀ c' ∈ cands, score c' ≤ score c) ↔ Filter.Tendsto (fun (α : ℝ) => lseFinset α cands score) Filter.atTop (nhds (score c))

A candidate c ∈ cands is an argmax winner exactly when its score equals the α → ∞ limit of lseFinset α cands score.

This is the dequantized analogue of argmin_winner_iff_trop_sum: the operational notion of "winner" (achieves the maximum) coincides with the algebraic notion (score realises the additive identity of the dequantized warped semiring, i.e., the limit of the soft aggregator). The forward direction uses lseFinset_tendsto_sup' (Dequantization/LogSumExp/Limit.lean), and the algebraic characterisation reduces to score c = cands.sup' hne score.

Documentation

Linglib.Core.Constraint.Semiring

Semiring View of Deterministic Decoders #

What this file proves #

The dequantized bridge #

Max-plus semiring for HG #