Pin's algebraic characterization of subregular language classes #

The classical algebraic-automata-theory characterization of four basic subregular varieties via omega-power equations on the syntactic monoid:

Variety	Equation	Meaning
`𝒟` (definite)	`s · [w]^ω = [w]^ω`	left-absorbing
`𝒦` (reverse-definite)	`[w]^ω · s = [w]^ω`	right-absorbing
`𝒩` (co/finite)	both `𝒟`'s and `𝒦`'s	both-sided absorbing
`ℒℐ` (generalized definite)	`[w]^ω · s · [w]^ω = [w]^ω`	sandwich-absorbing

Where [w]^ω = Monoid.omegaPow (L.syntacticClass w) is the unique idempotent in the cyclic submonoid of [w] (see Linglib/Core/Algebra/IdempotentPower.lean). The variables s range over L.syntacticMonoid and w over non-empty List α (alphabet-relativized form — see Equations.lean for the trivial-letter counterexample motivating the non-empty-w restriction).

Why omega-power and not finite-`k`? #

[Lam26] Props 53/57 (in Equations.lean) give finite-k characterizations parameterized by the suffix/prefix length k. Pin's forms are the unbounded versions: a single k-free equation in the syntactic monoid characterizes membership in the variety. The omegaPow substrate is what eliminates the k parameter.

The two characterizations cohere: L.IsDefinite k → kDefiniteEquation L k is the finite-k half; (∃ k, L.IsDefinite k) ↔ pinDefiniteEquation L is the unbounded half. The unbounded form is the natural Pin/Eilenberg form used throughout algebraic automata theory.

Main definitions #

Language.pinDefiniteEquation L — s · [w]^ω = [w]^ω.
Language.pinReverseDefiniteEquation L — [w]^ω · s = [w]^ω.
Language.pinCofiniteEquation L — conjunction of the above two.
Language.pinGeneralizedDefiniteEquation L — [w]^ω · s · [w]^ω = [w]^ω.

All four require [Finite L.syntacticMonoid] (equivalent to L being regular, by IsRegular.finite_syntacticMonoid).

Main results #

Language.exists_isDefinite_iff_satisfies_pinDefiniteEquation — Pin's 𝒟-iff.
Language.exists_isReverseDefinite_iff_satisfies_pinReverseDefiniteEquation — Pin's 𝒦-iff.
Language.isFiniteOrCofinite_iff_satisfies_pinCofiniteEquation — Pin's 𝒩-iff (additionally requires [Finite α]; the language-level reverse direction in Subregular/Language/Definite.lean does not hold for infinite alphabets).
Language.exists_isGeneralizedDefinite_iff_satisfies_pinGeneralizedDefiniteEquation — Pin's ℒℐ-iff. The reverse direction uses the same prefix-pigeonhole template as 𝒟/𝒦, replacing one-sided absorption with the LI sandwich identity (sandwich_absorbing_of_pin_pigeonhole).

Future work: replace the pigeonhole proofs with the kernel structure #

The *_pin_pigeonhole lemmas reprove by hand, for special cases, the structure of the minimal ideal (kernel) of a finite syntactic monoid. Once Green's relations and the Rees–Sushkevich theorem land in mathlib (in progress upstream: the Mathlib.Algebra.Group.GreensRelations development plus idempotent powers in finite semigroups), these characterizations should be rewritten as corollaries of the kernel being a band, which dissolves the pigeonhole entirely:

𝒟 (definite) ⟺ the kernel is a right-zero band;
𝒦 (reverse-definite) ⟺ the kernel is a left-zero band;
ℒℐ (generalized definite) ⟺ the kernel is a rectangular band;
𝒩 (co/finite) ⟺ the kernel is trivial.

Reference points for that rewrite: Rees–Sushkevich ([pin-mfa] Ch. V Thm 3.33), the minimal ideal of a finite semigroup ([pin-mfa] Ch. V Prop 4.37), and the aperiodic-simple = rectangular-band classification ([pin-mfa] Ch. V Cor 3.34). Do not fork that substrate here; consume it from mathlib when it merges.

References #