Pitman–Yor process #

@cite{pitman-2006} @cite{odonnell-2015}

The Pitman–Yor process (PYP) is a two-parameter Bayesian non-parametric distribution on partitions of [n], generalising the Chinese Restaurant Process (the one-parameter Dirichlet process). The mathematical reference is @cite{pitman-2006} §3.2 (Saint-Flour lectures); the linguistic application that motivates this file is @cite{odonnell-2015} §3.1.6 (memoization distribution for adaptor and fragment grammars in Theories/Morphology/FragmentGrammars/).

Naming convention #

@cite{pitman-2006} writes parameters as (α, θ) with α = discount and θ = concentration; @cite{odonnell-2015} writes (a, b) for the same two. We use (discount, concentration) to match neither convention's single letters but to be self-documenting.

Main definitions #

stepPochhammer x s m — generalised step product ∏_{k=0}^{m-1} (x + k·s) (@cite{pitman-2006} eq 3.7, @cite{odonnell-2015} eq 3.13). Specialises to the rising factorial (ascPochhammer R m).eval x at s = 1 and to the geometric power x^m at s = 0.
PitmanYor — the two-parameter family (discount, concentration) with 0 ≤ discount ≤ 1 and concentration ≥ -discount (@cite{pitman-2006} eq 3.5, second case).
PitmanYor.partitionProb — the exchangeable partition probability function (EPPF) of @cite{pitman-2006} Theorem 3.2 (eq 3.6) / @cite{odonnell-2015} eq 3.14.

What `partitionProb` actually computes #

partitionProb q evaluates Pitman's EPPF formula (@cite{pitman-2006} eq 3.6) at the multiset of block sizes q.parts. The EPPF is, per @cite{pitman-2006} p. 39, the probability that the random partition Π_n equals any specific (set) partition of [n] whose blocks have sizes (n_1, …, n_k). By the EPPF's symmetry, the value depends only on the multiset of sizes — which is what makes the Nat.Partition n → ℝ signature well-typed.

partitionProb q is therefore the prob of one specific set partition with multiset of block sizes q.parts, NOT the prob of the multiset q.parts itself. The two differ by the multiplicity factor

mult(q) = n! / (∏_{m ∈ q.parts} m! · ∏_{j} (q.parts.count j)!)

i.e. the number of set partitions of [n] whose block sizes are q.parts (@cite{pitman-2006} eq 2.2 / Nat.Partition.numSetPartitions).

Sum-to-1 identities #

Pitman 2006 gives several equivalent normalisations of the EPPF:

(a) ∑_{Π : set partition of [n]} EPPF(block sizes of Π) = 1
                                                          @cite{pitman-2006} Thm 3.2
(b) ∑_{q : Nat.Partition n} mult(q) · partitionProb q = 1
                                                          @cite{pitman-2006} eq 2.2
(c) ∑_{compositions (n_1,…,n_k) of n} (n choose n_1,…,n_k)·1/k! · EPPF(n_1,…,n_k) = 1
                                                          @cite{pitman-2006} p. 42

We formalise (a) as sum_partitionProb_set_eq_one, summing over Finpartition (Finset.univ : Finset (Fin n)). This is the form the downstream AdaptorGrammar consumer needs (since AG's Y is a labeled table assignment, equivalent to a set partition under the canonical "tables labeled by order of creation" convention).

The bare sum ∑_{q : Nat.Partition n} partitionProb q does NOT equal 1 in general — every multiset appears once in the sum, but the EPPF interpretation requires counting it mult(q) times. For example, at α = 0, θ = 1, n = 3 the bare sum is 2/3.

Limitations #