Documentation

Linglib.Phenomena.Phonology.Studies.Saito2025

Saito, Tomaschek & Baayen (2025): Frequency × inflectional status via DLM #

@cite{saito-tomaschek-baayen-2025} @cite{baayen-2019} @cite{heitmeier-chuang-baayen-2026}

Saito, M., Tomaschek, F., & Baayen, R. H. (2025). Interaction of frequency and inflectional status: An approach from discriminative learning. Preprint v1.

Empirical claims #

The paper investigates the apparent contradiction between two documented frequency effects on phonetic realization:

The study reanalyses German tongue position data (560 word tokens, 88 word types containing the rhyme [a(:)…t] from the Karl-Eberhard Corpus of spontaneous southern German) and finds:

  1. Inflectional status × frequency interaction: high-frequency non-inflected words show tongue raising (reduction); high- frequency inflected words show attenuated reduction (paper §2.2, Tables 1–2).
  2. DLM-derived semantic support replaces the binary morphological predictor: replacing the categorical InflStatus factor with the continuous SemSupSuffix measure (semantic support from word meaning to the suffix triphone, derived from the trained DLM) improves model fit by 142.87 AIC units (paper §3.3, Table 3).
  3. Architectural challenge to WEAVER++: the result undermines speech production models that posit intermediate symbolic morphological representations (@cite{levelt-roelofs-meyer-1999} and the Roelofs WEAVER lineage). Form ↔ meaning mappings without a morpheme layer suffice.

The substantive theoretical move: what looks like a morphological- boundary effect is driven by inflectional semantics.

Substrate (this file) #

This file is the third consumer of the LinearDiscriminativeLexicon substrate (Theories/Processing/Lexical/Discriminative/Defs.lean), after @cite{chuang-bell-tseng-baayen-2026} and @cite{lu-chuang-baayen-2026}. It validates the polymorphic carrier typing across every axis of variation:

AxisChuang 2026Lu 2026Saito 2025
LanguageMandarinMandarinGerman
Modalityf0 pitchf0 pitchEMA tongue position
Form rep50-dim ℝ100-dim ℝ14,404-dim binary
Meaning rep768-dim CKIP768-dim CKIP300-dim word2vec
Phenomenontonal realiz.tone sandhimorpho-phonetics

The same LinearDiscriminativeLexicon ℝ FormVec MeaningVec instantiates to all three. The "binary" structure of form vectors here (zero/one triphone presence) is a property of the training data the network sees, not of the type — Fin 14404 → ℝ accommodates {0,1}-valued sequences as a special case.

SemSup measures (substrate) #

The paper introduces a family of derived measures (paper §3.1) on top of a trained DLM: SemSup, SemSupVowel, SemSupSuffix, SemSupWord, PredAcc, FuncLoad, SemLen, UncertProd, UncertComp. The first four are paper-headline; the rest are diagnostic.

The general measures (semSup, semSupWord) plus their linearity lemmas live in Theories/Processing/Lexical/Discriminative/Measures.lean, having graduated to substrate when @cite{gahl-baayen-2024} landed as a second consumer. The paper-specific positional variants (semSupVowel, semSupSuffix) stay here as abbrevs wrapping the substrate primitive.

Cross-framework note: WEAVER++ challenged but unformalized #

The paper's headline architectural claim — that no intermediate morpheme layer is needed between semantics and articulation — directly challenges WEAVER++ (@cite{levelt-roelofs-meyer-1999} and the broader WEAVER lineage). linglib does not currently formalize WEAVER++; the cross-framework discrimination would require a Theories/Processing/Lexical/WEAVER/ sibling that explicitly posits a lemma layer. Defer until a second WEAVER-using study lands.

Sibling-framework engagement #

Like the Chuang and Lu DLM Studies, this file's claims sit in tension with the stored-lexicon assumptions in:

Phenomena/Phonology/Studies/BreissKatsudaKawahara2026.lean is the closest sibling phenomenon — Japanese morpho-phonetics with explicit discrimination among the four ItemSpecificity channels. Saito's DLM analysis would form a natural 5th line in that paper's discrimination table; the structural cross-framework theorem is deferred until a unified LexiconArchitecture typeclass exists (per the cross-framework reconciler's recommendation, see CHANGELOG 0.231.15).

Sections #

@[reducible, inline]

The full set of triphones in the paper's CELEX-derived word-by- triphone matrix (paper §3.1: matrix C of shape 64068 × 14404). Form vectors are zero/one binary indicators of triphone presence.

Equations
Instances For
    @[reducible, inline]

    The dimensionality of the pretrained word2vec German embeddings (paper §3.1: matrix S of shape 64068 × 300; paper cites the German word2vec model of Müller 2015 — bib entry omitted).

    Equations
    Instances For
      @[reducible, inline]

      A form vector: zero/one indicator of which triphones the word contains. The "binary" structure is data, not type — FormVec is Fin n → ℝ and binary vectors are a subset.

      Equations
      Instances For
        @[reducible, inline]

        The paper's specific DLM instantiation. The substrate type LinearDiscriminativeLexicon is in Theories/Processing/Lexical/Discriminative/Defs.lean; this abbreviation specialises it to the German triphone × word2vec dimensions.

        Equations
        • One or more equations did not get rendered due to their size.
        Instances For

          SemSupVowel and SemSupSuffix (paper §3.1 eq. 3, 4) #

          The general measures semSup and semSupWord (with linearity lemmas) live in Theories/Processing/Lexical/Discriminative/Measures.lean. The two positional variants below are paper-specific bindings naming which triphone index is being projected. abbrev for transparency (simp / rw see through to semSup).

          @[reducible, inline]

          SemSupVowel (paper §3.1 eq. 3): semantic support for the triphone centred on the word's stem vowel.

          Equations
          Instances For
            @[reducible, inline]

            SemSupSuffix (paper §3.1 eq. 4): semantic support for the triphone centred on the suffix exponent ([t] in this paper). The headline measure of paper §3.2.2: replacing the binary inflectional-status factor with semSupSuffix improves the tongue-position GAMM by 142.87 AIC units.

            Equations
            Instances For
              theorem Phenomena.Phonology.Studies.Saito2025.saito_close_meanings_imply_close_form (D : GermanInflectionalDLM) (s₁ s₂ : GermanWord2VecVec) {ε : } (h : s₁ - s₂ ε) :
              D.production s₁ - D.production s₂ LinearMap.toContinuousLinearMap D.production * ε

              Quantitative form of the DLM's articulation prediction. Paper §3.3 reports that the trained DLM's production map yields tongue-position predictions consistent with empirical measurements. The Lipschitz form: any GermanInflectionalDLM sends close-in-meaning word pairs to close-in-form triphone-vectors, with constant ‖production‖.

              This is the third consumer of dlm_neighbor_centroids_imply_neighbor_contours (after Chuang 2026 and Lu 2026), validating that the polymorphic substrate accommodates German morpho-phonetics with no specialisation beyond carrier-type instantiation.

              Findings as paper-supplied empirical facts #

              Per CLAUDE.md (Processing-scope guidance), specific GAMM fits, AIC deltas, and Random Forest variable-importance scores are out of scope as Lean theorems. They are recorded here as documented findings.

              Frequency × inflectional status interaction (paper §2.2.1, Table 1). For non-inflected words, higher (log) frequency predicts higher tongue- tip positions (articulatory reduction). For inflected words this reduction effect is significantly attenuated (interaction term te(Freq, Time):Inflected: edf=7.51, F=15.83, p<0.001). The same interaction is observed for tongue-body positions (paper §2.2.2, Table 2; F=3.29, p=0.020).

              SemSupSuffix is the strongest predictor of inflectional status (paper §3.2.1, Fig. 10). In a Random Forest analysis with nine DLM-derived semantic measures as candidate predictors, SemSupSuffix ranks highest; SemSupVowel ranks second; SemSupWord ranks fourth (after PredAcc). Inflected words have significantly higher SemSupSuffix (Mann-Whitney U=152,201, p<0.001) and higher SemSupWord; lower SemSupVowel (paper §3.2.2, Fig. 11abc).

              SemSupSuffix replaces InflStatus in the GAMM (paper §3.3, Table 3). Replacing the binary factor InflStatus with the continuous SemSupSuffix measure in the tongue-position GAMM reduces AIC by 142.87 units (62.64 ML score units), with one fewer effective degree of freedom. The DLM-derived continuous measure is structurally simpler than the categorical alternative and fits the data better.

              Frequency interaction explained by SemSupSuffix (paper §3.4). The interaction te(Time, SemSupSuffix, Freq) (edf=20.31, F=30.92, p<0.001) shows that higher SemSupSuffix correlates with lowered tongue positions for high-frequency words (= articulatory enhancement), while low-SemSupSuffix high-frequency words show raising (= articulatory reduction). The frequency paradox dissolves: both effects coexist, modulated by semantic support.

              Implications recorded in the paper's discussion #