Documentation

Linglib.Fragments.Japanese.Prosody

Japanese Prosody Fragment #

[BP86] [Kaw15]

Japanese prosodic entries following the autosegmental-metrical analysis of [BP86], with accent assignment rules and affix typology from [Kaw15].

Key Properties #

Default-accent rules (formerly Phonology/Prosody/Accent) #

Language-neutral accent-placement rules over a syllable-weight profile, used in the Japanese accent analysis below ([McC68]; [Hay95a]; [Kub06]; [PS93]).

The Antepenultimate Accent Rule ([McC68]): accent the syllable containing the antepenultimate (3rd-from-last) mora; words with fewer than three morae accent the initial syllable. Returns the 0-indexed syllable.

Equations
Instances For
    @[irreducible]

    The Latin Stress Rule ([Hay95a]): accent the penult if heavy (≥ 2μ), else the antepenult. Monosyllables accent the only syllable; disyllables the penult (= initial). [Kub06] argues it fits Japanese default accentuation better than defaultAccentAAR.

    Equations
    Instances For
      def Japanese.Prosody.nonFinalitySigma (accentSyll : Option ) (nSyll : ) :

      NonFinality(σ) ([PS93]): 1 if accent is on the word-final syllable, else 0 — drives the avoidance of final accent in Japanese compounds and loanwords ([Kaw15]).

      Equations
      Instances For

        Accent-to-tone derivation and compound accent (Japanese-specific) #

        The Japanese pitch-accent tonal melody and the compound-accent rules ([Kaw15]) — the accentual HL + initial rise and the N1/N2 compound rules, which build on the default-accent placement above.

        A level tone for pitch-accent systems. Japanese uses only H (high) and L (low) at the lexical level ([Kaw15]).

        Instances For
          @[implicit_reducible]
          Equations
          Equations
          • One or more equations did not get rendered due to their size.
          Instances For
            def Japanese.Prosody.accentToTones (accentMora : Option ) (nMorae : ) :

            Derive the surface tone of each mora from accent position and mora count ([Kaw15]), returning one LevelTone per mora:

            1. accentual HL — H on the accented mora, L on the next;
            2. initial rise — L on mora 0, H on mora 1 (blocked by initial accent);
            3. spreading — unspecified moras copy the rightmost specified tone.
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For
              def Japanese.Prosody.shortN2CompoundAccent (n1Morae : ) (n2Accent : Option ) (preAccenting : Bool) :
              Option

              Short-N2 compound accent ([Kaw15]): when the second member N2 is short (≤ 2μ), accent may pre-accent the last syllable of N1, or N2 retains its own accent. Pre-accenting N2s lose their accent to NonFinality(Ft) and receive a new one, like dominant pre-accenting suffixes.

              Equations
              • One or more equations did not get rendered due to their size.
              Instances For
                def Japanese.Prosody.longN2CompoundAccent (n1Morae : ) (n2Accent : Option ) (n2Morae : ) :
                Option

                Long-N2 compound accent ([Kaw15]): when N2 is long (≥ 3μ), an unaccented or final-accented N2 accents its initial syllable; otherwise N2's own accent is retained.

                Equations
                Instances For

                  A Japanese lexical entry with prosodic specification.

                  The accent is specified as the mora position of the linked H tone (0-indexed from the beginning of the word). Unaccented words have accentMora = none.

                  • form : String

                    Surface form (romanized)

                  • gloss : String

                    Gloss

                  • accentMora : Option

                    Mora position of the accent (none = unaccented)

                  • nMorae :

                    Number of morae in the word

                  Instances For
                    Equations
                    • One or more equations did not get rendered due to their size.
                    Instances For

                      Is this entry accented?

                      Equations
                      Instances For

                        Convert to Bool accentedness for bridge to AccentualPhrase.

                        Equations
                        Instances For

                          kami 'god' — accented on first mora (initial accent). Contrasts with kami 'paper' (unaccented) and kamí 'hair' (accent on second mora).

                          Equations
                          Instances For

                            kami 'paper' — unaccented. No HL fall in the accentual phrase.

                            Equations
                            Instances For

                              uma'i — accented adjective (§2.2, Figs. 6, 8, 9).

                              Equations
                              Instances For

                                amai — accented adjective (§2.3, Fig. 8).

                                Equations
                                Instances For

                                  mame — unaccented noun (§2.2, Fig. 6).

                                  Equations
                                  Instances For

                                    ame — unaccented noun (§2.2, Fig. 6).

                                    Equations
                                    Instances For

                                      A Japanese lexical entry extending JProsodicEntry with the two annotations needed for frequency-conditioned phonology (e.g., the Breiss-Katsuda-Kawahara compounds in Studies/BreissKatsudaKawahara2026.lean): a corpus token log-frequency and a free/bound flag.

                                      Following CLAUDE.md's "infrastructure on demand", these annotations are kept on a thin extension structure rather than added to JProsodicEntry, so existing accent-only consumers are unaffected. The jTokenFreq accessor below exposes its token frequency as the -valued channel that frequency-conditioned phonology reads.

                                      • form : String
                                      • gloss : String
                                      • accentMora : Option
                                      • nMorae :
                                      • tokenLogFreq :

                                        Token log-frequency in a reference corpus (e.g., BCCWJ). 0 conventionally means "log of 1 occurrence" — used as the no-info default for unannotated items. Stored as so that the lexicon remains computable while the abstract Phonology/ interface coerces to .

                                      • canStandAlone : Bool

                                        Can this morpheme stand alone as a wordform? false for bound stems that occur only in compounds (e.g., the bound N2s targeted in [BKK26]).

                                      Instances For
                                        Equations
                                        • One or more equations did not get rendered due to their size.
                                        Instances For
                                          noncomputable def Japanese.Prosody.jTokenFreq (e : JLexicalEntry) :

                                          The token-log-frequency of a JLexicalEntry: the fragment-level field cast to for frequency-conditioned phonology. noncomputable because is; the field itself stays computable for decide-style proofs.

                                          Equations
                                          Instances For

                                            A Japanese N1 + N2 nominal compound. Compound-medial position is the locus of voiced velar nasalisation (/g/ → [ŋ]) studied in [BKK26]: obligatory when N2 is bound, optional and frequency-conditioned when N2 is free.

                                            The compound's own tokenLogFreq is independent of N1's and N2's — high-frequency compounds with low-frequency components, and vice versa, both occur. Frequency-conditioned theories that treat the compound's frequency as inherited from constituents (e.g., some RepresentationStrength variants with multiplicative inheritance) must reconcile this independence with empirical reality.

                                            • compoundLogFreq :

                                              Token log-frequency of the compound as a unit — typically much lower than either constituent in isolation, but the principal conditioning variable on optional nasalisation.

                                            Instances For
                                              Equations
                                              • One or more equations did not get rendered due to their size.
                                              Instances For

                                                The compound's surface form: simple concatenation of N1 and N2 forms (the segmental alternation /g/→[ŋ] applies on top).

                                                Equations
                                                Instances For

                                                  A compound's nasalisation is obligatory iff its N2 is bound. The free-N2 case is the gradient one tested in [BKK26].

                                                  Equations
                                                  Instances For

                                                    Japanese accentual phrase tonal specification.

                                                    [BP86] §2.2: the AP is defined by:

                                                    • A boundary L at the beginning (or end of preceding AP)
                                                    • A phrasal H on the second sonorant mora
                                                    • An optional accent HL (if the word is accented)
                                                    • A boundary L at the end

                                                    The phrasal H is NOT the same as H-tone spreading from the accent; it has its own local pitch range and is always present, even in unaccented phrases (Fig. 3 vs earlier accounts).

                                                    • words : List JProsodicEntry

                                                      Words grouped in this AP

                                                    • hasPhrasalH : Bool

                                                      Whether the phrasal H is present (always true in Japanese)

                                                    Instances For
                                                      Equations
                                                      • One or more equations did not get rendered due to their size.
                                                      Instances For

                                                        An AP is accented if any word in it is accented.

                                                        Equations
                                                        Instances For

                                                          Convert to the generic AccentualPhrase type. Japanese accent shape is always H*+L; unaccented APs get null.

                                                          Equations
                                                          Instances For

                                                            Accented words have accent location.

                                                            Unaccented words lack accent location.

                                                            The Japanese pitch accent shape is H*+L (a single bitonal accent).

                                                            A Japanese accented AP always triggers catathesis (because H*+L is bitonal).

                                                            A Japanese unaccented AP never triggers catathesis.

                                                            An AP containing only unaccented words is unaccented.

                                                            An AP containing an accented word is accented.

                                                            Japanese suffix accent specification.

                                                            Japanese suffixes exhibit the same dominant/recessive distinction as IE accent systems ([KH77]) and GT systems ([Rol18]). Dominant suffixes remove stem accent; recessive suffixes preserve it when present.

                                                            Instances For
                                                              Equations
                                                              • One or more equations did not get rendered due to their size.
                                                              Instances For

                                                                -teki (的): deaccenting suffix. Removes stem accent regardless of whether the stem is accented or unaccented — classified as subtractive-dominant in GT terms ([Kaw15]).

                                                                Equations
                                                                Instances For

                                                                  -si (氏): non-deaccenting suffix. Preserves stem accent when present — classified as recessive ([Kaw15]).

                                                                  Equations
                                                                  Instances For

                                                                    Deaccenting suffixes are dominant.

                                                                    Non-deaccenting suffixes are not dominant.

                                                                    Derive the accent of a suffixed word from stem accent + suffix dominance.

                                                                    Equations
                                                                    Instances For

                                                                      -teki deaccents kami 'god' (accented).

                                                                      -teki leaves kami 'paper' (unaccented) unchanged.

                                                                      -si preserves the accent on kami 'god'.

                                                                      -si preserves the unaccentedness of kami 'paper'.

                                                                      Loanword prosodic entry. Extends JProsodicEntry with syllable weight profile for testing AAR vs LSR predictions.

                                                                      Instances For
                                                                        Equations
                                                                        • One or more equations did not get rendered due to their size.
                                                                        Instances For

                                                                          kurisumasu 'Christmas' — accent on antepenultimate mora (su). [Kaw15] (10a).

                                                                          Equations
                                                                          • One or more equations did not get rendered due to their size.
                                                                          Instances For

                                                                            asufaruto 'asphalt' — accent on antepenultimate mora (fa). [Kaw15] (10g).

                                                                            Equations
                                                                            • One or more equations did not get rendered due to their size.
                                                                            Instances For

                                                                              makudonarudo 'McDonald' — accent on antepenultimate mora (na). [Kaw15] (10h).

                                                                              Equations
                                                                              • One or more equations did not get rendered due to their size.
                                                                              Instances For

                                                                                amerika 'America' — unaccented (4-mora with two final light σ). [Kaw15] (16a).

                                                                                Equations
                                                                                • One or more equations did not get rendered due to their size.
                                                                                Instances For

                                                                                  Loanword accent matches AAR prediction for all-light syllables.

                                                                                  Loanword accent matches LSR prediction for all-light syllables.

                                                                                  Japanese suffix with fine-grained accent classification.

                                                                                  Instances For
                                                                                    Equations
                                                                                    • One or more equations did not get rendered due to their size.
                                                                                    Instances For

                                                                                      -tara (conditional): recessive suffix — bears accent, loses to root. [Kaw15] (29).

                                                                                      Equations
                                                                                      Instances For

                                                                                        -ppoi (-ish): dominant suffix — bears accent, overrides root. [Kaw15] (30).

                                                                                        Equations
                                                                                        Instances For

                                                                                          -si (Mr.): recessive pre-accenting — inserts accent on root-final σ when root is unaccented, preserves root accent when present. [Kaw15] (31).

                                                                                          Equations
                                                                                          Instances For

                                                                                            -ke (family of): dominant pre-accenting — always inserts accent on root-final σ, deleting any root accent. [Kaw15] (32).

                                                                                            Equations
                                                                                            Instances For

                                                                                              -mono (thing): accent-shifting — shifts existing root accent to pre-suffix position. Unaccented roots stay unaccented. [Kaw15] (33).

                                                                                              Equations
                                                                                              Instances For

                                                                                                o- (honorific prefix): post-accenting — inserts accent after prefix. [Kaw15] (34).

                                                                                                Equations
                                                                                                Instances For

                                                                                                  -teki (的 -like): deaccenting — deletes root accent, no new accent. [Kaw15] (36).

                                                                                                  Equations
                                                                                                  Instances For

                                                                                                    -zu (group/plural): initial-accenting — inserts accent on root-initial σ. [Kaw15] (39).

                                                                                                    Equations
                                                                                                    Instances For

                                                                                                      Recessive pre-accenting is recessive at the coarse level (preserves root accent when present).

                                                                                                      Deaccenting is dominant at the coarse level (overrides root accent). This corrects the earlier classification of -teki in this fragment, which used ProsodicDominance.dominant — functionally the same projection, but the fine-grained type makes the behavior explicit.

                                                                                                      Accent-shifting is recessive at the coarse level: it only operates on accent that is already present, never creating new accent.

                                                                                                      kabuto+musi 'beetle': short N2 (musi, 2μ) pre-accents on N1-final syllable. [Kaw15] (22a).

                                                                                                      sin+yokohama 'Shin-Yokohama': long N2 (yokohama, 4μ, unaccented) → accent on N2-initial syllable. [Kaw15] (23a).

                                                                                                      sin+tamane'gi 'new onion': long N2 retains accent. [Kaw15] (24a).

                                                                                                      Unaccented trisyllable ame+ga → LHH (initial rise + H spread).

                                                                                                      Accented ka'mi+ga → HLL (accent HL + L spread).