Documentation

Linglib.Morphology.MorphRule

Morphological Infrastructure #

[Byb85] [Cha17] [Lin83] [ZP83]

Framework-agnostic types for morphological analysis and compositional morphological rules.

Typological Classification #

[Byb85] Relevance Hierarchy #

MorphCategory classifies morpheme functional categories ordered by semantic relevance to the stem:

stem < derivation < valence < voice < aspect < tense < mood < negation < agreement

Compositional Rules #

A MorphRule σ transforms a stem's surface form, morphosyntactic features, and meaning of type σ simultaneously. Rules where the word-level semantic contribution is delegated to a higher composition layer (e.g., tense rules that delegate to Semantics/Tense/, agreement rules that contribute no truth-conditional meaning) carry delegatedSemantics := true. The Bool flag is not a claim that the morpheme is meaningless — Bybee 1985 Ch 1 §3 explicitly argues against the vacuity-of-inflection position. It tracks where the meaning is computed (delegate to Theory layer vs. compute at the morphological word level), not whether meaning exists.

Side on which a bound morpheme attaches to its host.

Instances For
    @[implicit_reducible]
    Equations
    Equations
    • One or more equations did not get rendered due to their size.
    Instances For

      Typological position classification for formatives. [BN07] Table 2.

      Superset of AttachmentSide: adds simulfixation (process morphology), detached formatives (Wackernagel clitics, free auxiliaries), and endoclisis (clitic insertion inside a word).

      Instances For
        @[implicit_reducible]
        Equations
        Equations
        • One or more equations did not get rendered due to their size.
        Instances For

          How restrictive a morpheme is about what it can attach to.

          [ZP83] criterion A: clitics exhibit low selection (attach to virtually any word), while affixes exhibit high selection (attach only to specific stems or categories).

          • low : SelectionDegree

            Attaches to words of virtually any category (prepositions, verbs, adjectives, adverbs). Characteristic of simple clitics.

          • singleCategory : SelectionDegree

            Attaches to words of a single major category (e.g., past tense -ed to verbs, plural -s to nouns). Characteristic of inflectional affixes.

          • closedClass : SelectionDegree

            Attaches only to a closed list of stems (e.g., -n't only to finite auxiliaries). Maximally selective.

          Instances For
            @[implicit_reducible]
            Equations
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For

              Affixes are more selective than clitics.

              Equations
              Instances For

                Morphological status of a linguistic form.

                Classifies forms by their degree of syntactic independence and mode of combination. The clitic–affix boundary is the central question of [ZP83]: the criteria A–F serve to locate a given morpheme on this scale.

                • freeWord : MorphStatus

                  Syntactically independent word.

                • simpleClitic : MorphStatus

                  Simple clitic: phonologically bound form that can attach to hosts of virtually any syntactic category. [BN07]: defined primarily by low selectivity (categorical freedom) + phonological dependence, not necessarily by being a reduced variant of a free word. Many simple clitics have no free-word counterpart (Latin -que). English contracted auxiliaries ('s, 've, 'd) are a subcase where a free variant exists.

                • specialClitic : MorphStatus

                  Special clitic: either no corresponding free word exists, or the distribution differs from the free word. Romance pronominal clitics, Latin -que.

                • inflAffix : MorphStatus

                  Inflectional affix: paradigmatic, category-preserving, highly selective, with possible gaps and idiosyncrasies. English -ed, -s, -est, -n't.

                • derivAffix : MorphStatus

                  Derivational affix: potentially category-changing, often productive but may show lexical restrictions. English -ness, un-, -ize.

                Instances For
                  @[implicit_reducible]
                  Equations
                  Equations
                  • One or more equations did not get rendered due to their size.
                  Instances For

                    Is this an affix (inflectional or derivational)?

                    Equations
                    Instances For

                      Is this a clitic (simple or special)?

                      Equations
                      Instances For

                        A single cell in a morphological paradigm: one form of a lexeme in a particular morphosyntactic context.

                        The type parameter F is the feature bundle type (e.g., UD.MorphFeatures for a full UD specification, or a simpler domain-specific type).

                        • features : F

                          The morphosyntactic features selecting this cell.

                        • form : Option String

                          The surface form, or none for a paradigm gap.

                        • regular : Bool

                          Is this form predictable from the stem by regular rule?

                        Instances For
                          def Morphology.instReprParadigmCell.repr {F✝ : Type} [Repr F✝] :
                          ParadigmCell F✝Std.Format
                          Equations
                          • One or more equations did not get rendered due to their size.
                          Instances For
                            @[implicit_reducible]
                            instance Morphology.instReprParadigmCell {F✝ : Type} [Repr F✝] :
                            Repr (ParadigmCell F✝)
                            Equations
                            def Morphology.instBEqParadigmCell.beq {F✝ : Type} [BEq F✝] :
                            ParadigmCell F✝ParadigmCell F✝Bool
                            Equations
                            Instances For
                              @[implicit_reducible]
                              instance Morphology.instBEqParadigmCell {F✝ : Type} [BEq F✝] :
                              BEq (ParadigmCell F✝)
                              Equations

                              Does this cell represent a paradigm gap?

                              Equations
                              Instances For

                                Does this cell show irregularity (suppletion or unpredictable allomorphy)?

                                Equations
                                Instances For

                                  Morpheme functional category.

                                  Categories are ordered by semantic relevance to the verb stem: more relevant categories appear closer to the stem in suffixal morphology.

                                  Instances For
                                    Equations
                                    • One or more equations did not get rendered due to their size.
                                    Instances For
                                      def Morphology.instDecidableEqMorphCategory.decEq (x✝ x✝¹ : MorphCategory) :
                                      Decidable (x✝ = x✝¹)
                                      Equations
                                      • One or more equations did not get rendered due to their size.
                                      Instances For

                                        Predicate testing whether a MorphCategory is an agreement category, independent of which Controller role parameterizes it. Used for Bybee-style relevance-hierarchy code that doesn't care which role triggers the agreement, only that agreement IS the category.

                                        Equations
                                        Instances For

                                          Peripherality: numerical embedding of Bybee's relevance hierarchy where higher = farther from stem = less semantically relevant.

                                          In Bybee's text, "high relevance" means more semantically integrated with the stem ([bybee-1985] Ch 2 §2.1 p. 13). The substrate uses the opposite numerical direction: stem = 0 (most relevant), agreement = 8 (least relevant), so that Nat ordering mirrors stem-outward linear position in suffixing morphology (Ch 2 §6 iconicity, p. 33). The field name peripherality makes this directionality explicit and avoids the wrong-on-its-face gloss "high relevance rank means low relevance."

                                          Categories from Bybee 1985 Ch 2 §3 (verified against the book): valence, voice, aspect, tense, mood, agreement.

                                          Linglib extensions (NOT in Bybee 1985 — flag in any consumer that reads these ranks):

                                          • derivation (rank 1): Bybee Ch 4 argues lex/deriv/infl is a continuum, not a discrete level on the relevance scale.
                                          • number (rank 3): Bybee discusses verbal-number agreement at the low end (with person agreement). Noun number is treated separately (Ch 2 §6 cites Greenberg 1963 only, "stem < number < case" for nouns). Cross-comparison of noun-number rank with verb-aspect rank is an artifact of unifying both onto one scale.
                                          • degree (rank 5): Bybee never discusses adjectival degree morphology. Comparative morphology is often derivational cross-linguistically (Stassen WALS).
                                          • negation (rank 7): Bybee discusses negation as a kind of mood (Part II Ch 8 §5), not a separate level. Rank 7 is plausible per Miestamo 2005 cross-linguistic ordering data, but is a linglib extension.
                                          • nonfinite (rank 9): not on Bybee's hierarchy at all (nonfinite morphology often changes syntactic category, outside the scope of inflectional categories proper).
                                          Equations
                                          Instances For

                                            The relevance order #

                                            peripherality is a rank function — a numeric embedding. The object the hierarchy is really about is the order it induces: which categories are more stem-relevant than which. All relevance-hierarchy code — this file's RespectsRelevanceHierarchy and the consumers in Studies/ — speaks in that order via RelevanceLE / RelevanceLT; the specific ℕ values of peripherality are an implementation detail (only their comparisons carry meaning, as relevanceLE_iff_peripherality records).

                                            a is at least as stem-relevant as b: the rank order induced by peripherality. This is the unified relation relevance-hierarchy code uses.

                                            Equations
                                            Instances For

                                              a is strictly more stem-relevant than b.

                                              Equations
                                              Instances For

                                                The relevance order is reflexive.

                                                The relevance order is transitive.

                                                The relevance order is total: any two categories are comparable.

                                                Strict relevance order is the strict part of the order, as expected.

                                                peripherality reflects the relevance order exactly: it is the canonical rank realizing the order, so the order carries precisely the information the rank does.

                                                A morpheme ordering respects the relevance hierarchy when its categories are sorted stem-outward by the relevance order.

                                                Equations
                                                Instances For
                                                  structure Morphology.MorphRule (σ : Type) :

                                                  A morphological rule: carries formal AND semantic effects.

                                                  The type parameter σ is the meaning type, so this works uniformly across Bool/Frac/Float semantic backends.

                                                  Design principle: semEffect can be id for rules whose word-level semantic contribution is delegated to a higher composition layer (verb agreement -s carries no truth-conditional meaning at the word level; tense rules delegate to the intensional layer), making it explicit which inflections compute meaning at the word level and which delegate.

                                                  • category : MorphCategory

                                                    Which morphological category this rule realizes

                                                  • value : String

                                                    The feature value this rule realizes

                                                  • formRule : StringString

                                                    How the surface form changes

                                                  • How morphosyntactic features change

                                                  • valenceRule : Option ComplementTypeOption ComplementType

                                                    How the lexical frame changes — id except for valency-changing morphology (reciprocal, passive, causative affixes; [DA00]), where the frame change is the rule's whole point. The frame is lexical, not UD morphology, so it is its own effect channel rather than a featureRule component.

                                                  • semEffect : σσ

                                                    Semantic effect (id when meaning is delegated to a higher layer)

                                                  • delegatedSemantics : Bool

                                                    Is the word-level semantic contribution delegated to a higher composition layer? (Set true for agreement, tense, etc., where Semantics/{Tense,Aspect,Modality,Agreement}/ handle the meaning. NOT a claim that the morpheme is meaningless — see file docstring.)

                                                  Instances For
                                                    structure Morphology.Stem (σ : Type) :

                                                    A lexical stem: a root meaning plus its morphological paradigm.

                                                    • lemma_ : String

                                                      Base form (lemma)

                                                    • cat : UD.UPOS

                                                      Syntactic category

                                                    • baseFeatures : UD.MorphFeatures

                                                      Base morphosyntactic features

                                                    • baseFrame : Option ComplementType

                                                      Base lexical frame (complement selection) — lexical, beside the morphology.

                                                    • paradigm : List (MorphRule σ)

                                                      Available inflectional rules

                                                    Instances For
                                                      def Morphology.Stem.inflect {σ : Type} (s : Stem σ) (rule : MorphRule σ) (baseMeaning : σ) :
                                                      String × UD.MorphFeatures × σ

                                                      Apply a morphological rule to generate an inflected form + meaning. Threads morphology only; a rule's valenceRule acts on Stem.baseFrame at the consumer that builds Words.

                                                      Equations
                                                      Instances For
                                                        def Morphology.Stem.allForms {σ : Type} (s : Stem σ) (baseMeaning : σ) :
                                                        List (String × UD.MorphFeatures × σ)

                                                        Generate all forms in the paradigm (base + inflected).

                                                        Equations
                                                        Instances For

                                                          Distribution of inflectional categories between two elements of a periphrastic construction (e.g., auxiliary and lexical verb in an AVC). [And06a] [Byb85]

                                                          In an aux-headed AVC, onLex is minimal (stem only or empty). In a lex-headed AVC, onAux is empty. In a split AVC, onAux and onLex host different category types. In a doubled AVC, onAux and onLex overlap.

                                                          Instances For
                                                            Equations
                                                            • One or more equations did not get rendered due to their size.
                                                            Instances For
                                                              def Morphology.instDecidableEqInflDistribution.decEq (x✝ x✝¹ : InflDistribution) :
                                                              Decidable (x✝ = x✝¹)
                                                              Equations
                                                              Instances For