Documentation

Linglib.Studies.ImelGuoST2026

Cross-Linguistic Modal Typology #

Empirical modal inventories from 27 languages (17 families) mapped to the 3×3 force-flavor meaning space, following Imel, Guo, & [IGST26].

Mapping conventions (raw typological data → 3×3 grid) #

Data source #

[STIG23]. A database for modal semantic typology. https://clmbr.shane.st/modal-typology/

Abbreviations for the nine meaning points #

Equations
Instances For
    Equations
    Instances For

      Gitksan has variable-force modals: ima('a) and gat express both weak and strong epistemic force. These satisfy SAV (varying on force only, single flavor) and IFF (since {poss, nec} × {epistemic} is a Cartesian product).

      Equations
      Instances For

        Gitksan's variable-force epistemic modals satisfy both SAV and IFF: {poss, nec} × {epistemic} varies on force only (single flavor).

        Equations
        Instances For

          Greek has non-IFF modals: Prepei and Mporei express non-rectangular subsets of the meaning space. Prepei covers {(nec,e),(poss,e),(nec,d),(nec,c)} which is NOT a Cartesian product (missing (poss,d) and (poss,c)).

          Equations
          Instances For

            Prepei is NOT IFF: it expresses both forces and all three flavors, but does not express (possibility, deontic) or (possibility, circumstantial).

            Mandarin has many modals, extensive synonymy, but all satisfy IFF. The paper notes Mandarin has three modals all encoding strong ∧ epistemic.

            Equations
            Instances For

              Dutch has one non-IFF modal: zou/zouden...kunnen expresses {(nec,e),(poss,e),(poss,c)} which is not Cartesian-closed (missing (nec,c)).

              Equations
              Instances For
                Equations
                Instances For

                  English modal inventory, derived from the Fragment (single source of truth). Uses ModalInventory.fromAuxEntries to extract modals from AuxEntry data.

                  Equations
                  • One or more equations did not get rendered due to their size.
                  Instances For

                    German modal inventory, derived from the Fragment (single source of truth). Sollte is counted separately from sollen per morphological individuation ([STIG23] §4.3).

                    Equations
                    • One or more equations did not get rendered due to their size.
                    Instances For

                      All seven German modals satisfy IFF (including sollte as distinct from sollen).

                      Washo is a key counterexample to the SAV universal: -eʔ expresses both possibility and necessity with both epistemic and deontic flavors, varying on both axes simultaneously. Its meaning is the full Cartesian product {□,◇} × {e,d}, so it satisfies IFF. [STIG23] §4.1.

                      Equations
                      Instances For

                        Koryak ivək is the other SAV counterexample: it expresses both necessity and possibility with doxastic and assertive flavors (both mapped to epistemic in the 3×3 space). [STIG23] §3, §4.1.

                        Equations
                        Instances For

                          The hypothetical modal mighst (epistemic possibility + deontic necessity) is ruled out by IFF: its meaning {(◇,e),(□,d)} is not Cartesian-closed (missing (◇,d) and (□,e)). [STIG23] §4.1.

                          mighst also fails SAV (as expected, since SAV → IFF).

                          All twelve inventories.

                          Equations
                          • One or more equations did not get rendered due to their size.
                          Instances For

                            Ten of twelve encoded languages have perfect IFF degree (1.0).

                            All twelve languages have IFF degree > 0 (the minimum is Greek at 1/3).

                            The two SAV counterexamples (Washo, Koryak) both satisfy IFF: this is the core empirical claim of [STIG23].

                            Efficient Communication (Imel, Guo, & [IGST26]) #

                            Key computational results (verified over 32,301 generated + 27 natural languages):

                            1. Every Pareto-optimal modal system consists only of IFF modals.
                            2. IFF degree correlates positively with simplicity, informativeness, and Pareto-optimality.
                            3. Attested modal systems are more Pareto-optimal than the vast majority of hypothetically possible systems (mean optimality: 0.937 vs 0.776).

                            Mean Pareto-optimality of natural languages (Table 6).

                            Equations
                            Instances For

                              Mean Pareto-optimality of the generated population (Table 6).

                              Equations
                              Instances For