Documentation

Linglib.Studies.Haspelmath2001

Standard Average European: A Linguistic Area #

[Has01]

Formalization of [Has01]'s argument that the core European languages — the Romance, Germanic, Balkan, and Balto-Slavic families plus the westernmost Finno-Ugric languages — form a Sprachbund (linguistic area) called Standard Average European (SAE), defined by a dozen shared structural features that are absent in the geographically and genealogically adjacent languages.

The file first sets out a framework-neutral areal schema (§0: Isogloss, ArealReference, ArealProfile, IsArealAt, LinguisticArea) and then instantiates it: SAE is a LinguisticArea SAELanguage SAEFeature whose feature set is the twelve major Europeanisms of §2 of the paper, whose reference frame is the four samples §1 demands (area, cofamilial, adjacent, world), and whose areality is verified feature-by-feature against [Has01]'s Maps 107.1–107.12.

The cluster-map gradience of §4 (most notably the Charlemagne nucleus formed by French and German) is recovered automatically from the discrete feature data via LinguisticArea.clusterScore and nucleus.

Architectural notes #

A schema for typological linguistic areas (Sprachbünde) — groups of geographically contiguous, often genealogically heterogeneous languages that share structural features attributable to contact rather than inheritance. The schema follows the methodology of [Has01] but is gradient-first, in the spirit of his §4 cluster maps: the primary data per feature is an ArealProfile (a 4-tuple of densities — in-area, in-cofamilial, in-adjacent, in-world). Binary judgments are derived from profiles by comparison against an ArealThresholds parameter, defaulting to the qualitative 1/2 cut.

Design notes. Isogloss is Finset L, not Set L: typology works over finite samples, and Finset gives computable density without Classical. Reference samples carry Finset.Nonempty proofs: [Has01] evaluates his criteria against actual evidence — there is no "vacuous absence" in the paper — so requiring non-emptiness eliminates the degenerate 0/0 = 0 density and lets the rational and Nat formulations of the criteria agree on all in-use cases (cf. Isogloss.density_gt_iff / density_lt_iff). The threshold layer is parameterized rather than hard-coded so that stricter or paper-specific cutoffs (Heine-Kuteva, Grambank conventions) can be instantiated without rewriting the predicates; a future gradient treatment can replace Isogloss = Finset L with L → ℚ strength functions and reuse ArealProfile and ArealThresholds unchanged.

@[reducible, inline]

An isogloss is the set of languages exhibiting a given linguistic feature. Following standard typological practice, isoglosses are sets of languages, not the geometric boundaries between them.

This is a transparent abbreviation for Finset L; all finset operations (, , , , \, Finset.card) apply directly, and every membership question is decidable.

Equations
Instances For
    def Haspelmath2001.Isogloss.density {L : Type u} [DecidableEq L] (I : Isogloss L) (S : Finset L) :

    The proportion of languages in a finite reference sample S that lie inside the isogloss I. With Finset operations this is a computable rational in [0, 1] whenever S is non-empty; on S = ∅ it returns the ℚ-default 0/0 = 0.

    Equations
    • I.density S = (S I).card / S.card
    Instances For
      @[simp]
      theorem Haspelmath2001.Isogloss.density_empty {L : Type u} [DecidableEq L] (I : Isogloss L) :
      I.density = 0
      theorem Haspelmath2001.Isogloss.density_nonneg {L : Type u} [DecidableEq L] (I : Isogloss L) (S : Finset L) :
      0 I.density S
      theorem Haspelmath2001.Isogloss.density_le_one {L : Type u} [DecidableEq L] (I : Isogloss L) (S : Finset L) :
      I.density S 1
      theorem Haspelmath2001.Isogloss.density_gt_iff {L : Type u} [DecidableEq L] (I : Isogloss L) {S : Finset L} (hS : S.Nonempty) (θ : ) :
      I.density S > θ θ * S.card < (S I).card

      Bridge from the rational threshold form to a Nat-arithmetic form. Requires S non-empty so that density is not the degenerate 0 / 0. This is the lemma proofs rw to reach a decide-amenable goal.

      theorem Haspelmath2001.Isogloss.density_lt_iff {L : Type u} [DecidableEq L] (I : Isogloss L) {S : Finset L} (hS : S.Nonempty) (θ : ) :
      I.density S < θ (S I).card < θ * S.card
      theorem Haspelmath2001.Isogloss.density_gt_half_iff {L : Type u} [DecidableEq L] (I : Isogloss L) {S : Finset L} (hS : S.Nonempty) :
      I.density S > 1 / 2 S.card < 2 * (S I).card

      Specialization of density_gt_iff to the 1/2 threshold. The Nat-form RHS is fully kernel-reducible on concrete Finset data, so downstream decide calls do not need to traverse Rat.div.

      theorem Haspelmath2001.Isogloss.density_lt_half_iff {L : Type u} [DecidableEq L] (I : Isogloss L) {S : Finset L} (hS : S.Nonempty) :
      I.density S < 1 / 2 2 * (S I).card < S.card

      Specialization of density_lt_iff to the 1/2 threshold.

      The four reference samples needed to evaluate whether a feature is areal, following [Has01] §1's criteria (i)–(iv).

      Non-emptiness is required by construction: [Has01]'s methodology presupposes positive evidence on each sample, and an empty sample provides no information about the criterion it is meant to address.

      For Standard Average European, the area is the core European languages, the cofamilial sample is the eastern Indo-European branches (Iranian, Indic, Armenian) — branches of Indo-European that lie outside the area, used to rule out Proto-Indo-European inheritance — the adjacent sample is the geographically neighboring non-Indo-European languages (Turkic, Nakh-Daghestanian, Afro-Asiatic), and the world sample is a representative global typological sample.

      • area : Finset L

        The candidate area itself (e.g. core European languages for SAE).

      • cofamilial : Finset L

        Other branches of the same families as the area's languages, used to rule out genealogical inheritance from a deep common ancestor. For SAE: eastern Indo-European (Iranian, Indic, Armenian).

      • adjacent : Finset L

        Languages geographically adjacent to but outside the area, used to rule out a worldwide tendency that just happens to extend a bit further. For SAE: Turkic, Nakh-Daghestanian, Celtic (in some criteria), etc.

      • world : Finset L

        A representative worldwide sample, used to rule out the feature being a cross-linguistic universal preference rather than an areal phenomenon.

      • area_nonempty : self.area.Nonempty
      • cofamilial_nonempty : self.cofamilial.Nonempty
      • adjacent_nonempty : self.adjacent.Nonempty
      • world_nonempty : self.world.Nonempty
      Instances For

        The areal profile of an isogloss against a reference: the four densities that [Has01]'s methodology compares. This is the gradient datum from which both threshold-based binary judgments (IsArealAt) and §4-style cluster maps are derived.

        Profiles are framework-agnostic: a profile with inArea = 0.85, inCofamilial = 0.10, inAdjacent = 0.05, inWorld = 0.20 is a canonical "areal" pattern regardless of any threshold convention.

        • inArea :

          Density of the feature in the candidate area.

        • inCofamilial :

          Density of the feature in the cofamilial sample.

        • inAdjacent :

          Density of the feature in the geographically adjacent sample.

        • inWorld :

          Density of the feature in the worldwide sample.

        Instances For
          Equations
          • One or more equations did not get rendered due to their size.
          Instances For
            def Haspelmath2001.Isogloss.profile {L : Type u} [DecidableEq L] (I : Isogloss L) (R : ArealReference L) :

            The profile of an isogloss against a reference frame.

            Equations
            Instances For

              The maximum density outside the area — the most permissive value that the "lacks elsewhere" criteria must beat.

              Equations
              Instances For

                A natural areality score: in-area density minus the max outside density. Range [-1, 1], with higher values indicating stronger areal patterning. This is one of several reasonable score aggregations and is exposed for convenience; downstream studies are free to define their own.

                Equations
                Instances For

                  Numerical thresholds for [Has01]'s qualitative criteria.

                  [Has01] reads "the great majority" and "lack" qualitatively; this structure exposes the cutoffs so that:

                  • the default ⟨1/2, 1/2⟩ is the natural first approximation,
                  • stricter conventions (e.g. inside = 3/4, outside = 1/4) can be specified per-study,
                  • a future probabilistic generalization can plug in here without disturbing the rest of the schema.
                  • inside :

                    "Majority inside" cutoff: a feature is areal-at-T only if the area's density exceeds this.

                  • outside :

                    "Lacks outside" cutoff: cofamilial / adjacent / world densities must fall below this.

                  Instances For
                    structure Haspelmath2001.IsArealAt {L : Type u} [DecidableEq L] (T : ArealThresholds) (R : ArealReference L) (I : Isogloss L) :

                    The four-part Haspelmath criterion at a chosen ArealThresholds.

                    Each field captures one of [Has01] §1 criteria (i)–(iv), parameterized by the threshold T. With T = default this is the plain "majority in / minority out" reading.

                    • area_majority : I.density R.area > T.inside

                      Criterion (i): the area-internal density beats the inside threshold.

                    • cofamilial_lacks : I.density R.cofamilial < T.outside

                      Criterion (iii): cofamilial density falls below the outside threshold, ruling out common genealogical inheritance.

                    • adjacent_lacks : I.density R.adjacent < T.outside

                      Criterion (ii): adjacent-non-area density falls below the outside threshold, ruling out a wider regional drift.

                    • not_worldwide : I.density R.world < T.outside

                      Criterion (iv): worldwide density falls below the outside threshold, ruling out a universal cross-linguistic preference.

                    Instances For
                      @[reducible, inline]
                      abbrev Haspelmath2001.IsAreal {L : Type u} [DecidableEq L] (R : ArealReference L) (I : Isogloss L) :

                      Areality at the default Haspelmath threshold (⟨1/2, 1/2⟩).

                      Equations
                      Instances For
                        structure Haspelmath2001.LinguisticArea (L : Type u) (F : Type v) [DecidableEq L] :
                        Type (max u v)

                        A linguistic area (Sprachbund) parameterized by a language type L and a feature index type F.

                        An area bundles the three pieces of data needed for cluster-map analysis: a reference frame, a finite set of diagnostic features, and an isogloss assignment. The cluster-map methodology of [Has01] §4 is recovered from this data via featureProfile (per-feature gradient), clusterScore (per-language count), and isopleth / nucleus (cluster bands and core).

                        LinguisticArea deliberately does not require every diagnostic feature to satisfy IsArealAt at any particular threshold. Real Sprachbund analyses (including [Has01]'s own SAE) propose feature inventories where strong, weak, and tendency-only features coexist; the binary IsArealAt predicate is applied à la carte by downstream proofs for whichever subset passes the threshold of interest.

                        • reference : ArealReference L

                          The reference frame against which areality is judged.

                        • features : Finset F

                          The diagnostic features proposed for this Sprachbund.

                        • isogloss : FIsogloss L

                          The isogloss assigned to each feature: which languages exhibit it.

                        Instances For
                          def Haspelmath2001.LinguisticArea.featureProfile {L : Type u} {F : Type v} [DecidableEq L] (A : LinguisticArea L F) (f : F) :

                          The areal profile of feature f: its four densities under the area's reference frame. The primary gradient datum per feature.

                          Equations
                          Instances For
                            def Haspelmath2001.LinguisticArea.featureScore {L : Type u} {F : Type v} [DecidableEq L] (A : LinguisticArea L F) (f : F) :

                            The contrast score of feature f: in-area density minus the max outside density. Range [-1, 1], with IsArealAt features all scoring above T.inside - T.outside.

                            Equations
                            Instances For
                              def Haspelmath2001.LinguisticArea.clusterScore {L : Type u} {F : Type v} [DecidableEq L] (A : LinguisticArea L F) (l : L) :

                              The cluster score of a language: how many of the area's features it exhibits. This recovers [Has01] §4's cluster-map gradient membership from the discrete feature-by-feature data.

                              Equations
                              Instances For
                                theorem Haspelmath2001.LinguisticArea.clusterScore_le_card_features {L : Type u} {F : Type v} [DecidableEq L] (A : LinguisticArea L F) (l : L) :

                                The cluster score is bounded above by the total number of features.

                                def Haspelmath2001.LinguisticArea.isopleth {L : Type u} {F : Type v} [DecidableEq L] (A : LinguisticArea L F) (k : ) :
                                Set L

                                The k-isopleth: the set of languages exhibiting at least k of the area's features. Used to draw cluster maps at varying tightness.

                                Equations
                                Instances For
                                  theorem Haspelmath2001.LinguisticArea.isopleth_anti {L : Type u} {F : Type v} [DecidableEq L] (A : LinguisticArea L F) {j k : } (h : j k) :
                                  A.isopleth k A.isopleth j

                                  Isopleths are antitone in k: a stricter threshold yields a smaller set.

                                  def Haspelmath2001.LinguisticArea.nucleus {L : Type u} {F : Type v} [DecidableEq L] (A : LinguisticArea L F) :
                                  Set L

                                  The nucleus: languages exhibiting all but at most one of the area's features. For SAE, [Has01] §4 identifies French and German as the nuclear members — the Charlemagne Sprachbund.

                                  Nat subtraction truncates at zero, so features.card = 0 gives the trivial isopleth 0 = Set.univ; the SAE proofs below verify the non-degenerate case where features.card = 12.

                                  Equations
                                  Instances For

                                    Languages surveyed by [Has01], partitioned by their role in the four reference samples (area / cofamilial / adjacent / world).

                                    The list follows the paper's coverage but is necessarily a subset — the maps include Sami, Mordvin, Komi, Udmurt, Mari, Tatar, Kalmyk, Lezgian, etc. We retain enough of each subgroup to make the four samples non-empty and to preserve the paper's headline findings (French/German nucleus, Celtic/Basque/Turkic margin).

                                    Instances For
                                      @[implicit_reducible]
                                      Equations
                                      Equations
                                      • One or more equations did not get rendered due to their size.
                                      Instances For
                                        @[implicit_reducible]
                                        Equations
                                        • One or more equations did not get rendered due to their size.

                                        WALS code for each SAELanguage where one exists. WALS codes are 3-letter identifiers used by the World Atlas of Language Structures (the v2020.4 codes in Data.WALS.Features.*). Returns none for languages outside the WALS sample (currently: every SAELanguage constructor maps to a code, but the return type is Option String to accommodate future additions to SAELanguage that may not be in WALS).

                                        Equations
                                        Instances For
                                          def Haspelmath2001.walsClassifies {V : Type} [BEq V] (data : List (Data.WALS.Datapoint V)) (pred : VBool) (l : SAELanguage) :

                                          Generic WALS classifier: language l has a value in WALS chapter data that satisfies the boolean predicate pred. Returns False when l lacks a WALS code or the chapter has no entry for it.

                                          This is the bridging primitive for all WALS-grounded isoglosses below. Used as walsClassifies Data.WALS.F121A.allData (· == .particle) to ask "does WALS 121A classify this language as having a particle comparative?". The result is propositional with a derivable Decidable instance, so it slots directly into Finset.filter.

                                          Equations
                                          • One or more equations did not get rendered due to their size.
                                          Instances For
                                            @[implicit_reducible]
                                            instance Haspelmath2001.walsClassifies.instDecidable {V : Type} [BEq V] (data : List (Data.WALS.Datapoint V)) (pred : VBool) (l : SAELanguage) :
                                            Decidable (walsClassifies data pred l)
                                            Equations
                                            • One or more equations did not get rendered due to their size.

                                            The twelve "Europeanisms" identified in §2 of [Has01]. Each is the subject of one of Maps 107.1–107.12.

                                            • definiteIndefiniteArticles : SAEFeature

                                              §2.1, Map 107.1: Both definite and indefinite articles present.

                                            • relativeClausesWithRelPro : SAEFeature

                                              §2.2, Map 107.2: Postnominal relative clauses introduced by an inflecting relative pronoun (e.g. der/die/das, qui/que).

                                            • havePerfect : SAEFeature

                                              §2.3, Map 107.3: Transitive perfect formed by 'have' + past participle.

                                            • nominativeExperiencers : SAEFeature

                                              §2.4, Map 107.4: Predominant generalization of experiencer-as-nominative (English-style I like it) over inverting (it pleases me).

                                            • participialPassive : SAEFeature

                                              §2.5, Map 107.5: Canonical participial passive with copula + participle.

                                            • anticausativeProminence : SAEFeature

                                              §2.6, Map 107.6: Anticausative-prominent inchoative–causative pairs.

                                            • dativeExternalPossessor : SAEFeature

                                              §2.7, Map 107.7: Dative external possessors (e.g. German Die Mutter wäscht dem Kind die Haare).

                                            • negativePronounsNoVerbalNeg : SAEFeature

                                              §2.8, Map 107.8: Negative pronouns without obligatory verbal negation (V + NI type, e.g. French personne ne vient, German niemand kommt).

                                            • particleComparative : SAEFeature

                                              §2.9, Map 107.9: Particle comparatives (English than, Latin quam).

                                            • relativeBasedEquative : SAEFeature

                                              §2.10, Map 107.10: Equative constructions based on relative-clause structure (Catalan tan Z com X).

                                            • strictAgreement : SAEFeature

                                              §2.11, Map 107.11: Strict subject agreement — subject affixes that cannot stand alone with referential force (German ich arbeite, not arbeite).

                                            • intensifierReflexiveDifferentiation : SAEFeature

                                              §2.12, Map 107.12: Differentiated intensifier vs. reflexive forms (German selbst vs. sich, Russian sam vs. sebja).

                                            Instances For
                                              @[implicit_reducible]
                                              Equations
                                              Equations
                                              • One or more equations did not get rendered due to their size.
                                              Instances For
                                                @[implicit_reducible]
                                                Equations
                                                • One or more equations did not get rendered due to their size.

                                                WALS-derived: languages classified by F37A (Definite Articles) as having a definite article (any of the three positive values).

                                                Equations
                                                • One or more equations did not get rendered due to their size.
                                                Instances For

                                                  WALS-derived: languages classified by F38A (Indefinite Articles) as having any indefinite article distinct from "no indefinite article" or "indefinite-only of an unrelated kind".

                                                  Equations
                                                  • One or more equations did not get rendered due to their size.
                                                  Instances For

                                                    WALS-derived intersection: languages with both a definite and an indefinite article in the WALS data, matching [Has01] §2.1.

                                                    Equations
                                                    Instances For

                                                      Languages with both definite and indefinite articles (Map 107.1).

                                                      The paper's reading: Romance, Germanic (except Icelandic), Greek, Albanian, Macedonian, Bulgarian (the edin particle is treated as a budding indefinite article), and Hungarian. Icelandic is excluded — it has a suffixed definite article but no indefinite article.

                                                      Equations
                                                      • One or more equations did not get rendered due to their size.
                                                      Instances For

                                                        Languages with relative-pronoun relative clauses (Map 107.2).

                                                        Equations
                                                        • One or more equations did not get rendered due to their size.
                                                        Instances For

                                                          Languages with the 'have'-perfect (Map 107.3).

                                                          [Has01] §2.3 restricts this isogloss to the Romance and Germanic families plus a Balkan/peripheral fringe — Albanian, Greek, Macedonian (an innovation: ima + verbal adjective), and (parts of) Czech. Bulgarian and Serbo-Croatian retain the inherited Slavic 'be'+l-participle perfect rather than the 'have' construction.

                                                          Equations
                                                          • One or more equations did not get rendered due to their size.
                                                          Instances For

                                                            Languages with predominant nominative-experiencer coding (Map 107.4).

                                                            Equations
                                                            • One or more equations did not get rendered due to their size.
                                                            Instances For

                                                              WALS-derived parallel: languages classified by F107A (Passive Constructions) as having a passive present. F107A counts any passive (periphrastic, morphological, etc.), so this is a strict superset of Haspelmath's copula+participle criterion.

                                                              Equations
                                                              • One or more equations did not get rendered due to their size.
                                                              Instances For

                                                                Languages with a canonical participial passive (Map 107.5).

                                                                The Romance and Germanic copula+participle pattern, extended through Slavic and Baltic and into Greek, Albanian, Macedonian, and Irish.

                                                                Equations
                                                                • One or more equations did not get rendered due to their size.
                                                                Instances For

                                                                  Anticausative-prominent languages (Map 107.6: ≥ 70% anticausative).

                                                                  [Has01] §2.6 / Map 107.6 marks only the languages whose inchoative–causative pairs are anticausative-prominent on the Haspelmath 1993 figures. Romance is partially excluded (only French/Romanian register as prominent; Italian/Spanish/Portuguese fall on the causative-prominent side). The full SAE marking is German, French, Russian, Greek, Romanian, Lithuanian, English.

                                                                  Equations
                                                                  • One or more equations did not get rendered due to their size.
                                                                  Instances For

                                                                    Languages with dative external possessors (Map 107.7).

                                                                    Equations
                                                                    • One or more equations did not get rendered due to their size.
                                                                    Instances For

                                                                      WALS-derived parallel: languages classified by F115A (Negative Indefinite Pronouns and Predicate Negation) as not requiring predicate negation alongside the negative pronoun. Strict subset of Haspelmath's criterion: F115A.noPredicateNegation captures only the rigid V+NI type (predominantly Germanic), whereas Haspelmath's §2.8 also includes Romance languages where the predicate negative is optional or weakening.

                                                                      Equations
                                                                      • One or more equations did not get rendered due to their size.
                                                                      Instances For

                                                                        Languages with V + NI negation (no obligatory verbal negation; Map 107.8).

                                                                        Equations
                                                                        • One or more equations did not get rendered due to their size.
                                                                        Instances For

                                                                          WALS-derived parallel: languages classified by F121A (Comparative Constructions) as having a particle comparative.

                                                                          Equations
                                                                          • One or more equations did not get rendered due to their size.
                                                                          Instances For

                                                                            Languages with particle comparatives (Map 107.9).

                                                                            Equations
                                                                            • One or more equations did not get rendered due to their size.
                                                                            Instances For

                                                                              Languages with relative-based equatives (Map 107.10).

                                                                              Equations
                                                                              • One or more equations did not get rendered due to their size.
                                                                              Instances For

                                                                                WALS-derived parallel: languages classified by F101A (Expression of Pronominal Subjects) as requiring obligatory subject pronouns.

                                                                                Equations
                                                                                • One or more equations did not get rendered due to their size.
                                                                                Instances For

                                                                                  Languages with strict subject agreement (Map 107.11).

                                                                                  A language has strict agreement when subject pronouns are obligatory even in the presence of subject agreement on the verb (i.e., subject-agreement affixes lack referential force on their own). Russian and the pro-drop Romance languages (Italian, Spanish, Portuguese, Romanian) fail this criterion. Welsh has rich agreement but allows pro-drop, so it is also excluded.

                                                                                  Equations
                                                                                  • One or more equations did not get rendered due to their size.
                                                                                  Instances For

                                                                                    WALS-derived parallel: languages classified by F47A (Intensifiers and Reflexive Pronouns) as having differentiated forms.

                                                                                    Equations
                                                                                    • One or more equations did not get rendered due to their size.
                                                                                    Instances For

                                                                                      Languages with differentiated intensifier vs. reflexive forms (Map 107.12).

                                                                                      Equations
                                                                                      • One or more equations did not get rendered due to their size.
                                                                                      Instances For

                                                                                        The four reference samples for evaluating areality, per [Has01] §1.

                                                                                        • area: the core European languages (Romance, Germanic, Balkan, Balto-Slavic, marginal Finno-Ugric) that the paper proposes as SAE.
                                                                                        • cofamilial: other Indo-European branches (eastern IE: Iranian, Indic, Armenian) that lie outside the area; presence of a feature in these would point to PIE inheritance rather than areal contact.
                                                                                        • adjacent: geographically adjacent non-SAE languages (Celtic west, Turkic
                                                                                          • Nakh-Daghestanian east, Semitic south); presence here would suggest a wider regional drift rather than a strictly European phenomenon.
                                                                                        • world: a small worldwide sample for the (iv) "not common worldwide" criterion.
                                                                                        Equations
                                                                                        • One or more equations did not get rendered due to their size.
                                                                                        Instances For

                                                                                          Standard Average European as a LinguisticArea: the 12 diagnostic features of [Has01] §2 over the European/cofamilial/ adjacent/world reference frame.

                                                                                          LinguisticArea does not require every diagnostic feature to satisfy the strict IsAreal predicate at any particular threshold — and indeed, several SAE features (anticausative prominence, V+NI negation, strict agreement) do not pass strict majority on Haspelmath's own data. This matches the paper: [Has01]'s actual argument runs through the cluster maps of §4, not through per-feature majority thresholds.

                                                                                          Per-feature IsArealAt claims for the strongly-attested subset are proved separately below; clusterScore and nucleus are computed across all 12.

                                                                                          Equations
                                                                                          Instances For

                                                                                            Definite + indefinite articles (Map 107.1) is areal at the strict 1/2 threshold: ubiquitous in the area, absent from cofamilial/adjacent/world samples.

                                                                                            The 'have'-perfect (Map 107.3) is areal at the strict 1/2 threshold.

                                                                                            Particle comparatives (Map 107.9) are areal at the strict 1/2 threshold.

                                                                                            French sits in the SAE cluster nucleus: it exhibits all 12 diagnostic features (the maximum cluster score).

                                                                                            German sits in the SAE cluster nucleus alongside French — [Has01] §4's Charlemagne Sprachbund core.

                                                                                            The SAE feature inventory has 12 features, matching Maps 107.1–107.12 of [Has01].