Standard Average European: A Linguistic Area #
Formalization of [Has01]'s argument that the core European languages — the Romance, Germanic, Balkan, and Balto-Slavic families plus the westernmost Finno-Ugric languages — form a Sprachbund (linguistic area) called Standard Average European (SAE), defined by a dozen shared structural features that are absent in the geographically and genealogically adjacent languages.
The file first sets out a framework-neutral areal schema (§0:
Isogloss, ArealReference, ArealProfile, IsArealAt,
LinguisticArea) and then instantiates it: SAE is a
LinguisticArea SAELanguage SAEFeature
whose feature set is the twelve major Europeanisms of §2 of the paper, whose
reference frame is the four samples §1 demands (area, cofamilial, adjacent,
world), and whose areality is verified feature-by-feature against
[Has01]'s Maps 107.1–107.12.
The cluster-map gradience of §4 (most notably the Charlemagne nucleus
formed by French and German) is recovered automatically from the discrete
feature data via LinguisticArea.clusterScore and nucleus.
Architectural notes #
- Languages and features are local inductives: the paper surveys a
specific sample and committing to its boundaries is appropriate here.
A
SAELanguage.toWALSbridge maps each language to its WALS code where one exists. - The paper's Maps 107.1–107.12 are the primary source for every
isogloss. Each paper-based isogloss (e.g.
articleLgs,vplusNILgs) is the explicit set Haspelmath plots on the corresponding map. For the six features with a directly comparable WALS chapter (37A, 38A, 47A, 101A, 107A, 115A, 121A), a siblingwals*Lgsset is derived fromData.WALS.F*.allDataviawalsClassifies. The two are intentionally not unioned: where they disagree, that disagreement is a fact about Haspelmath's classification vs. WALS's encoding and should remain visible to readers and to bridge theorems. - Isoglosses are
Finset SAELanguage(the schema'sIsoglossis a transparent abbreviation forFinset L). All filter predicates are decidable, so the four areality criteria reduce todecideagainst the computable rationalIsogloss.density.
A schema for typological linguistic areas (Sprachbünde) — groups of
geographically contiguous, often genealogically heterogeneous languages
that share structural features attributable to contact rather than
inheritance. The schema follows the methodology of [Has01]
but is gradient-first, in the spirit of his §4 cluster maps: the
primary data per feature is an ArealProfile (a 4-tuple of densities —
in-area, in-cofamilial, in-adjacent, in-world). Binary judgments are
derived from profiles by comparison against an ArealThresholds
parameter, defaulting to the qualitative 1/2 cut.
Design notes. Isogloss is Finset L, not Set L: typology works over
finite samples, and Finset gives computable density without
Classical. Reference samples carry Finset.Nonempty proofs:
[Has01] evaluates his criteria against actual evidence — there
is no "vacuous absence" in the paper — so requiring non-emptiness
eliminates the degenerate 0/0 = 0 density and lets the rational and
Nat formulations of the criteria agree on all in-use cases (cf.
Isogloss.density_gt_iff / density_lt_iff). The threshold layer is
parameterized rather than hard-coded so that stricter or paper-specific
cutoffs (Heine-Kuteva, Grambank conventions) can be instantiated without
rewriting the predicates; a future gradient treatment can replace
Isogloss = Finset L with L → ℚ strength functions and reuse
ArealProfile and ArealThresholds unchanged.
An isogloss is the set of languages exhibiting a given linguistic feature. Following standard typological practice, isoglosses are sets of languages, not the geometric boundaries between them.
This is a transparent abbreviation for Finset L; all finset operations
(∩, ∪, ⊆, ∈, \, Finset.card) apply directly, and every
membership question is decidable.
Equations
- Haspelmath2001.Isogloss L = Finset L
Instances For
The proportion of languages in a finite reference sample S that lie
inside the isogloss I. With Finset operations this is a computable
rational in [0, 1] whenever S is non-empty; on S = ∅ it returns the
ℚ-default 0/0 = 0.
Equations
- I.density S = ↑(S ∩ I).card / ↑S.card
Instances For
Bridge from the rational threshold form to a Nat-arithmetic form.
Requires S non-empty so that density is not the degenerate 0 / 0.
This is the lemma proofs rw to reach a decide-amenable goal.
Specialization of density_gt_iff to the 1/2 threshold. The Nat-form
RHS is fully kernel-reducible on concrete Finset data, so downstream
decide calls do not need to traverse Rat.div.
Specialization of density_lt_iff to the 1/2 threshold.
The four reference samples needed to evaluate whether a feature is areal, following [Has01] §1's criteria (i)–(iv).
Non-emptiness is required by construction: [Has01]'s methodology presupposes positive evidence on each sample, and an empty sample provides no information about the criterion it is meant to address.
For Standard Average European, the area is the core European languages, the cofamilial sample is the eastern Indo-European branches (Iranian, Indic, Armenian) — branches of Indo-European that lie outside the area, used to rule out Proto-Indo-European inheritance — the adjacent sample is the geographically neighboring non-Indo-European languages (Turkic, Nakh-Daghestanian, Afro-Asiatic), and the world sample is a representative global typological sample.
- area : Finset L
The candidate area itself (e.g. core European languages for SAE).
- cofamilial : Finset L
Other branches of the same families as the area's languages, used to rule out genealogical inheritance from a deep common ancestor. For SAE: eastern Indo-European (Iranian, Indic, Armenian).
- adjacent : Finset L
Languages geographically adjacent to but outside the area, used to rule out a worldwide tendency that just happens to extend a bit further. For SAE: Turkic, Nakh-Daghestanian, Celtic (in some criteria), etc.
- world : Finset L
A representative worldwide sample, used to rule out the feature being a cross-linguistic universal preference rather than an areal phenomenon.
- area_nonempty : self.area.Nonempty
- cofamilial_nonempty : self.cofamilial.Nonempty
- adjacent_nonempty : self.adjacent.Nonempty
- world_nonempty : self.world.Nonempty
Instances For
The areal profile of an isogloss against a reference: the four
densities that [Has01]'s methodology compares. This is the
gradient datum from which both threshold-based binary judgments
(IsArealAt) and §4-style cluster maps are derived.
Profiles are framework-agnostic: a profile with inArea = 0.85,
inCofamilial = 0.10, inAdjacent = 0.05, inWorld = 0.20 is a
canonical "areal" pattern regardless of any threshold convention.
- inArea : ℚ
Density of the feature in the candidate area.
- inCofamilial : ℚ
Density of the feature in the cofamilial sample.
- inAdjacent : ℚ
Density of the feature in the geographically adjacent sample.
- inWorld : ℚ
Density of the feature in the worldwide sample.
Instances For
Equations
- Haspelmath2001.instReprArealProfile = { reprPrec := Haspelmath2001.instReprArealProfile.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
The profile of an isogloss against a reference frame.
Equations
Instances For
The maximum density outside the area — the most permissive value that the "lacks elsewhere" criteria must beat.
Equations
- p.outsideMax = max p.inCofamilial (max p.inAdjacent p.inWorld)
Instances For
A natural areality score: in-area density minus the max outside
density. Range [-1, 1], with higher values indicating stronger areal
patterning. This is one of several reasonable score aggregations and is
exposed for convenience; downstream studies are free to define their own.
Equations
- p.contrastScore = p.inArea - p.outsideMax
Instances For
Numerical thresholds for [Has01]'s qualitative criteria.
[Has01] reads "the great majority" and "lack" qualitatively; this structure exposes the cutoffs so that:
- the default
⟨1/2, 1/2⟩is the natural first approximation, - stricter conventions (e.g. inside = 3/4, outside = 1/4) can be specified per-study,
- a future probabilistic generalization can plug in here without disturbing the rest of the schema.
- inside : ℚ
"Majority inside" cutoff: a feature is areal-at-
Tonly if the area's density exceeds this. - outside : ℚ
"Lacks outside" cutoff: cofamilial / adjacent / world densities must fall below this.
Instances For
Equations
- Haspelmath2001.instInhabitedArealThresholds = { default := { } }
The four-part Haspelmath criterion at a chosen ArealThresholds.
Each field captures one of [Has01] §1 criteria (i)–(iv),
parameterized by the threshold T. With T = default this is the
plain "majority in / minority out" reading.
Criterion (i): the area-internal density beats the inside threshold.
Criterion (iii): cofamilial density falls below the outside threshold, ruling out common genealogical inheritance.
Criterion (ii): adjacent-non-area density falls below the outside threshold, ruling out a wider regional drift.
Criterion (iv): worldwide density falls below the outside threshold, ruling out a universal cross-linguistic preference.
Instances For
Areality at the default Haspelmath threshold (⟨1/2, 1/2⟩).
Equations
- Haspelmath2001.IsAreal R I = Haspelmath2001.IsArealAt default R I
Instances For
A linguistic area (Sprachbund) parameterized by a language type L
and a feature index type F.
An area bundles the three pieces of data needed for cluster-map analysis:
a reference frame, a finite set of diagnostic features, and an isogloss
assignment. The cluster-map methodology of [Has01] §4 is
recovered from this data via featureProfile (per-feature gradient),
clusterScore (per-language count), and isopleth / nucleus (cluster
bands and core).
LinguisticArea deliberately does not require every diagnostic
feature to satisfy IsArealAt at any particular threshold. Real
Sprachbund analyses (including [Has01]'s own SAE) propose
feature inventories where strong, weak, and tendency-only features
coexist; the binary IsArealAt predicate is applied à la carte by
downstream proofs for whichever subset passes the threshold of
interest.
- reference : ArealReference L
The reference frame against which areality is judged.
- features : Finset F
The diagnostic features proposed for this Sprachbund.
- isogloss : F → Isogloss L
The isogloss assigned to each feature: which languages exhibit it.
Instances For
The areal profile of feature f: its four densities under the area's
reference frame. The primary gradient datum per feature.
Equations
- A.featureProfile f = (A.isogloss f).profile A.reference
Instances For
The contrast score of feature f: in-area density minus the max
outside density. Range [-1, 1], with IsArealAt features all scoring
above T.inside - T.outside.
Equations
- A.featureScore f = (A.featureProfile f).contrastScore
Instances For
The cluster score of a language: how many of the area's features it exhibits. This recovers [Has01] §4's cluster-map gradient membership from the discrete feature-by-feature data.
Equations
- A.clusterScore l = {f ∈ A.features | l ∈ A.isogloss f}.card
Instances For
The cluster score is bounded above by the total number of features.
The k-isopleth: the set of languages exhibiting at least k of the
area's features. Used to draw cluster maps at varying tightness.
Equations
- A.isopleth k = {l : L | k ≤ A.clusterScore l}
Instances For
Isopleths are antitone in k: a stricter threshold yields a smaller set.
The nucleus: languages exhibiting all but at most one of the area's features. For SAE, [Has01] §4 identifies French and German as the nuclear members — the Charlemagne Sprachbund.
Nat subtraction truncates at zero, so features.card = 0 gives the
trivial isopleth 0 = Set.univ; the SAE proofs below verify the
non-degenerate case where features.card = 12.
Instances For
Languages surveyed by [Has01], partitioned by their role in the four reference samples (area / cofamilial / adjacent / world).
The list follows the paper's coverage but is necessarily a subset — the maps include Sami, Mordvin, Komi, Udmurt, Mari, Tatar, Kalmyk, Lezgian, etc. We retain enough of each subgroup to make the four samples non-empty and to preserve the paper's headline findings (French/German nucleus, Celtic/Basque/Turkic margin).
- French : SAELanguage
- Italian : SAELanguage
- Spanish : SAELanguage
- Portuguese : SAELanguage
- Romanian : SAELanguage
- Catalan : SAELanguage
- German : SAELanguage
- Dutch : SAELanguage
- English : SAELanguage
- Swedish : SAELanguage
- Norwegian : SAELanguage
- Danish : SAELanguage
- Icelandic : SAELanguage
- Russian : SAELanguage
- Polish : SAELanguage
- Czech : SAELanguage
- Bulgarian : SAELanguage
- SerboCroatian : SAELanguage
- Ukrainian : SAELanguage
- Greek : SAELanguage
- Albanian : SAELanguage
- Macedonian : SAELanguage
- Latvian : SAELanguage
- Lithuanian : SAELanguage
- Hungarian : SAELanguage
- Finnish : SAELanguage
- Estonian : SAELanguage
- Welsh : SAELanguage
- Irish : SAELanguage
- Breton : SAELanguage
- Basque : SAELanguage
- Turkish : SAELanguage
- Maltese : SAELanguage
- Persian : SAELanguage
- HindiUrdu : SAELanguage
- Armenian : SAELanguage
- Lezgian : SAELanguage
- Georgian : SAELanguage
- Mongolian : SAELanguage
- Indonesian : SAELanguage
- Yoruba : SAELanguage
- Japanese : SAELanguage
- Mandarin : SAELanguage
- Swahili : SAELanguage
- Quechua : SAELanguage
Instances For
Equations
- Haspelmath2001.instDecidableEqSAELanguage x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- Haspelmath2001.instReprSAELanguage = { reprPrec := Haspelmath2001.instReprSAELanguage.repr }
Equations
- One or more equations did not get rendered due to their size.
WALS code for each SAELanguage where one exists. WALS codes are 3-letter
identifiers used by the World Atlas of Language Structures (the v2020.4 codes
in Data.WALS.Features.*). Returns none for languages outside the WALS
sample (currently: every SAELanguage constructor maps to a code, but the
return type is Option String to accommodate future additions to
SAELanguage that may not be in WALS).
Equations
- Haspelmath2001.SAELanguage.French.toWALS = some "fre"
- Haspelmath2001.SAELanguage.Italian.toWALS = some "ita"
- Haspelmath2001.SAELanguage.Spanish.toWALS = some "spa"
- Haspelmath2001.SAELanguage.Portuguese.toWALS = some "por"
- Haspelmath2001.SAELanguage.Romanian.toWALS = some "rom"
- Haspelmath2001.SAELanguage.Catalan.toWALS = some "ctl"
- Haspelmath2001.SAELanguage.German.toWALS = some "ger"
- Haspelmath2001.SAELanguage.Dutch.toWALS = some "dut"
- Haspelmath2001.SAELanguage.English.toWALS = some "eng"
- Haspelmath2001.SAELanguage.Swedish.toWALS = some "swe"
- Haspelmath2001.SAELanguage.Norwegian.toWALS = some "nor"
- Haspelmath2001.SAELanguage.Danish.toWALS = some "dsh"
- Haspelmath2001.SAELanguage.Icelandic.toWALS = some "ice"
- Haspelmath2001.SAELanguage.Russian.toWALS = some "rus"
- Haspelmath2001.SAELanguage.Polish.toWALS = some "pol"
- Haspelmath2001.SAELanguage.Czech.toWALS = some "cze"
- Haspelmath2001.SAELanguage.Bulgarian.toWALS = some "bul"
- Haspelmath2001.SAELanguage.SerboCroatian.toWALS = some "scr"
- Haspelmath2001.SAELanguage.Ukrainian.toWALS = some "ukr"
- Haspelmath2001.SAELanguage.Greek.toWALS = some "grk"
- Haspelmath2001.SAELanguage.Albanian.toWALS = some "alb"
- Haspelmath2001.SAELanguage.Macedonian.toWALS = some "mcd"
- Haspelmath2001.SAELanguage.Latvian.toWALS = some "lat"
- Haspelmath2001.SAELanguage.Lithuanian.toWALS = some "lit"
- Haspelmath2001.SAELanguage.Hungarian.toWALS = some "hun"
- Haspelmath2001.SAELanguage.Finnish.toWALS = some "fin"
- Haspelmath2001.SAELanguage.Estonian.toWALS = some "est"
- Haspelmath2001.SAELanguage.Welsh.toWALS = some "wel"
- Haspelmath2001.SAELanguage.Irish.toWALS = some "iri"
- Haspelmath2001.SAELanguage.Breton.toWALS = some "bre"
- Haspelmath2001.SAELanguage.Basque.toWALS = some "bsq"
- Haspelmath2001.SAELanguage.Turkish.toWALS = some "tur"
- Haspelmath2001.SAELanguage.Maltese.toWALS = some "mlt"
- Haspelmath2001.SAELanguage.Persian.toWALS = some "prs"
- Haspelmath2001.SAELanguage.HindiUrdu.toWALS = some "hin"
- Haspelmath2001.SAELanguage.Armenian.toWALS = some "arm"
- Haspelmath2001.SAELanguage.Lezgian.toWALS = some "lez"
- Haspelmath2001.SAELanguage.Georgian.toWALS = some "geo"
- Haspelmath2001.SAELanguage.Mongolian.toWALS = some "kha"
- Haspelmath2001.SAELanguage.Indonesian.toWALS = some "ind"
- Haspelmath2001.SAELanguage.Yoruba.toWALS = some "yor"
- Haspelmath2001.SAELanguage.Japanese.toWALS = some "jpn"
- Haspelmath2001.SAELanguage.Mandarin.toWALS = some "mnd"
- Haspelmath2001.SAELanguage.Swahili.toWALS = some "swa"
- Haspelmath2001.SAELanguage.Quechua.toWALS = some "qcu"
Instances For
Generic WALS classifier: language l has a value in WALS chapter data
that satisfies the boolean predicate pred. Returns False when l lacks
a WALS code or the chapter has no entry for it.
This is the bridging primitive for all WALS-grounded isoglosses below. Used as
walsClassifies Data.WALS.F121A.allData (· == .particle) to ask "does WALS
121A classify this language as having a particle comparative?". The result
is propositional with a derivable Decidable instance, so it slots directly
into Finset.filter.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
The twelve "Europeanisms" identified in §2 of [Has01]. Each is the subject of one of Maps 107.1–107.12.
- definiteIndefiniteArticles : SAEFeature
§2.1, Map 107.1: Both definite and indefinite articles present.
- relativeClausesWithRelPro : SAEFeature
§2.2, Map 107.2: Postnominal relative clauses introduced by an inflecting relative pronoun (e.g. der/die/das, qui/que).
- havePerfect : SAEFeature
§2.3, Map 107.3: Transitive perfect formed by 'have' + past participle.
- nominativeExperiencers : SAEFeature
§2.4, Map 107.4: Predominant generalization of experiencer-as-nominative (English-style I like it) over inverting (it pleases me).
- participialPassive : SAEFeature
§2.5, Map 107.5: Canonical participial passive with copula + participle.
- anticausativeProminence : SAEFeature
§2.6, Map 107.6: Anticausative-prominent inchoative–causative pairs.
- dativeExternalPossessor : SAEFeature
§2.7, Map 107.7: Dative external possessors (e.g. German Die Mutter wäscht dem Kind die Haare).
- negativePronounsNoVerbalNeg : SAEFeature
§2.8, Map 107.8: Negative pronouns without obligatory verbal negation (V + NI type, e.g. French personne ne vient, German niemand kommt).
- particleComparative : SAEFeature
§2.9, Map 107.9: Particle comparatives (English than, Latin quam).
- relativeBasedEquative : SAEFeature
§2.10, Map 107.10: Equative constructions based on relative-clause structure (Catalan tan Z com X).
- strictAgreement : SAEFeature
§2.11, Map 107.11: Strict subject agreement — subject affixes that cannot stand alone with referential force (German ich arbeite, not arbeite).
- intensifierReflexiveDifferentiation : SAEFeature
§2.12, Map 107.12: Differentiated intensifier vs. reflexive forms (German selbst vs. sich, Russian sam vs. sebja).
Instances For
Equations
- Haspelmath2001.instDecidableEqSAEFeature x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- Haspelmath2001.instReprSAEFeature = { reprPrec := Haspelmath2001.instReprSAEFeature.repr }
Equations
- One or more equations did not get rendered due to their size.
WALS-derived: languages classified by F37A (Definite Articles) as having a definite article (any of the three positive values).
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS-derived: languages classified by F38A (Indefinite Articles) as having any indefinite article distinct from "no indefinite article" or "indefinite-only of an unrelated kind".
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS-derived intersection: languages with both a definite and an indefinite article in the WALS data, matching [Has01] §2.1.
Equations
Instances For
Languages with both definite and indefinite articles (Map 107.1).
The paper's reading: Romance, Germanic (except Icelandic), Greek, Albanian, Macedonian, Bulgarian (the edin particle is treated as a budding indefinite article), and Hungarian. Icelandic is excluded — it has a suffixed definite article but no indefinite article.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with relative-pronoun relative clauses (Map 107.2).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with the 'have'-perfect (Map 107.3).
[Has01] §2.3 restricts this isogloss to the Romance and Germanic families plus a Balkan/peripheral fringe — Albanian, Greek, Macedonian (an innovation: ima + verbal adjective), and (parts of) Czech. Bulgarian and Serbo-Croatian retain the inherited Slavic 'be'+l-participle perfect rather than the 'have' construction.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with predominant nominative-experiencer coding (Map 107.4).
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS-derived parallel: languages classified by F107A (Passive Constructions) as having a passive present. F107A counts any passive (periphrastic, morphological, etc.), so this is a strict superset of Haspelmath's copula+participle criterion.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with a canonical participial passive (Map 107.5).
The Romance and Germanic copula+participle pattern, extended through Slavic and Baltic and into Greek, Albanian, Macedonian, and Irish.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Anticausative-prominent languages (Map 107.6: ≥ 70% anticausative).
[Has01] §2.6 / Map 107.6 marks only the languages whose inchoative–causative pairs are anticausative-prominent on the Haspelmath 1993 figures. Romance is partially excluded (only French/Romanian register as prominent; Italian/Spanish/Portuguese fall on the causative-prominent side). The full SAE marking is German, French, Russian, Greek, Romanian, Lithuanian, English.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with dative external possessors (Map 107.7).
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS-derived parallel: languages classified by F115A (Negative Indefinite Pronouns and Predicate Negation) as not requiring predicate negation alongside the negative pronoun. Strict subset of Haspelmath's criterion: F115A.noPredicateNegation captures only the rigid V+NI type (predominantly Germanic), whereas Haspelmath's §2.8 also includes Romance languages where the predicate negative is optional or weakening.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with V + NI negation (no obligatory verbal negation; Map 107.8).
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS-derived parallel: languages classified by F121A (Comparative Constructions) as having a particle comparative.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with particle comparatives (Map 107.9).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with relative-based equatives (Map 107.10).
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS-derived parallel: languages classified by F101A (Expression of Pronominal Subjects) as requiring obligatory subject pronouns.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with strict subject agreement (Map 107.11).
A language has strict agreement when subject pronouns are obligatory even in the presence of subject agreement on the verb (i.e., subject-agreement affixes lack referential force on their own). Russian and the pro-drop Romance languages (Italian, Spanish, Portuguese, Romanian) fail this criterion. Welsh has rich agreement but allows pro-drop, so it is also excluded.
Equations
- One or more equations did not get rendered due to their size.
Instances For
WALS-derived parallel: languages classified by F47A (Intensifiers and Reflexive Pronouns) as having differentiated forms.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Languages with differentiated intensifier vs. reflexive forms (Map 107.12).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Dispatch from feature to its isogloss (as a Finset).
Equations
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.definiteIndefiniteArticles = Haspelmath2001.articleLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.relativeClausesWithRelPro = Haspelmath2001.relProLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.havePerfect = Haspelmath2001.havePerfectLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.nominativeExperiencers = Haspelmath2001.nomExpLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.participialPassive = Haspelmath2001.particPassiveLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.anticausativeProminence = Haspelmath2001.anticausativeLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.dativeExternalPossessor = Haspelmath2001.dativeExtPossLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.negativePronounsNoVerbalNeg = Haspelmath2001.vplusNILgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.particleComparative = Haspelmath2001.particleCompLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.relativeBasedEquative = Haspelmath2001.relEquativeLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.strictAgreement = Haspelmath2001.strictAgrLgs
- Haspelmath2001.isoglossFinset Haspelmath2001.SAEFeature.intensifierReflexiveDifferentiation = Haspelmath2001.intRefDiffLgs
Instances For
The four reference samples for evaluating areality, per [Has01] §1.
area: the core European languages (Romance, Germanic, Balkan, Balto-Slavic, marginal Finno-Ugric) that the paper proposes as SAE.cofamilial: other Indo-European branches (eastern IE: Iranian, Indic, Armenian) that lie outside the area; presence of a feature in these would point to PIE inheritance rather than areal contact.adjacent: geographically adjacent non-SAE languages (Celtic west, Turkic- Nakh-Daghestanian east, Semitic south); presence here would suggest a wider regional drift rather than a strictly European phenomenon.
world: a small worldwide sample for the (iv) "not common worldwide" criterion.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Standard Average European as a LinguisticArea: the 12 diagnostic
features of [Has01] §2 over the European/cofamilial/
adjacent/world reference frame.
LinguisticArea does not require every diagnostic feature to satisfy
the strict IsAreal predicate at any particular threshold — and
indeed, several SAE features (anticausative prominence, V+NI negation,
strict agreement) do not pass strict majority on Haspelmath's own data.
This matches the paper: [Has01]'s actual argument runs
through the cluster maps of §4, not through per-feature majority
thresholds.
Per-feature IsArealAt claims for the strongly-attested subset are
proved separately below; clusterScore and nucleus are computed
across all 12.
Equations
- Haspelmath2001.sae = { reference := Haspelmath2001.europeanReference, features := Finset.univ, isogloss := Haspelmath2001.isoglossFor }
Instances For
Definite + indefinite articles (Map 107.1) is areal at the strict 1/2 threshold: ubiquitous in the area, absent from cofamilial/adjacent/world samples.
The 'have'-perfect (Map 107.3) is areal at the strict 1/2 threshold.
Particle comparatives (Map 107.9) are areal at the strict 1/2 threshold.
French sits in the SAE cluster nucleus: it exhibits all 12 diagnostic features (the maximum cluster score).
German sits in the SAE cluster nucleus alongside French — [Has01] §4's Charlemagne Sprachbund core.
The SAE feature inventory has 12 features, matching Maps 107.1–107.12 of [Has01].