@cite{downing-1996} — Numeral Classifier Systems: The Case of Japanese #
@cite{downing-1996} @cite{aikhenvald-2000} @cite{chierchia-1998}
Formalizes core contributions from Downing's monograph on Japanese numeral classifiers (Studies in Discourse and Grammar, vol. 4).
Two Central Hypotheses #
Hypothesis 1 (Universal semantic trends): Classifier categories encode culturally significant categories defined by physical, functional, and social interaction. The choice of which features are exploited is culture-dependent.
Hypothesis 2 (Semantic supplementation): Classifiers systematically supplement the information carried by nouns — the classifier system provides categorization independent of and additional to the common noun system.
UNVERIFIED: chapter/section/table location references throughout this file were inherited from earlier formalization work and have not been cross-checked against the @cite{downing-1996} monograph. Specific cell counts in the frequency tables below should be treated as illustrative of the qualitative shape of the data, not as verbatim transcriptions.
Individuation Function #
@cite{downing-1996} treats classifier phrases and plural markers as individuators. @cite{chierchia-1998}'s later Nominal Mapping Parameter provides a formal framework for this insight: in [+arg, -pred] languages, bare nouns denote kinds, and classifiers supply individuation for enumeration. The bridge to @cite{chierchia-1998} is a linglib contribution, not one Downing herself makes (Chierchia 1998 postdates this monograph). The strict NMP correlation has been challenged (e.g., Turkish has bare arguments without classifiers; Li 2013 argues Chinese nouns are not uniformly mass), but the core insight — that classifiers relate to individuation — is preserved in current work, with the mechanism (atomization vs. unitization) still debated.
Anaphoric Use #
Classifier phrases (numeral + classifier without accompanying noun) serve as anaphoric devices in discourse, occupying a unique niche between zero anaphora (short range) and full lexical NPs (long range). Qualitative findings (UNVERIFIED specific percentages): 人 nin dominates the anaphoric distribution; numeral 2 dominates the numeral distribution; striking distance is intermediate between pronouns and full NPs.
Shape Dimensionality #
Shape-based classifiers decompose along a 1D/2D/3D dimensionality axis
per @cite{allan-1977}, formalized via Fragments.Japanese.Classifier.shapeDim.
Core Inventory #
All 27 classifiers from Downing's core inventory (UNVERIFIED: claimed
to be Table 1.1) are represented in the Japanese fragment
(Fragments.Japanese.Classifier.core), including the homophonous
軒 kenBuilding / 件 kenIncident pair, the maritime size split
(隻 seki / 艘 soo), and the two building classifiers (軒 kenBuilding / 棟 mune).
Frequency Distribution #
UNVERIFIED: the n=500 corpus sample and specific cell counts below were inherited from prior formalization work and have not been verified against the monograph. The qualitative pattern they encode is well-attested (extreme Zipfian skew with 人 nin and つ tsu dominating; shape-based classifiers collectively outweigh function-based ones), but the precise numbers should be regarded as a placeholder until reconfirmed.
Shape-based classifiers in the Japanese inventory decompose into three dimensionality classes (Downing 1996, UNVERIFIED location).
Equations
- Downing1996.shapeClassifiers = List.filter (fun (c : Fragments.Japanese.Classifier) => decide (c.Encodes Typology.SemanticParameter.shape)) Fragments.Japanese.Classifier.all
Instances For
At least 5 classifiers in the inventory encode shape.
All three shape dimensions (1D, 2D, 3D) are attested.
The animacy hierarchy in Japanese classifiers: human (nin/mei) > large animal (tou) > small animal (hiki) > inanimate (tsu). Each level is distinguished by a distinct classifier.
The human classifier has a formal register variant (名 mei) marking
formality/register of the speech act, unlike the neutral 人 nin.
Note: this indexes register (formal vs casual) rather than
socialStatus (honorific status of the referent — kin, age, social
rank).
Distribution of classifiers in anaphoric examples (Downing 1996, UNVERIFIED location).
- classifier : Fragments.Japanese.Classifier
- count : ℕ
Instances For
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
UNVERIFIED: cell-precise counts (nin=48, tsu=4, hiki=2, wa=1) inherited
from prior formalization. The qualitative claim — that the anaphoric
distribution is dominated by -nin and skewed against the long tail —
is well-attested; the precise counts have not been cross-checked.
Equations
- One or more equations did not get rendered due to their size.
Instances For
UNVERIFIED: total anaphoric classifier examples = 55 (sum of cells in
anaphoricClassifierData).
UNVERIFIED specific count: 人 nin dominates anaphoric classifier use at 48/55 ≈ 87%. The dominance claim is qualitatively robust; the precise count is unverified.
Distribution of numerals in anaphoric classifier examples (Downing 1996, UNVERIFIED location). Numeral 1 is absent — explained by competition with zero anaphora.
- numeral : ℕ
- count : ℕ
Instances For
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
UNVERIFIED: cell-precise counts (n=2:41, n=3:12, n=4:1, n=5:1) inherited from prior formalization.
Equations
Instances For
UNVERIFIED specific count: numeral 2 dominates anaphoric use at 41/55 ≈ 75%.
Numeral 1 is absent from anaphoric classifier constructions. Explanation: 'one' + CL has low contrastive information potential and competes with zero anaphora.
(UNVERIFIED location: Ch. 7) discusses classifier phrases as individuators. @cite{chierchia-1998}'s later linking hypothesis formalizes this: [+arg, -pred] languages have kind-denoting bare nouns and need classifiers for individuation. The strict correlation is contested (see module docstring), but the co-occurrence is robustly attested.
Witnessed by: Japanese is [+arg, -pred] AND has numeral classifiers.
In @cite{chierchia-1998}'s framework, [+arg, -pred] languages have no type-shift blocking (no articles), so bare nouns freely occur as arguments. Classifiers rather than articles provide individuation.
Non-default classifiers encode at least one semantic parameter,
confirming they carry individuation-relevant information beyond
mere enumeration. The default classifier つ is the only one
that enumerates without individuating. Delegates to the structural
theorem in Fragments.Japanese.Classifier.
Seven recurrent semantic relations between the independent sense
of the classifier morpheme and the classifier category.
Six are from Downing 1996 (UNVERIFIED location: Ch. 5, Table 5.2); the seventh
(sharedQuality) is attested in non-Japanese languages and noted
by Downing as recurring cross-linguistically but absent in Japanese.
- identicalClass : MorphemeCategoryRelation
Morpheme denotes a class identical/superordinate to the category. e.g., 件 ken 'matter' → classifier for incidents.
- partOfMembers : MorphemeCategoryRelation
Morpheme denotes a part possessed by category members. e.g., 頭 tou 'head' → classifier for large animals.
- associatedAction : MorphemeCategoryRelation
Morpheme denotes an action associated with category members. e.g., 通 tsuu 'to pass' → classifier for letters/documents.
- exemplar : MorphemeCategoryRelation
Morpheme denotes an exemplar possessing the defining traits. e.g., 筋 suji 'sinew' → classifier for long, slender objects.
- creationAction : MorphemeCategoryRelation
Morpheme denotes the action creating category members. e.g., 巻 maki 'to roll up' → classifier for scrolls.
- beneficiaryGoal : MorphemeCategoryRelation
Morpheme denotes the beneficiary/goal of category activity. e.g., 足 soku 'foot' → classifier for pairs of footwear.
Instances For
Equations
- Downing1996.instDecidableEqMorphemeCategoryRelation x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
A witness pairing a classifier with its morpheme (UNVERIFIED location: Table 5.2)-category relation and the independent meaning of the morpheme.
- classifier : Fragments.Japanese.Classifier
- relation : MorphemeCategoryRelation
- independentMeaning : String
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Concrete morpheme (UNVERIFIED location: Table 5.2)-category relation assignments for classifiers in our inventory. Each entry records the classifier, the relation type, and the independent lexical meaning of the morpheme that motivates the relation.
Equations
- One or more equations did not get rendered due to their size.
Instances For
At least four of the six Japanese-attested relation types are witnessed in the inventory (Types 1, 2, 3, 6).
All witnesses reference classifiers in our inventory.
Trivial since Classifier.mem_all says every constructor is in all.
The Japanese classifier inventory includes both sortal and mensural classifiers, with sortal classifiers dominating.
Function-based classifiers are the largest semantic group, confirming Downing's observation that the system concentrates on interactionally significant categories.
The core inventory (UNVERIFIED location: Table 1.1) has exactly 27 classifiers.
Every core classifier is in the full inventory.
Trivial by construction: all := core ++ extended ++ sudoAdditions.
The core inventory distinguishes two homophonous ken classifiers:
軒 kenBuilding and 件 kenIncident — different kanji, different
semantic domains.
Two building classifiers exist: 軒 kenBuilding (functional capacity —
home/shop) and 棟 mune (roofed structure).
Two maritime classifiers exist: 隻 seki (large boats) and 艘 soo (small boats), paralleling the animacy size split (頭 tou / 匹 hiki).
Frequency data (UNVERIFIED location: Ch. 3, Table 3.1): raw counts of classifiers in a 500-form corpus sample (first 50 uses from each of five works of fiction + 250 forms from transcribed conversations and oral narrative).
- classifier : Fragments.Japanese.Classifier
- count : ℕ
Instances For
Equations
- Downing1996.instReprFrequencyEntry = { reprPrec := Downing1996.instReprFrequencyEntry.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
UNVERIFIED: 17-cell frequency distribution claimed to be from
Downing 1996's n=500 corpus sample. Cells inherited from prior
formalization and not cross-checked against the monograph; the
qualitative shape (extreme Zipfian skew, -nin and -tsu
dominating, shape-based collectively outweighing function-based)
is well-attested but the precise counts are placeholders.
Equations
- One or more equations did not get rendered due to their size.
Instances For
人 nin is the single most frequent classifier (qualitatively robust). UNVERIFIED specific count: 201/500 ≈ 40%.
人 nin and つ tsu together dominate the distribution (qualitatively robust). UNVERIFIED specific arithmetic: 201 + 115 = 316 ≈ 63% of 500.
The top five classifiers (nin, tsu, hiki, hon, mai) account for the bulk of the distribution (Zipfian skew, qualitatively robust). UNVERIFIED specific arithmetic: top-5 sum = 410 ≈ 82% of 500.
Quality classifiers (shape-based: hon, mai, ko) are collectively more frequent than any individual kind classifier (function-based like ken, dai). observes that "classifiers denoting categories united by a common shape ... are used relatively more often than most of the 'kind-based' classifiers."