Documentation

Linglib.Phenomena.Classifiers.Studies.Downing1996

@cite{downing-1996} — Numeral Classifier Systems: The Case of Japanese #

@cite{downing-1996} @cite{aikhenvald-2000} @cite{chierchia-1998}

Formalizes core contributions from Downing's monograph on Japanese numeral classifiers (Studies in Discourse and Grammar, vol. 4).

Two Central Hypotheses #

Hypothesis 1 (Universal semantic trends): Classifier categories encode culturally significant categories defined by physical, functional, and social interaction. The choice of which features are exploited is culture-dependent.

Hypothesis 2 (Semantic supplementation): Classifiers systematically supplement the information carried by nouns — the classifier system provides categorization independent of and additional to the common noun system.

UNVERIFIED: chapter/section/table location references throughout this file were inherited from earlier formalization work and have not been cross-checked against the @cite{downing-1996} monograph. Specific cell counts in the frequency tables below should be treated as illustrative of the qualitative shape of the data, not as verbatim transcriptions.

Individuation Function #

@cite{downing-1996} treats classifier phrases and plural markers as individuators. @cite{chierchia-1998}'s later Nominal Mapping Parameter provides a formal framework for this insight: in [+arg, -pred] languages, bare nouns denote kinds, and classifiers supply individuation for enumeration. The bridge to @cite{chierchia-1998} is a linglib contribution, not one Downing herself makes (Chierchia 1998 postdates this monograph). The strict NMP correlation has been challenged (e.g., Turkish has bare arguments without classifiers; Li 2013 argues Chinese nouns are not uniformly mass), but the core insight — that classifiers relate to individuation — is preserved in current work, with the mechanism (atomization vs. unitization) still debated.

Anaphoric Use #

Classifier phrases (numeral + classifier without accompanying noun) serve as anaphoric devices in discourse, occupying a unique niche between zero anaphora (short range) and full lexical NPs (long range). Qualitative findings (UNVERIFIED specific percentages): 人 nin dominates the anaphoric distribution; numeral 2 dominates the numeral distribution; striking distance is intermediate between pronouns and full NPs.

Shape Dimensionality #

Shape-based classifiers decompose along a 1D/2D/3D dimensionality axis per @cite{allan-1977}, formalized via Fragments.Japanese.Classifier.shapeDim.

Core Inventory #

All 27 classifiers from Downing's core inventory (UNVERIFIED: claimed to be Table 1.1) are represented in the Japanese fragment (Fragments.Japanese.Classifier.core), including the homophonous 軒 kenBuilding / 件 kenIncident pair, the maritime size split (隻 seki / 艘 soo), and the two building classifiers (軒 kenBuilding / 棟 mune).

Frequency Distribution #

UNVERIFIED: the n=500 corpus sample and specific cell counts below were inherited from prior formalization work and have not been verified against the monograph. The qualitative pattern they encode is well-attested (extreme Zipfian skew with 人 nin and つ tsu dominating; shape-based classifiers collectively outweigh function-based ones), but the precise numbers should be regarded as a placeholder until reconfirmed.

Shape-based classifiers in the Japanese inventory decompose into three dimensionality classes (Downing 1996, UNVERIFIED location).

Equations
Instances For

    At least 5 classifiers in the inventory encode shape.

    The human classifier has a formal register variant (名 mei) marking formality/register of the speech act, unlike the neutral 人 nin. Note: this indexes register (formal vs casual) rather than socialStatus (honorific status of the referent — kin, age, social rank).

    Distribution of classifiers in anaphoric examples (Downing 1996, UNVERIFIED location).

    Instances For
      Equations
      • One or more equations did not get rendered due to their size.
      Instances For

        UNVERIFIED: cell-precise counts (nin=48, tsu=4, hiki=2, wa=1) inherited from prior formalization. The qualitative claim — that the anaphoric distribution is dominated by -nin and skewed against the long tail — is well-attested; the precise counts have not been cross-checked.

        Equations
        • One or more equations did not get rendered due to their size.
        Instances For
          theorem Downing1996.anaphoric_total :
          List.foldl (fun (x1 x2 : ) => x1 + x2) 0 (List.map (fun (x : AnaphoricDistribution) => x.count) anaphoricClassifierData) = 55

          UNVERIFIED: total anaphoric classifier examples = 55 (sum of cells in anaphoricClassifierData).

          UNVERIFIED specific count: 人 nin dominates anaphoric classifier use at 48/55 ≈ 87%. The dominance claim is qualitatively robust; the precise count is unverified.

          Distribution of numerals in anaphoric classifier examples (Downing 1996, UNVERIFIED location). Numeral 1 is absent — explained by competition with zero anaphora.

          • numeral :
          • count :
          Instances For
            Equations
            • One or more equations did not get rendered due to their size.
            Instances For

              UNVERIFIED: cell-precise counts (n=2:41, n=3:12, n=4:1, n=5:1) inherited from prior formalization.

              Equations
              Instances For
                theorem Downing1996.two_dominates_anaphoric :
                Option.map (fun (x : NumeralDistribution) => x.count) (List.find? (fun (x : NumeralDistribution) => x.numeral == 2) anaphoricNumeralData) = some 41

                UNVERIFIED specific count: numeral 2 dominates anaphoric use at 41/55 ≈ 75%.

                Numeral 1 is absent from anaphoric classifier constructions. Explanation: 'one' + CL has low contrastive information potential and competes with zero anaphora.

                (UNVERIFIED location: Ch. 7) discusses classifier phrases as individuators. @cite{chierchia-1998}'s later linking hypothesis formalizes this: [+arg, -pred] languages have kind-denoting bare nouns and need classifiers for individuation. The strict correlation is contested (see module docstring), but the co-occurrence is robustly attested.

                Witnessed by: Japanese is [+arg, -pred] AND has numeral classifiers.

                In @cite{chierchia-1998}'s framework, [+arg, -pred] languages have no type-shift blocking (no articles), so bare nouns freely occur as arguments. Classifiers rather than articles provide individuation.

                Non-default classifiers encode at least one semantic parameter, confirming they carry individuation-relevant information beyond mere enumeration. The default classifier つ is the only one that enumerates without individuating. Delegates to the structural theorem in Fragments.Japanese.Classifier.

                Seven recurrent semantic relations between the independent sense of the classifier morpheme and the classifier category. Six are from Downing 1996 (UNVERIFIED location: Ch. 5, Table 5.2); the seventh (sharedQuality) is attested in non-Japanese languages and noted by Downing as recurring cross-linguistically but absent in Japanese.

                • identicalClass : MorphemeCategoryRelation

                  Morpheme denotes a class identical/superordinate to the category. e.g., 件 ken 'matter' → classifier for incidents.

                • partOfMembers : MorphemeCategoryRelation

                  Morpheme denotes a part possessed by category members. e.g., 頭 tou 'head' → classifier for large animals.

                • associatedAction : MorphemeCategoryRelation

                  Morpheme denotes an action associated with category members. e.g., 通 tsuu 'to pass' → classifier for letters/documents.

                • exemplar : MorphemeCategoryRelation

                  Morpheme denotes an exemplar possessing the defining traits. e.g., 筋 suji 'sinew' → classifier for long, slender objects.

                • creationAction : MorphemeCategoryRelation

                  Morpheme denotes the action creating category members. e.g., 巻 maki 'to roll up' → classifier for scrolls.

                • beneficiaryGoal : MorphemeCategoryRelation

                  Morpheme denotes the beneficiary/goal of category activity. e.g., 足 soku 'foot' → classifier for pairs of footwear.

                • sharedQuality : MorphemeCategoryRelation

                  Morpheme independently represents a quality shared by members. e.g., Indonesian bentuk 'curved' → classifier for rings, wheels.

                Instances For
                  @[implicit_reducible]
                  Equations
                  Equations
                  • One or more equations did not get rendered due to their size.
                  Instances For

                    A witness pairing a classifier with its morpheme (UNVERIFIED location: Table 5.2)-category relation and the independent meaning of the morpheme.

                    Instances For
                      Equations
                      • One or more equations did not get rendered due to their size.
                      Instances For

                        Concrete morpheme (UNVERIFIED location: Table 5.2)-category relation assignments for classifiers in our inventory. Each entry records the classifier, the relation type, and the independent lexical meaning of the morpheme that motivates the relation.

                        Equations
                        • One or more equations did not get rendered due to their size.
                        Instances For
                          theorem Downing1996.four_relation_types_attested :
                          (List.map (fun (x : MorphemeRelationWitness) => x.relation) table5_2_witnesses).eraseDups.length 4

                          At least four of the six Japanese-attested relation types are witnessed in the inventory (Types 1, 2, 3, 6).

                          All witnesses reference classifiers in our inventory. Trivial since Classifier.mem_all says every constructor is in all.

                          theorem Downing1996.sortal_dominance :
                          (List.filter (fun (c : Fragments.Japanese.Classifier) => decide ¬decide c.IsMensural = true) Fragments.Japanese.Classifier.all).length > (List.filter (fun (c : Fragments.Japanese.Classifier) => decide c.IsMensural) Fragments.Japanese.Classifier.all).length

                          The Japanese classifier inventory includes both sortal and mensural classifiers, with sortal classifiers dominating.

                          Function-based classifiers are the largest semantic group, confirming Downing's observation that the system concentrates on interactionally significant categories.

                          The core inventory (UNVERIFIED location: Table 1.1) has exactly 27 classifiers.

                          Every core classifier is in the full inventory. Trivial by construction: all := core ++ extended ++ sudoAdditions.

                          The core inventory distinguishes two homophonous ken classifiers: 軒 kenBuilding and 件 kenIncident — different kanji, different semantic domains.

                          Two building classifiers exist: 軒 kenBuilding (functional capacity — home/shop) and 棟 mune (roofed structure).

                          Two maritime classifiers exist: 隻 seki (large boats) and 艘 soo (small boats), paralleling the animacy size split (頭 tou / 匹 hiki).

                          Frequency data (UNVERIFIED location: Ch. 3, Table 3.1): raw counts of classifiers in a 500-form corpus sample (first 50 uses from each of five works of fiction + 250 forms from transcribed conversations and oral narrative).

                          Instances For
                            Equations
                            • One or more equations did not get rendered due to their size.
                            Instances For

                              UNVERIFIED: 17-cell frequency distribution claimed to be from Downing 1996's n=500 corpus sample. Cells inherited from prior formalization and not cross-checked against the monograph; the qualitative shape (extreme Zipfian skew, -nin and -tsu dominating, shape-based collectively outweighing function-based) is well-attested but the precise counts are placeholders.

                              Equations
                              • One or more equations did not get rendered due to their size.
                              Instances For
                                theorem Downing1996.nin_most_frequent :
                                Option.map (fun (x : FrequencyEntry) => x.count) (List.find? (fun (x : FrequencyEntry) => x.classifier == Fragments.Japanese.Classifier.nin) frequencyData) = some 201

                                人 nin is the single most frequent classifier (qualitatively robust). UNVERIFIED specific count: 201/500 ≈ 40%.

                                theorem Downing1996.nin_tsu_dominate :
                                201 + 115 = 316 316 * 100 / 500 = 63

                                人 nin and つ tsu together dominate the distribution (qualitatively robust). UNVERIFIED specific arithmetic: 201 + 115 = 316 ≈ 63% of 500.

                                theorem Downing1996.top_five_dominate :
                                have top5 := [201, 115, 32, 31, 31]; List.foldl (fun (x1 x2 : ) => x1 + x2) 0 top5 = 410 410 * 100 / 500 = 82

                                The top five classifiers (nin, tsu, hiki, hon, mai) account for the bulk of the distribution (Zipfian skew, qualitatively robust). UNVERIFIED specific arithmetic: top-5 sum = 410 ≈ 82% of 500.

                                Quality classifiers (shape-based: hon, mai, ko) are collectively more frequent than any individual kind classifier (function-based like ken, dai). observes that "classifiers denoting categories united by a common shape ... are used relatively more often than most of the 'kind-based' classifiers."