San Martín Peras Mixtec (SMPM) Fragment
Language data for San Martín Peras Mixtec (ISO: jmx), an Oto-Manguean language spoken by about 12,000 people in Oaxaca, Mexico. SMPM is predicate-initial (VSO) and non-pro-drop: all clauses require overt subjects and all transitive clauses require overt objects.
Coverage
- Morpho-aspectual system: completive, continuous, irrealis
- Pronoun paradigm ((4), (5), (61), (62)): PersonalPronoun entries in two series, clitic vs non-clitic, with per-entry C&S strength
- Embedded clause typology (three-way: finite, tensed subj., untensed subj.)
- Complement-taking predicate classification by clause type selected
SMPM's three morpho-aspectual categories.
All clauses must be marked with one of these. SMPM lacks morphologically nonfinite predicates: the completive/continuous/irrealis distinction is the only TAM system. Aspect is primarily tonal: completive is marked by low tone or the prefix nì-, continuous by high tone, and irrealis by mid/unmarked tone or stem changes.
- comp : Aspect
Completive (COMP): low tone on first vowel, or prefix nì-
- cont : Aspect
Continuous (CONT): high tone on first vowel
- irr : Aspect
Irrealis (IRR): mid/unmarked tone or stem changes
Instances For
Equations
- Mixtec.SMPM.instDecidableEqAspect x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- Mixtec.SMPM.instReprAspect = { reprPrec := Mixtec.SMPM.instReprAspect.repr }
Equations
- Mixtec.SMPM.instReprAspect.repr Mixtec.SMPM.Aspect.comp prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Mixtec.SMPM.Aspect.comp")).group prec✝
- Mixtec.SMPM.instReprAspect.repr Mixtec.SMPM.Aspect.cont prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Mixtec.SMPM.Aspect.cont")).group prec✝
- Mixtec.SMPM.instReprAspect.repr Mixtec.SMPM.Aspect.irr prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Mixtec.SMPM.Aspect.irr")).group prec✝
Instances For
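The three-way inventory above can be restated as a small standalone Lean sketch. The namespace `AspectSketch` and the helper `Aspect.exponent` are illustrative names, not part of the `Mixtec.SMPM` API; the exponent strings only repeat the descriptions given for each constructor.

```lean
namespace AspectSketch

/-- The three morpho-aspectual categories, mirroring the constructors above. -/
inductive Aspect where
  | comp | cont | irr
  deriving DecidableEq, Repr

/-- Hypothetical helper: prose description of each aspect's primary exponent. -/
def Aspect.exponent : Aspect → String
  | .comp => "low tone on first vowel, or prefix nì-"
  | .cont => "high tone on first vowel"
  | .irr  => "mid/unmarked tone or stem changes"

/-- Every clause carries one of exactly these three aspects. -/
theorem aspect_exhaustive (a : Aspect) : a = .comp ∨ a = .cont ∨ a = .irr := by
  cases a
  · exact Or.inl rfl
  · exact Or.inr (Or.inl rfl)
  · exact Or.inr (Or.inr rfl)

end AspectSketch
```

Because the type has no fourth constructor, "SMPM lacks morphologically nonfinite predicates" is reflected in the type itself rather than in a separate side condition.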
Grammatical genders for nonlocal (3rd person) pronouns ((5)).
SMPM distinguishes six genders in the singular and two in the plural. There is no number distinction for most nonlocal pronouns: e.g., =rà 'he, they (all-male group)'.
Instances For
Equations
- Mixtec.SMPM.instDecidableEqGender x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- Mixtec.SMPM.instReprGender = { reprPrec := Mixtec.SMPM.instReprGender.repr }
Equations
- Mixtec.SMPM.instReprGender.repr Mixtec.SMPM.Gender.neutral prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Mixtec.SMPM.Gender.neutral")).group prec✝
- Mixtec.SMPM.instReprGender.repr Mixtec.SMPM.Gender.fem prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Mixtec.SMPM.Gender.fem")).group prec✝
- Mixtec.SMPM.instReprGender.repr Mixtec.SMPM.Gender.masc prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Mixtec.SMPM.Gender.masc")).group prec✝
- Mixtec.SMPM.instReprGender.repr Mixtec.SMPM.Gender.liq prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Mixtec.SMPM.Gender.liq")).group prec✝
- Mixtec.SMPM.instReprGender.repr Mixtec.SMPM.Gender.wd prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Mixtec.SMPM.Gender.wd")).group prec✝
- Mixtec.SMPM.instReprGender.repr Mixtec.SMPM.Gender.aml prec✝ = Repr.addAppParen (Std.Format.nest (if prec✝ ≥ 1024 then 1 else 2) (Std.Format.text "Mixtec.SMPM.Gender.aml")).group prec✝
Instances For
Map SMPM gender to the shared surface-level gender type. Only fem/masc map directly; the remaining four genders (neutral, liquid, wooden, animal) are language-specific noun class distinctions without cross-linguistic surface equivalents.
Equations
Instances For
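A minimal sketch of the gender inventory follows, using the six constructor names visible in the `Repr` equations above. The values that the four language-specific genders receive under `toGender` are not shown on this page, so the sketch records only whether a gender maps directly. `GenderSketch`, `Gender.mapsDirectly`, and `Gender.all` are hypothetical names introduced for illustration.

```lean
namespace GenderSketch

/-- The six singular genders of nonlocal pronouns. -/
inductive Gender where
  | neutral | fem | masc | liq | wd | aml
  deriving DecidableEq, Repr

/-- Hypothetical predicate: does this gender have a direct surface equivalent? -/
def Gender.mapsDirectly : Gender → Bool
  | .fem | .masc => true
  | _ => false

/-- Hypothetical enumeration of all six genders. -/
def Gender.all : List Gender := [.neutral, .fem, .masc, .liq, .wd, .aml]

-- Six genders in the singular.
example : Gender.all.length = 6 := by decide

-- Only fem and masc map directly.
example : (Gender.all.filter Gender.mapsDirectly).length = 2 := by decide

end GenderSketch
```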
Clitic series ((4), (61))
SMPM distinguishes clitic and non-clitic pronouns ([CS99a],
cited at [Ost26] (63)): clitics cannot be coordinated (63a), cannot
occur on their own (63b), may have impersonal readings (63c), and cannot bear
focus ((65), with íntàà 'only'; speaker comment at (66)). Each entry
declares the C&S class on the shared Pronoun.strength field.
Equations
- Mixtec.SMPM.cl1sg = { form := "=ì", person := some Person.first, number := some Number.singular, strength := some Pronoun.Strength.clitic }
Instances For
Equations
- Mixtec.SMPM.cl1plIncl = { form := "=(y)é", person := some Person.firstInclusive, number := some Number.plural, strength := some Pronoun.Strength.clitic }
Instances For
Equations
- Mixtec.SMPM.cl1plExcl = { form := "=ndú", person := some Person.firstExclusive, number := some Number.plural, strength := some Pronoun.Strength.clitic }
Instances For
Equations
- Mixtec.SMPM.cl2sg = { form := "=ú", person := some Person.second, number := some Number.singular, strength := some Pronoun.Strength.clitic }
Instances For
Equations
- Mixtec.SMPM.cl2pl = { form := "=ndó", person := some Person.second, number := some Number.plural, strength := some Pronoun.Strength.clitic }
Instances For
Surface form of the nonlocal (3rd person) clitic for each gender ((5); the neutral has a prevocalic allomorph =(y)à).
Equations
- Mixtec.SMPM.Gender.neutral.cliticForm = "=ñà"
- Mixtec.SMPM.Gender.fem.cliticForm = "=ñá"
- Mixtec.SMPM.Gender.masc.cliticForm = "=rà"
- Mixtec.SMPM.Gender.liq.cliticForm = "=rá"
- Mixtec.SMPM.Gender.wd.cliticForm = "=tún"
- Mixtec.SMPM.Gender.aml.cliticForm = "=rí"
Instances For
Nonlocal (3rd person) clitic by gender ((5)). number := none: most
nonlocal pronouns make no number distinction (=rà 'he, they (all-male
group)'); the API gender is derived via Gender.toGender.
Equations
- Mixtec.SMPM.cl3 g = { form := g.cliticForm, person := some Person.third, gender := some g.toGender, strength := some Pronoun.Strength.clitic }
Instances For
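The construction of `cl3` can be sketched in isolation as below. This is a simplified stand-in, not the shared API: the `Pronoun` structure here omits the `gender` field, and the constructor inventories of `Person`, `Number`, and `Strength` are assumptions based on the names that appear in the entries above (the `weak` class is assumed from the Cardinaletti–Starke three-way typology).

```lean
namespace Cl3Sketch

inductive Person where
  | first | firstInclusive | firstExclusive | second | third
  deriving DecidableEq, Repr

inductive Number where
  | singular | plural
  deriving DecidableEq, Repr

inductive Strength where
  | clitic | weak | strong
  deriving DecidableEq, Repr

/-- Simplified pronoun entry; unspecified features default to `none`. -/
structure Pronoun where
  form : String
  person : Option Person := none
  number : Option Number := none
  strength : Option Strength := none
  deriving Repr

inductive Gender where
  | neutral | fem | masc | liq | wd | aml
  deriving DecidableEq, Repr

def Gender.cliticForm : Gender → String
  | .neutral => "=ñà"
  | .fem => "=ñá"
  | .masc => "=rà"
  | .liq => "=rá"
  | .wd => "=tún"
  | .aml => "=rí"

/-- Nonlocal clitic by gender; `number` is deliberately left unspecified. -/
def cl3 (g : Gender) : Pronoun :=
  { form := g.cliticForm, person := some .third, strength := some .clitic }

-- =rà covers both 'he' and 'they (all-male group)'.
example : (cl3 .masc).number = none := rfl
example : (cl3 .masc).form = "=rà" := rfl

end Cl3Sketch
```

Leaving `number` as `none` rather than listing a singular and a plural entry with the same form keeps the lexicon free of duplicate entries that differ only in a feature the language does not mark.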
=nà — nonlocal plural, neutral ((5)).
Equations
- Mixtec.SMPM.cl3plNeutral = { form := "=nà", person := some Person.third, number := some Number.plural, gender := some Gender.common, strength := some Pronoun.Strength.clitic }
Instances For
=ná — nonlocal plural, feminine ((5)).
Equations
- Mixtec.SMPM.cl3plFem = { form := "=ná", person := some Person.third, number := some Number.plural, gender := some Gender.feminine, strength := some Pronoun.Strength.clitic }
Instances For
Non-clitic series ((4), (62))
Focusable and coordinable ((64)–(66)) — the C&S .strong class. (62) has a
gap: 1PL.INCL and the nonlocal persons lack dedicated non-clitic forms;
'strengthening' strategies fill the gap phrasally — clitic + demonstrative
(yé yo'o 'we (INCL) here', =ra kan 'he there' in (65b)) or the definite
article mí (mí =rà 'himself', also the reflexive, §7 below; cf.
McCloskey & Hale 1984 on Irish). Being phrasal, they are not lexical
entries here.
Equations
- Mixtec.SMPM.str1sg = { form := "yù'u", person := some Person.first, number := some Number.singular, strength := some Pronoun.Strength.strong }
Instances For
Equations
- Mixtec.SMPM.str1plExcl = { form := "ndú'ú", person := some Person.firstExclusive, number := some Number.plural, strength := some Pronoun.Strength.strong }
Instances For
Equations
- Mixtec.SMPM.str2sg = { form := "yô'o", person := some Person.second, number := some Number.singular, strength := some Pronoun.Strength.strong }
Instances For
Equations
- Mixtec.SMPM.str2pl = { form := "ndó'ó", person := some Person.second, number := some Number.plural, strength := some Pronoun.Strength.strong }
Instances For
Series structure
The clitic series ((4), (5), (61)).
Equations
- One or more equations did not get rendered due to their size.
Instances For
The non-clitic series ((4), (62)).
Equations
Instances For
The local-person clitic/non-clitic pairs ((61)–(62)); none marks the
(62) gap (1PL.INCL, filled only by phrasal strengthening).
Equations
- One or more equations did not get rendered due to their size.
Instances For
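Since the equations for the pairing were not rendered, here is a hedged sketch of one possible shape for it, using bare form strings instead of full pronoun entries. The name `localPairs` and the `String × Option String` representation are assumptions; the forms themselves are those of the clitic and non-clitic entries listed earlier.

```lean
namespace PairSketch

/-- Clitic form paired with its non-clitic counterpart, if one exists. -/
def localPairs : List (String × Option String) :=
  [ ("=ì",    some "yù'u"),   -- 1SG
    ("=(y)é", none),          -- 1PL.INCL: the (62) gap
    ("=ndú",  some "ndú'ú"),  -- 1PL.EXCL
    ("=ú",    some "yô'o"),   -- 2SG
    ("=ndó",  some "ndó'ó") ] -- 2PL

-- Five local-person clitics in total.
example : localPairs.length = 5 := by decide

-- Exactly one of them lacks a dedicated non-clitic form.
example : (localPairs.filter (fun p => p.2.isNone)).length = 1 := by decide

end PairSketch
```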
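Both series are strength-homogeneous: every entry in the clitic series is declared clitic and every entry in the non-clitic series is declared strong. The per-series C&S facts are thus recorded as per-entry data on the shared field.
Within each pair, the clitic is strictly more deficient than its non-clitic counterpart. This is derived from the shared deficiency order, not stipulated.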
The Cardinaletti–Starke class required of a controlled subject in an untensed subjunctive: the clitic (most deficient). Non-clitic forms — including strengthened mí =rà — are sharply ungrammatical there ((67)).
Instances For
SMPM's ban on non-clitic controlled subjects ((67)) realizes the Cardinaletti–Starke prediction: the required class sits strictly below every non-clitic entry's declared class in the deficiency order.
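The deficiency reasoning can be illustrated with a standalone sketch. The numeric `rank` (lower means more deficient) is a hypothetical encoding of the Cardinaletti–Starke order clitic < weak < strong, and the names `DeficiencySketch`, `Strength.rank`, and `controlledSubjectClass` are assumptions for illustration; the shared library may encode the order differently.

```lean
namespace DeficiencySketch

inductive Strength where
  | clitic | weak | strong
  deriving DecidableEq, Repr

/-- Hypothetical rank: a lower number means a more deficient class. -/
def Strength.rank : Strength → Nat
  | .clitic => 0
  | .weak => 1
  | .strong => 2

/-- Class required of a controlled subject in an untensed subjunctive. -/
def controlledSubjectClass : Strength := .clitic

/-- The clitic class sits strictly below every other class. -/
theorem clitic_most_deficient (s : Strength) (h : s ≠ .clitic) :
    Strength.rank .clitic < Strength.rank s := by
  cases s
  · exact absurd rfl h
  · decide
  · decide

-- In particular, the required class is strictly below `.strong`,
-- the class declared by every non-clitic entry.
example : controlledSubjectClass.rank < Strength.rank .strong := by decide

end DeficiencySketch
```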
SMPM's three embedded clause types, distinguished by three binary features (table 26).
SMPM lacks morphologically nonfinite predicates: all clauses are marked with one of the three aspects. The "tensed" vs "untensed" subjunctive distinction is diagnosed by independent temporal adverbs and noncoreferential subject availability, not by overt tense morphology.
- finiteEmbedded : EmbeddedClauseType
Finite embedded clause: unrestricted TAM, free subject reference, no restructuring. Selected by: ka'án 'think', nakanini 'believe', kà'àn 'say', káchi 'said', kusijǐ ini 'be happy', etc.
- tensedSubjunctive : EmbeddedClauseType
Tensed subjunctive: restricted TAM (irrealis only), free subject reference, no restructuring. Allows ná for disjoint reference. Selected by: kòni 'want', sǐso ini 'hate', ntatu 'hope', etc.
- untensedSubjunctive : EmbeddedClauseType
Untensed subjunctive: restricted TAM (irrealis only), obligatory coreference, mandatory restructuring. The subject must be an overt clitic pronoun. Selected by: ntùkú 'try', nàkú'ún 'remember', kìxà 'start', sakwā'a 'learn', etc.
Instances For
Equations
- Mixtec.SMPM.instDecidableEqEmbeddedClauseType x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Properties distinguishing the three clause types (table 26).
- unrestrictedTAM : Bool
Unrestricted TAM morphology (all three aspects available)
- noncoreferentialSubject : Bool
Noncoreferential embedded subject allowed
- restructuring : Bool
Shows restructuring effects (quantifier fronting targets matrix)
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Equations
- Mixtec.SMPM.clauseProperties Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded = { unrestrictedTAM := true, noncoreferentialSubject := true, restructuring := false }
- Mixtec.SMPM.clauseProperties Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive = { unrestrictedTAM := false, noncoreferentialSubject := true, restructuring := false }
- Mixtec.SMPM.clauseProperties Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive = { unrestrictedTAM := false, noncoreferentialSubject := false, restructuring := true }
Instances For
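One consequence of the table is that the three feature bundles are pairwise distinct, so the features suffice to identify the clause type. The following standalone sketch re-declares the types from this section under a hypothetical namespace `ClauseSketch` and checks that claim; the theorem name is an assumption and does not appear in the original module.

```lean
namespace ClauseSketch

inductive EmbeddedClauseType where
  | finiteEmbedded | tensedSubjunctive | untensedSubjunctive
  deriving DecidableEq, Repr

structure ClauseProperties where
  unrestrictedTAM : Bool
  noncoreferentialSubject : Bool
  restructuring : Bool
  deriving DecidableEq, Repr

/-- Feature values copied from the equations above. -/
def clauseProperties : EmbeddedClauseType → ClauseProperties
  | .finiteEmbedded =>
      { unrestrictedTAM := true, noncoreferentialSubject := true, restructuring := false }
  | .tensedSubjunctive =>
      { unrestrictedTAM := false, noncoreferentialSubject := true, restructuring := false }
  | .untensedSubjunctive =>
      { unrestrictedTAM := false, noncoreferentialSubject := false, restructuring := true }

/-- Distinct clause types never share a feature bundle. -/
theorem clauseProperties_injective :
    ∀ a b : EmbeddedClauseType, clauseProperties a = clauseProperties b → a = b := by
  intro a b
  cases a <;> cases b <;> decide

end ClauseSketch
```

Only three of the eight possible Boolean combinations are used, which is why the docstring speaks of a three-way typology rather than a full cross-classification.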
A complement-taking predicate in SMPM.
- form : String
- gloss : String
- selects : EmbeddedClauseType
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- Mixtec.SMPM.instReprCTP = { reprPrec := Mixtec.SMPM.instReprCTP.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
- Mixtec.SMPM.think = { form := "ka'án", gloss := "think", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.believe = { form := "nakanini", gloss := "believe", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.wonder = { form := "kuntàà ini", gloss := "wonder", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.know = { form := "kòni", gloss := "know", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.say = { form := "kà'àn", gloss := "say", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.chat = { form := "ntatǔ'un", gloss := "chat", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.said = { form := "káchi", gloss := "said", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.beHappy = { form := "kusijǐ ini", gloss := "be happy", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.beSad = { form := "ntsi'i ini", gloss := "be sad", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.regret = { form := "ntsiko ini", gloss := "regret", selects := Mixtec.SMPM.EmbeddedClauseType.finiteEmbedded }
Instances For
Equations
- Mixtec.SMPM.want = { form := "kòni", gloss := "want", selects := Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.hate = { form := "sǐso ini", gloss := "hate", selects := Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.beAfraid = { form := "iyì'bí", gloss := "be afraid", selects := Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.beScared = { form := "kuntasí", gloss := "be scared", selects := Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.pray = { form := "nakwatu", gloss := "pray", selects := Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.hope = { form := "ntatu", gloss := "hope", selects := Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.agree = { form := "xiinka", gloss := "agree", selects := Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.refuse = { form := "xǐunka", gloss := "refuse", selects := Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.getIdea = { form := "chikàà ini", gloss := "get the idea", selects := Mixtec.SMPM.EmbeddedClauseType.tensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.try_ = { form := "ntùkú", gloss := "try", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.remember = { form := "nàkú'ún", gloss := "remember", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.forget = { form := "nantōso", gloss := "forget", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.likeTo = { form := "kutō", gloss := "like to", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.start = { form := "kìxà", gloss := "start", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.finish = { form := "ntsi'i", gloss := "finish", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.stop = { form := "xikwīn", gloss := "stop", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.continue_ = { form := "kò xikwīn", gloss := "continue", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.need = { form := "xiniñu'u", gloss := "need", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.knowHow = { form := "kòni xá kasa", gloss := "know how to", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.learnHow = { form := "sakwā'a", gloss := "learn how to", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
Equations
- Mixtec.SMPM.notBother = { form := "kò ntaa", gloss := "not bother", selects := Mixtec.SMPM.EmbeddedClauseType.untensedSubjunctive }
Instances For
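As a sketch of how the classification might be queried, the block below re-declares a few entries and derives a clause property from the selected clause type. It is self-contained and uses the hypothetical namespace `CtpSketch`; the helper `CTP.allowsDisjointSubject` is an illustrative name, not part of the original module.

```lean
namespace CtpSketch

inductive EmbeddedClauseType where
  | finiteEmbedded | tensedSubjunctive | untensedSubjunctive
  deriving DecidableEq, Repr

structure CTP where
  form : String
  gloss : String
  selects : EmbeddedClauseType
  deriving Repr

-- Three of the entries listed above, one per clause type.
def know : CTP := { form := "kòni", gloss := "know", selects := .finiteEmbedded }
def want : CTP := { form := "kòni", gloss := "want", selects := .tensedSubjunctive }
def try_ : CTP := { form := "ntùkú", gloss := "try", selects := .untensedSubjunctive }

/-- Hypothetical helper: may the embedded subject be noncoreferential?
    Only the untensed subjunctive requires coreference. -/
def CTP.allowsDisjointSubject (p : CTP) : Bool :=
  match p.selects with
  | .untensedSubjunctive => false
  | _ => true

example : want.allowsDisjointSubject = true := rfl
example : try_.allowsDisjointSubject = false := rfl

-- The entries for 'know' and 'want' are listed with the same form string.
example : know.form = want.form := rfl

end CtpSketch
```

Keying entries by identifier rather than by `form` is what allows the two kòni entries to coexist with different `selects` values.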
SMPM is non-pro-drop: all clauses require overt subjects (3).
Equations
- Mixtec.SMPM.allowsProDrop = false
Instances For
SMPM is predicate-initial: VSO for verbal, copula-initial for nominal/adjectival predicates (2a–c).
Equations
- Mixtec.SMPM.predicateInitial = true
Instances For
SMPM reflexive anaphors are formed with the definite article mí plus a clitic pronoun (71). They must be locally bound; without mí, only a noncoreferential interpretation is available (72).
Examples:
- Xini Juân mí =rà ini yùtátá. 'Juan saw himself in the mirror.'
- Saá kâ'àn María xa'ǎ mí =ñá. 'María always talks about herself.'
Equations
- Mixtec.SMPM.reflexiveFormation = "mí + clitic pronoun"
Instances For
Quantified nominals can locally bind reflexive anaphors (73).
- Tá'iin'iin =nà bálí xìni mí =nà ini yùtátá. 'Every child saw themselves in the mirror.'
Equations
Instances For
Exempt anaphors (reflexive forms used outside canonical binding domain) CANNOT have quantified antecedents (75, 78).
- *Tá'iin'iin tsǐnà tsìi ndò'ò mí =rí. 'Each dog bit its own tail.'
- *Ni'iin =ná bálí ní- xìni táta mí =ná. 'No girl saw her own father.'
Equations
Instances For
Exempt anaphors occur as possessors (74) but are restricted: they cannot have quantified antecedents.
- Tsìi tsǐnà ndò'ò {=rí, mí =rí}. 'The dog bit {its, its own} tail.'
- Xìni María táta {=ñá, mí =ñá}. 'María saw {her, her own} father.'
Equations
Instances For
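The contrast between locally bound reflexives and exempt anaphors can be summarized in a small sketch. The equations for these declarations were not rendered, so the type `AnaphorUse` and the function `allowsQuantifiedAntecedent` are hypothetical names that encode only the judgments reported in (73), (75), and (78).

```lean
namespace AnaphorSketch

/-- Two uses of the mí + clitic form discussed above. -/
inductive AnaphorUse where
  | localReflexive   -- bound within its clause, as in (71) and (73)
  | exemptPossessor  -- used as a possessor, as in (74)
  deriving DecidableEq, Repr

/-- Can the antecedent be a quantified nominal? -/
def allowsQuantifiedAntecedent : AnaphorUse → Bool
  | .localReflexive => true    -- (73) is grammatical
  | .exemptPossessor => false  -- (75) and (78) are ungrammatical

example : allowsQuantifiedAntecedent .localReflexive = true := rfl
example : allowsQuantifiedAntecedent .exemptPossessor = false := rfl

end AnaphorSketch
```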
Ná is a morpheme used in tensed subjunctives to force disjoint reference when the embedded subject does not match the matrix subject in φ-features (18b–d). It is optional with nonpronominal subjects (18d). Ná does NOT occur with untensed subjunctives (19).
Equations
Instances For
Clitic left-dislocation in SMPM is NOT island-sensitive (80–82): it is available out of adjunct islands and wh-islands. This argues against a movement analysis of left-dislocation.