@cite{kwon-lee-2026}: Accessibility Markers in Korean #
@cite{ariel-2001} @cite{carminati-2002} @cite{kweon-2011} @cite{contemori-di-domenico-2021} @cite{zhang-kwon-2022} @cite{choe-2021}
Three experiments test @cite{ariel-2001}'s Accessibility Theory in Korean — a discourse-oriented language without verbal/gender agreement — using null pronouns, overt kyay, and full NPs. The Experiment 3 antecedent-choice data (71% / 43% / 35% subject bias) instantiates the universal accessibility ordering at three points; the relative ordering holds cross-linguistically while the spread is language-specific.
KoreanRefForm is the 3-element domain tested. It carries a
LinearOrder lifted from AccessibilityLevel.rank, so the central
claim "subject bias increases in accessibility" appears as one
StrictMono lemma (subjectBias_strictMono) rather than per-pair
inequalities. Bridges to @cite{kehler-rohde-2013} (topichood),
@cite{carminati-2002} (PAH alternative), and Ariel's
AccessibilityAssessment are provided.
The three Korean referential forms tested across the experiments. Each instantiates a different point on @cite{ariel-2001}'s Accessibility Marking Scale.
- nullPro : KoreanRefForm
Null pronoun (pro): no phonological exponent.
- overt : KoreanRefForm
Overt colloquial 3sg pronoun kyay (걔), gender-neutral, derived from ku ai ('that child').
- fullNP : KoreanRefForm
Full NP — demonstrative + noun (e.g., ku chinkwu 'that friend').
Instances For
Equations
- KwonLee2026.instDecidableEqKoreanRefForm x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- KwonLee2026.instReprKoreanRefForm = { reprPrec := KwonLee2026.instReprKoreanRefForm.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Equations
- KwonLee2026.instBEqKoreanRefForm.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
Map each Korean form to its position on @cite{ariel-2001}'s 18-level scale.
kyay maps to unstressedPron rather than distalDem because, although
historically derived from a demonstrative, it functions synchronically
as a 3rd-person pronoun in spoken Korean and lacks the deictic force
of a true demonstrative (@cite{kwon-lee-2026} §5).
The full-NP condition uses demonstrative + noun rather than a bare name or definite description, because Korean lacks definite articles.
Equations
Instances For
The accessibility rank of a Korean form (the rank of its
AccessibilityLevel image), used to lift the universal accessibility
ordering onto KoreanRefForm.
Equations
- f.rank = f.toAccessibility.rank
Instances For
KoreanRefForm inherits a LinearOrder from @cite{ariel-2001}'s
accessibility scale via the rank pullback. The induced order is
fullNP < overt < nullPro — more accessible forms are larger.
This lets every monotonicity claim about Korean forms be expressed
as a single StrictMono lemma rather than per-pair inequalities.
Equations
Bridge to the Korean fragment: the overt form's surface realization
is the colloquial pronoun gyae (Yale: kyay) in
Fragments.Korean.Pronouns. Derived from the fragment field — not
duplicated.
Equations
- KwonLee2026.KoreanRefForm.nullPro.surface = none
- KwonLee2026.KoreanRefForm.overt.surface = some Fragments.Korean.Pronouns.gyae.form
- KwonLee2026.KoreanRefForm.fullNP.surface = some "ku chinkwu"
Instances For
Attenuation (phonological reduction) is strictly increasing in the
accessibility order on Korean forms: more accessible forms are more
reduced. (Subsumes the previous per-pair attenuation theorems via
StrictMono.lt_iff_lt.)
Informativity is antitone in accessibility: more accessible forms
are less informative (≤, not <, because @cite{ariel-2001}'s scale
collapses distalDemNP and unstressedPron at informativity 1).
Antecedent-choice rates, from Figure 3 of @cite{kwon-lee-2026}. Globally ambiguous discourse contexts (two same-gender personal names), so neither semantic plausibility nor gender cues disambiguate. Form alone drives interpretation.
- form : KoreanRefForm
- subjectPercent : ℕ
Percentage choosing the subject antecedent (0–100).
- objectPercent : ℕ
Percentage choosing the object antecedent (0–100).
Instances For
Equations
Equations
- One or more equations did not get rendered due to their size.
Instances For
Exp 3, pro: 70.6% subject, 29.4% object.
Equations
- KwonLee2026.exp3_pro = { form := KwonLee2026.KoreanRefForm.nullPro, subjectPercent := 71, objectPercent := 29 }
Instances For
Exp 3, kyay: 42.8% subject, 57.2% object.
Equations
- KwonLee2026.exp3_overt = { form := KwonLee2026.KoreanRefForm.overt, subjectPercent := 43, objectPercent := 57 }
Instances For
Exp 3, full NP: 35.3% subject, 64.7% object.
Equations
- KwonLee2026.exp3_fullNP = { form := KwonLee2026.KoreanRefForm.fullNP, subjectPercent := 35, objectPercent := 65 }
Instances For
Subject-antecedent bias for each Korean form, derived from the Exp 3
records. Defined as a function so the central monotonicity claim
can be expressed as StrictMono.
Equations
Instances For
Object-antecedent bias for each Korean form.
Equations
Instances For
The Exp 3 task forces a binary subject/object choice, so for each form the two percentages sum to 100.
Central claim of @cite{kwon-lee-2026}: subject-antecedent bias is
strictly monotone in accessibility — more accessible (higher-rank)
forms attract subject antecedents more strongly. This single
StrictMono lemma subsumes per-pair claims like
subjectBias .nullPro > subjectBias .overt (which follow via
StrictMono.lt_iff_lt applied to fullNP_lt_overt/overt_lt_nullPro).
Form–function correlation in one line: more accessible form ↔ more accessible antecedent.
Mirror image: object-antecedent bias is antitone in accessibility. Full NPs are the most object-biased; null pronouns the least.
Three-way distinction: corollary of subjectBias_strictMono —
the three forms have three distinct subject-bias values. Rules out
the alternative that Korean has only a binary null/non-null contrast
(which @cite{kweon-2011} suggested for the older overt pronoun
ku/kunye).
Naturalness ratings (Table 5) on the 1–7 Likert scale. The three forms are essentially identical (5.3, 5.3, 5.4; n.s.). When the form is coindexed with its preferred antecedent, all three are equally natural. The accessibility distinction surfaces in interpretation (antecedent choice), not in raw acceptability.
Equations
Instances For
Exp 1 naturalness ratings (Table 1) on the 1–7 Likert scale. With only one available antecedent, the highest-accessibility marker (null pro) is the most natural. The overt pronoun and full NP do not differ significantly (β = 0.19, n.s.).
Equations
Instances For
Null is most natural with a single highly-accessible antecedent. Predicted by Accessibility Theory: when only one referent is salient, its mental representation is maximally accessible, so the maximally reduced form is the felicitous choice.
The overt-vs-full-NP boundary is gradient in single-antecedent contexts. The two forms do not differ significantly in Exp 1 — the accessibility distinction collapses when only one antecedent is available. @cite{kwon-lee-2026} interpret this as evidence that adjacent markers on the scale need not exhibit categorical distinctions across all contexts (consistent with @cite{ariel-2001}). For the concrete values, full NP is rated slightly higher than overt by less than 0.1 Likert points.
Exp 2: comprehension accuracy when contextual gender bias points to a particular antecedent. The accuracy gap across contexts is the diagnostic of accessibility sensitivity.
Subject-biased contexts: gender cue points to subject; null pronoun accuracy is near-ceiling (92.9%) because the form-cue (null → subject) aligns with the gender cue.
Object-biased contexts: gender cue points to object, contradicting the form-cue for null. Accuracy drops to 60.3% — null pronouns resist the contextual override. Other forms show no asymmetry.
- form : KoreanRefForm
- subjectBiasedAccuracy : ℕ
Accuracy in subject-biased context (%).
- objectBiasedAccuracy : ℕ
Accuracy in object-biased context (%).
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Figure 1 of @cite{kwon-lee-2026}.
Equations
- KwonLee2026.exp2_pro = { form := KwonLee2026.KoreanRefForm.nullPro, subjectBiasedAccuracy := 93, objectBiasedAccuracy := 60 }
Instances For
Equations
- KwonLee2026.exp2_overt = { form := KwonLee2026.KoreanRefForm.overt, subjectBiasedAccuracy := 81, objectBiasedAccuracy := 78 }
Instances For
Equations
- KwonLee2026.exp2_fullNP = { form := KwonLee2026.KoreanRefForm.fullNP, subjectBiasedAccuracy := 79, objectBiasedAccuracy := 80 }
Instances For
The accuracy gap between subject-biased and object-biased contexts, a measure of how strongly the form's interpretive bias resists the gender-cue override.
Equations
- c.contextSensitivity = max c.subjectBiasedAccuracy c.objectBiasedAccuracy - min c.subjectBiasedAccuracy c.objectBiasedAccuracy
Instances For
Null pronouns drop ~33 accuracy points when the context bias contradicts their default subject-antecedent preference (Figure 1). Direct evidence that null pronouns encode strong subject-antecedent expectations even in the comprehension component.
Overt pronouns show essentially no asymmetry across context biases.
Full NPs show essentially no asymmetry across context biases.
Null is strictly more context-sensitive than overt — exceeding it by over 25 percentage points.
Null is strictly more context-sensitive than full NP.
Naturalness ratings on the 1–7 Likert scale for Exp 2, broken out by context bias. The naturalness data mirrors the comprehension accuracy data: only null pronouns show an asymmetry between subject-biased (4.58) and object-biased (3.94) contexts.
This dual confirmation — same asymmetry in two independent dependent measures (interpretation accuracy AND felicity judgment) — is the paper's strongest evidence that null pronouns carry an interpretive bias that goes beyond mere preference.
- form : KoreanRefForm
- subjectBiased : ℚ
Naturalness in subject-biased context (1–7 Likert).
- objectBiased : ℚ
Naturalness in object-biased context (1–7 Likert).
Instances For
Equations
- KwonLee2026.instReprExp2Naturalness = { reprPrec := KwonLee2026.instReprExp2Naturalness.repr }
Equations
- One or more equations did not get rendered due to their size.
Instances For
Figure 2 of @cite{kwon-lee-2026}.
Equations
- KwonLee2026.exp2nat_pro = { form := KwonLee2026.KoreanRefForm.nullPro, subjectBiased := 458 / 100, objectBiased := 394 / 100 }
Instances For
Equations
- KwonLee2026.exp2nat_overt = { form := KwonLee2026.KoreanRefForm.overt, subjectBiased := 462 / 100, objectBiased := 442 / 100 }
Instances For
Equations
- KwonLee2026.exp2nat_fullNP = { form := KwonLee2026.KoreanRefForm.fullNP, subjectBiased := 433 / 100, objectBiased := 456 / 100 }
Instances For
Equations
- n.contextSensitivity = max n.subjectBiased n.objectBiased - min n.subjectBiased n.objectBiased
Instances For
Naturalness mirrors comprehension: only the null pronoun shows a large naturalness asymmetry across context biases (>0.50 Likert points, β = −1.06, p = .028 in the paper). The overt and full NP forms show no significant asymmetry.
The two Exp 2 dependent measures (accuracy and naturalness) agree on the same asymmetry pattern: null is the only form whose felicity drops when context conflicts with its interpretive bias. This converging evidence is the cornerstone of the paper's argument.
Naturalness ratings cross-tabulated with comprehension correctness (paper §3.2.2, p. 16): trials where participants chose the intended antecedent received higher naturalness ratings (M = 4.40) than trials where they chose the unintended antecedent (M = 4.05). Effect: β = 0.38, SE = 0.13, z = 3.05, p = .002.
This is the paper's most direct evidence that the form-function correlation is psychologically real (not just an experimental artifact): listeners who heard a form and computed an antecedent that didn't match the speaker's intent also perceived the sentence as less natural. The two measures co-vary at the trial level.
Equations
- KwonLee2026.correctTrial_naturalness = 440 / 100
Instances For
Form-function correlation is psychologically real: when the listener's chosen antecedent matches the speaker's intent (correct trial), the sentence is rated more natural than when it doesn't. This validates the form-function link as more than an experimental artifact — it tracks the listener's online interpretive process.
The gap is non-trivial (≈ 0.35 Likert points), within the range where the paper reports significance (β = 0.38).
A language's calibration of @cite{ariel-2001}'s accessibility scale: the empirical subject-antecedent bias of each referential form in a globally ambiguous two-antecedent context.
This is the structure that lets us compare how different languages instantiate the same universal ordering. The relative ordering (null > overt > [full NP]) is preserved, but the spread varies.
- language : String
- nullSubjectPercent : ℕ
P(subject antecedent | null pronoun), as a percentage 0–100.
- overtSubjectPercent : Option ℕ
P(subject antecedent | overt pronoun).
noneif the language was not tested with overt pronouns or has no overt 3sg pronoun. - fullNPSubjectPercent : Option ℕ
P(subject antecedent | full NP).
noneif not tested.
Instances For
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Italian, @cite{carminati-2002}: null = 80.72%, overt = 100% − 83.33% = 16.67%. The cleanest division of labor of any language tested (Position of Antecedent Hypothesis).
Equations
- KwonLee2026.italian = { language := "Italian", nullSubjectPercent := 81, overtSubjectPercent := some 17, fullNPSubjectPercent := none }
Instances For
Spanish, @cite{contemori-di-domenico-2021}: null = 62%, overt = 100% − 58% = 42%. Weaker division of labor than Italian.
Equations
- KwonLee2026.spanish = { language := "Spanish", nullSubjectPercent := 62, overtSubjectPercent := some 42, fullNPSubjectPercent := none }
Instances For
Chinese, @cite{zhang-kwon-2022}: null = 84%, overt = 65.3%. Both pronoun types show subject bias; the overt form does not flip to object bias as in Italian.
Equations
- KwonLee2026.chinese = { language := "Chinese", nullSubjectPercent := 84, overtSubjectPercent := some 65, fullNPSubjectPercent := none }
Instances For
Korean (this paper's Exp 3, overt = colloquial kyay). The first cross-linguistic dataset that includes full NPs alongside null and overt pronouns.
Equations
- One or more equations did not get rendered due to their size.
Instances For
Korean (@cite{kweon-2011}, overt = literary ku/kunye). 12-item questionnaire study; null = 81.1%, overt = 31.4% subject (so 68.6% object). Resembles Italian's clean division of labor — Kweon interpreted this as supporting Carminati's PAH.
Equations
- KwonLee2026.korean_kweon = { language := "Korean (Kweon 2011, ku/kunye)", nullSubjectPercent := 81, overtSubjectPercent := some 31, fullNPSubjectPercent := none }
Instances For
Korean (@cite{choe-2021}, overt = literary ku/kunye). 40-target / 24-filler study; null = 91%, overt = 73% subject. Both forms subject-biased; little division of labor. Diverges sharply from Kweon. The paper attributes the discrepancy to methodological differences (filler ratio, ambiguity verification).
Equations
- KwonLee2026.korean_choe = { language := "Korean (Choe 2021, ku/kunye)", nullSubjectPercent := 91, overtSubjectPercent := some 73, fullNPSubjectPercent := none }
Instances For
Equations
Instances For
All Korean profiles.
Equations
Instances For
Universal ordering preserved: in every language tested, null pronouns are at least as subject-biased as overt pronouns. This is the universal claim of @cite{ariel-2001}: the relative ordering holds even when the magnitudes vary.
Cross-linguistic granularity varies: Italian shows ≥60-point spread between null and overt; Spanish 20; Chinese 19; Korean 28. The same theory accounts for all four, with language-specific calibration of the spread.
Equations
- p.nullOvertSpread = match p.overtSubjectPercent with | some o => p.nullSubjectPercent - o | none => 0
Instances For
Korean is the only language tested with full NPs: a unique methodological contribution of @cite{kwon-lee-2026}. The full-NP bias (35% subject ↔ 65% object) extends Accessibility Theory's test set to a wider range of forms than prior cross-linguistic work.
Robust within-Korean finding: every Korean study agrees that null pronouns are subject-biased. The variation is entirely in the strength of the bias (and in the overt-pronoun behavior).
The Kweon vs Choe disagreement is one of the paper's framing motivations. Kweon (small item set) shows clean object-bias for overt (~31% subject); Choe (unusually low filler ratio) shows subject-bias (73%). These cannot both be representative of the same underlying competence. The paper attributes the gap to methodological factors.
The Kweon-Choe gap exceeds 40 percentage points.
@cite{kwon-lee-2026}'s Exp 3 finding (43% subject for kyay) lies between Kweon (31%) and Choe (73%). The paper takes this as suggesting Kweon was directionally correct (overt is object-biased in Korean) but that the magnitude depends on the form: kyay is less rigidly object-biased than ku/kunye, consistent with kyay's higher position on the accessibility scale (closer to null).
The relative ordering (null > overt) holds for every Korean study, despite the disagreement on magnitudes. This is exactly @cite{ariel-2001}'s universal: the ordering is invariant; the spread is methodologically/contextually labile.
Distance between two forms on @cite{ariel-2001}'s accessibility scale, measured as the absolute difference of their ranks. Larger distance = further apart on the universal ordering.
Equations
- a.accessibilityDistance b = max a.toAccessibility.rank b.toAccessibility.rank - min a.toAccessibility.rank b.toAccessibility.rank
Instances For
Subject-bias spread between two forms in Exp 3 (absolute difference of subject-choice percentages). Larger spread = stronger empirical distinction between the two forms.
Equations
- KwonLee2026.biasSpread a b = max a.subjectPercent b.subjectPercent - min a.subjectPercent b.subjectPercent
Instances For
Triangle-inequality-like prediction: the extreme pair (null vs full NP) has the largest accessibility distance and the largest empirical bias spread. This is a derived prediction of @cite{ariel-2001}'s ordinal scale — it follows from the rank ordering of the three forms, not from any data-fitting. The four sub-theorems below state each pairwise comparison separately.
Non-uniform calibration — the paper's deepest empirical finding (Discussion): the null↔overt step (3 ranks → 28-point bias spread) is much steeper than the overt↔full-NP step (6 ranks → 8-point spread). Korean's accessibility scale has a steep cliff at the null/non-null boundary and a shallow slope among non-null forms. Exactly what @cite{ariel-2001} predicts (§4.2) about language-specific calibration: only the ordering is universal, not the magnitudes.
The accessibility-distance step from null to overt (3 ranks) is smaller than from overt to full NP (6 ranks).
Yet the empirical bias spread shows the opposite pattern: the smaller-distance pair (null↔overt) has the larger bias spread. The scale is calibrated non-uniformly in Korean.
The null↔overt spread is large (>25 points) — the "cliff" at the null/non-null boundary.
The overt↔fullNP spread is small (<15 points) — the "shallow slope" among non-null forms.
@cite{kehler-rohde-2013} decompose pronoun interpretation as:
P(referent | pronoun) ∝ P(pronoun | referent) × P(referent)
The production component P(pronoun | referent) is conditioned by topichood — speakers use reduced forms for topical referents. The Korean data slot directly into this framework: the canonical Korean topic position is the (typically subject-marked) sentence-initial position, so subjects have high topichood and license null forms.
The cross-linguistic variation in spread (§ 5) reflects how strongly each language's null form encodes topichood relative to other forms.
Equations
Instances For
Korean subjects are the default topichood level (subject of an active clause). Null pronouns mark high accessibility, which @cite{kehler-rohde-2013} derive from high topichood.
Carminati's Position of Antecedent Hypothesis (@cite{carminati-2002}): null pronouns prefer antecedents in syntactically prominent positions (canonically Spec-IP, the preverbal subject position); overt pronouns prefer non-prominent positions.
PAH is a structural theory: prominence is determined by syntactic position, not by discourse accessibility. This contrasts with Accessibility Theory's cognitive/discourse-based prominence. In configurational SVO languages where subject = Spec-IP, the two theories make identical predictions; they diverge in topic-prominent languages where the topic position is structurally distinct from Spec-IP.
@cite{kwon-lee-2026} fn. 1 takes the position that PAH and AT are compatible — PAH being a structural special case that happens to coincide with AT in canonical configurations.
- specIP : SyntacticPosition
Spec-IP: preverbal subject position. PAH's "prominent" position.
- lowerIP : SyntacticPosition
Below Spec-IP: object, complement, adjunct positions.
Instances For
Equations
- KwonLee2026.instDecidableEqSyntacticPosition x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Equations
- One or more equations did not get rendered due to their size.
Instances For
Equations
Equations
Equations
- KwonLee2026.instBEqSyntacticPosition.beq x✝ y✝ = (x✝.ctorIdx == y✝.ctorIdx)
Instances For
The PAH-predicted antecedent position for each Korean form. Null prefers Spec-IP; overt avoids it (overt → non-Spec-IP). The PAH does not directly address full NPs (it was formulated for the null/overt pronoun contrast in Italian).
Equations
Instances For
Convergence theorem: PAH and AT make the same prediction for null pronouns in canonical Korean SVO — both predict the subject antecedent (subject = Spec-IP in the experimental stimuli). This is why @cite{kwon-lee-2026}'s Exp 3 data cannot distinguish the two theories.
Divergence point: in a topic-prominent configuration (e.g., a sentence with a topicalized object in sentence-initial position), the topic is NOT in Spec-IP. PAH would still predict null → Spec-IP (= the in-situ subject), while AT would predict null → topic (= the most-accessible referent regardless of structural position).
The Korean experiments cannot test this because all stimuli used canonical SVO order. Disambiguating PAH from AT requires testing topicalization configurations — an empirical extension explicitly flagged by @cite{kwon-lee-2026} (Discussion).
PAH does not address full NPs — it was formulated for pronouns only. AT, by contrast, places full NPs at the bottom of the accessibility scale and predicts the inverse bias (object antecedent). The Exp 3 full-NP finding (35% subject ↔ 65% object) is therefore predicted by AT alone, not by PAH. The paper's inclusion of full NPs is what lets the data adjudicate between the two theories.
Exp 1: a single antecedent in same-clause topic position. Maximally accessible — no competition, tight unity, recently mentioned, topical. Predicts the form should not need to disambiguate.
Equations
- KwonLee2026.exp1_assessment = { distance := 0, topicality := 2, competition := 0, unity := 2 }
Instances For
Exp 2 & 3: two antecedents in same clause. Competition = 1 (one additional candidate). The two experiments differ in whether additional cues (gender in Exp 2) disambiguate, but the accessibility-theoretic competition level is identical.
@cite{ariel-2001}'s AccessibilityAssessment does not have a
field for "disambiguating cue", which is the gap that the Exp 2
naturalness × accuracy correlation (§ 4c) helps fill.
Equations
- KwonLee2026.exp2_3_assessment = { distance := 0, topicality := 2, competition := 1, unity := 2 }
Instances For
Distance is held constant across experiments.
Topicality is held constant.
Unity is held constant.
The manipulation isolates competition: across the three
experiments, only the competition field varies; distance, topicality,
and unity are held constant. This makes the experimental design a clean
test of how competition affects form-function visibility.
Exp 1 has higher accessibility than Exp 2/3 — the referent is more accessible when there's no competition.
Empirical asymmetry consistent with the theory: in the
high-accessibility Exp 1 setting (one antecedent), the
form-function distinction collapses (exp1_overt_fullNP_close).
In the lower-accessibility Exp 3 setting (competition), the
distinction emerges as a clean three-way split
(accessibility_predicts_subject_bias).
This is captured by the assessment-score difference: form-bias strength is inversely correlated with the per-referent accessibility score, because forms only need to disambiguate when there is ambiguity to resolve.
@cite{ariel-2001}'s three form-function criteria all line up with
accessibility for the Korean forms. The criteria don't perfectly
pull apart at every adjacent level (informativity collapses
distalDemNP and unstressedPron at 1), but the overall pattern
holds: the more accessible form is less informative, less rigid,
and more attenuated.
Informativity: full NP ≥ overt (≥ because Ariel's coarse scale
collapses distalDemNP and unstressedPron).
Informativity: overt > null.
Rigidity: full NP (demonstrative + noun) ≥ overt.
Attenuation: null > overt — the null pronoun has no phonological exponent, the overt pronoun has one syllable.
Attenuation: overt > full NP — the demonstrative + noun form has more phonological material than a single pronoun.