[WWL+23]: Numeral Frequency and Roundness #
Corpus study showing that numeral frequency is predicted by (a) log magnitude and (b) graded roundness, measured with the k-ness properties of Sigurd and of Jansen & Pollmann.
Key finding: each k-ness property has an independent positive effect on frequency, with 10-ness being the strongest predictor and multipleOf5 the weakest.
Register Effect #
Informational texts (Wikipedia) show stronger roundness effects than non-informational texts (fiction, conversation), suggesting roundness interacts with communicative goals.
Regression coefficient for each k-ness property on log frequency.
Higher β = stronger positive effect on numeral frequency in corpora. All coefficients are positive: each property independently increases frequency.
- property : String
- β : ℚ
β coefficients ordered by magnitude (§4.2).
Equations
- WoodinEtAl2024.β_tenness = { property := "10-ness", β := 446 / 100 }
- WoodinEtAl2024.β_2_5ness = { property := "2.5-ness", β := 384 / 100 }
- WoodinEtAl2024.β_5ness = { property := "5-ness", β := 339 / 100 }
- WoodinEtAl2024.β_2ness = { property := "2-ness", β := 274 / 100 }
- WoodinEtAl2024.β_mult10 = { property := "multipleOf10", β := 245 / 100 }
- WoodinEtAl2024.β_mult5 = { property := "multipleOf5", β := 6 / 100 }
The 6 coefficients in descending order.
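The ordering claim can be checked mechanically. The following is a minimal sketch, assuming Mathlib is available for `ℚ` and `norm_num`; the namespace `Sketch`, the structure name `RoundnessCoefficient`, and the list name `allCoefficients` are hypothetical stand-ins, since the rendered page does not show the original declaration names. Only the six property/β pairs are taken from the listing above.

```lean
import Mathlib

namespace Sketch

/-- Hypothetical name: a k-ness property paired with its regression coefficient. -/
structure RoundnessCoefficient where
  property : String
  β : ℚ

-- The six values as listed in the documentation above.
def β_tenness : RoundnessCoefficient := { property := "10-ness", β := 446 / 100 }
def β_2_5ness : RoundnessCoefficient := { property := "2.5-ness", β := 384 / 100 }
def β_5ness   : RoundnessCoefficient := { property := "5-ness", β := 339 / 100 }
def β_2ness   : RoundnessCoefficient := { property := "2-ness", β := 274 / 100 }
def β_mult10  : RoundnessCoefficient := { property := "multipleOf10", β := 245 / 100 }
def β_mult5   : RoundnessCoefficient := { property := "multipleOf5", β := 6 / 100 }

/-- Hypothetical name: the six coefficients, strongest first. -/
def allCoefficients : List RoundnessCoefficient :=
  [β_tenness, β_2_5ness, β_5ness, β_2ness, β_mult10, β_mult5]

-- Each adjacent pair is strictly decreasing, so the list is in descending order.
example : β_tenness.β > β_2_5ness.β := by norm_num [β_tenness, β_2_5ness]
example : β_2_5ness.β > β_5ness.β := by norm_num [β_2_5ness, β_5ness]
example : β_5ness.β > β_2ness.β := by norm_num [β_5ness, β_2ness]
example : β_2ness.β > β_mult10.β := by norm_num [β_2ness, β_mult10]
example : β_mult10.β > β_mult5.β := by norm_num [β_mult10, β_mult5]

-- Every coefficient is positive; the smallest one suffices given the chain above.
example : β_mult5.β > 0 := by norm_num [β_mult5]

end Sketch
```

Storing the coefficients as exact rationals (`446 / 100` rather than a float) is what lets `norm_num` decide these comparisons outright.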
Frequency-weighted roundness score #
Frequency-weighted roundness score using Woodin et al.'s β coefficients.
Unlike the unweighted roundnessScore (which counts properties equally),
this weights each property by its empirical frequency effect.
weightedRoundnessScore 50 > 0: 50 has multipleOf5, multipleOf10,
2.5-ness, 5-ness, and 10-ness (50 = 5 × 10¹), so its weighted score is
the strictly positive sum of those β coefficients.
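As a worked illustration of that sum, here is a minimal sketch, assuming Mathlib. It does not reproduce the module's actual `weightedRoundnessScore` (its equation is not rendered on this page); instead it takes the property membership as explicit Boolean flags. The names `RoundnessFlags`, `weight`, `weightedScore`, and `flags50` are hypothetical, and the flags for 50 simply follow the list in the text (every property except 2-ness).

```lean
import Mathlib

namespace Sketch

/-- Hypothetical record of which roundness properties a numeral has. -/
structure RoundnessFlags where
  tenness      : Bool
  twoFiveness  : Bool
  fiveness     : Bool
  twoness      : Bool
  multipleOf10 : Bool
  multipleOf5  : Bool

/-- Contribute `w` when the property holds, and nothing otherwise. -/
def weight (flag : Bool) (w : ℚ) : ℚ := if flag then w else 0

/-- Sum of the β coefficients of the properties that hold. -/
def weightedScore (f : RoundnessFlags) : ℚ :=
  weight f.tenness (446 / 100) + weight f.twoFiveness (384 / 100) +
  weight f.fiveness (339 / 100) + weight f.twoness (274 / 100) +
  weight f.multipleOf10 (245 / 100) + weight f.multipleOf5 (6 / 100)

/-- Flags for 50 as described in the text: all properties except 2-ness. -/
def flags50 : RoundnessFlags :=
  { tenness := true, twoFiveness := true, fiveness := true,
    twoness := false, multipleOf10 := true, multipleOf5 := true }

-- 4.46 + 3.84 + 3.39 + 2.45 + 0.06 = 14.20 = 71/5
example : weightedScore flags50 = 71 / 5 := by
  norm_num [weightedScore, weight, flags50]

-- Positivity follows immediately from the explicit value.
example : weightedScore flags50 > 0 := by
  norm_num [weightedScore, weight, flags50]

end Sketch
```

An unweighted count over the same flags would give 50 a score of 5, whereas the weighted version is dominated by the 10-ness, 2.5-ness, and 5-ness terms and barely moved by multipleOf5.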
RSA utterance prior from corpus frequency. Because rounder numerals receive
a higher weighted score, weightedRoundnessScore doubles as an
empirically grounded RSA utterance prior: all else equal, a rounder
numeral is more likely to be chosen. The strict chain
weightedRoundnessScore 100 > weightedRoundnessScore 50 > weightedRoundnessScore 7
illustrates this on representative cases.
Register effect data #
Register type from corpus analysis.
Informational registers (Wikipedia) show stronger roundness effects than non-informational registers (fiction, conversation).
Equations
- WoodinEtAl2024.instReprRegister = { reprPrec := WoodinEtAl2024.instReprRegister.repr }
- WoodinEtAl2024.instDecidableEqRegister x✝ y✝ = if h : x✝.ctorIdx = y✝.ctorIdx then isTrue ⋯ else isFalse ⋯
Register effect datum: roundness β is larger in informational texts.
- register : Register
- roundnessEffectMagnitude : String
- notes : String
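For orientation, declarations of roughly the following shape would produce the `Repr` and `DecidableEq` instances and the three fields listed above. This is a sketch under assumptions: the constructor names `informational` and `nonInformational` and the structure name `RegisterEffect` are hypothetical, since the rendered page shows neither; only the field names and types come from the listing.

```lean
namespace Sketch

/-- Register type; the constructor names are assumptions, not taken from the source. -/
inductive Register where
  | informational      -- e.g. Wikipedia
  | nonInformational   -- e.g. fiction, conversation
  deriving Repr, DecidableEq

/-- Hypothetical structure name; the fields match the documented listing. -/
structure RegisterEffect where
  register : Register
  roundnessEffectMagnitude : String
  notes : String

-- The derived `DecidableEq` instance lets `decide` settle constructor comparisons.
example : Register.informational ≠ Register.nonInformational := by decide

end Sketch
```

The `deriving Repr, DecidableEq` clause is what generates instances named like `instReprRegister` and `instDecidableEqRegister`.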