TY - JOUR
T1 - The counting principle makes number words unique
AU - Ariel, Mira
AU - Levshina, Natalia
N1 - Publisher Copyright:
© 2024 Walter de Gruyter GmbH, Berlin/Boston.
PY - 2025/2/1
Y1 - 2025/2/1
N2 - Following Ariel (2021. Why it's hard to construct ad hoc number concepts. In Caterina Mauri, Ilaria Fiorentini, & Eugenio Goria (eds.), Building categories in interaction: Linguistic resources at work, 439-462. Amsterdam: John Benjamins), we argue that number words manifest distinct distributional patterns from open-class lexical items. When modified, open-class words typically take selectors (as in kinda table), which select a subset of their potential denotations (e.g., "nonprototypical table"). They are typically not modified by loosening operators (e.g., approximately), since even if bare, typical lexemes can broaden their interpretation (e.g., table referring to a rock used as a table). Number words, on the other hand, have a single, precise meaning and denotation and cannot take a selector, which would need to select a subset of their (single) denotation (kinda seven). However, they are often overtly broadened (approximately seven), creating a range of values around N. First, we extend Ariel's empirical examination to the larger COCA and to Hebrew (HeTenTen). Second, we propose that open-class and number words belong to sparse versus dense lexical domains, respectively, because the former exhibit prototypicality effects, but the latter do not. Third, we further support the contrast between sparse and dense domains by reference to: synchronic word2vec models of sparse and dense lexemes, which testify to their differential distributions, numeral use in noncounting communities, and different renewal rates for the two lexical types.
AB - Following Ariel (2021. Why it's hard to construct ad hoc number concepts. In Caterina Mauri, Ilaria Fiorentini, & Eugenio Goria (eds.), Building categories in interaction: Linguistic resources at work, 439-462. Amsterdam: John Benjamins), we argue that number words manifest distinct distributional patterns from open-class lexical items. When modified, open-class words typically take selectors (as in kinda table), which select a subset of their potential denotations (e.g., "nonprototypical table"). They are typically not modified by loosening operators (e.g., approximately), since even if bare, typical lexemes can broaden their interpretation (e.g., table referring to a rock used as a table). Number words, on the other hand, have a single, precise meaning and denotation and cannot take a selector, which would need to select a subset of their (single) denotation (kinda seven). However, they are often overtly broadened (approximately seven), creating a range of values around N. First, we extend Ariel's empirical examination to the larger COCA and to Hebrew (HeTenTen). Second, we propose that open-class and number words belong to sparse versus dense lexical domains, respectively, because the former exhibit prototypicality effects, but the latter do not. Third, we further support the contrast between sparse and dense domains by reference to: synchronic word2vec models of sparse and dense lexemes, which testify to their differential distributions, numeral use in noncounting communities, and different renewal rates for the two lexical types.
KW - distributional semantics
KW - lexical domains
KW - numerals
KW - numeric expressions
KW - prototypes
KW - word2vec
UR - http://www.scopus.com/inward/record.url?scp=85189139531&partnerID=8YFLogxK
U2 - 10.1515/cllt-2023-0105
DO - 10.1515/cllt-2023-0105
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 39925873
AN - SCOPUS:85189139531
SN - 1613-7027
VL - 21
SP - 173
EP - 199
JO - Corpus Linguistics and Linguistic Theory
JF - Corpus Linguistics and Linguistic Theory
IS - 1
ER -