TY - GEN
T1 - Pragmatic Markers in Dialogue and Monologue
T2 - 22nd International Conference on Speech and Computer, SPECOM 2020
AU - Bogdanova-Beglarian, Natalia
AU - Blinova, Olga
AU - Sherstinova, Tatiana
AU - Gorbunova, Daria
AU - Zaides, Kristina
AU - Popova, Tatiana
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - The paper deals with new research findings on pragmatic markers (PMs) use in spoken Russian. The study is based on two speech corpora: “One Day of Speech” (ORD, which contains mainly dialogues), and “Balanced Annotated Collection of Texts” (SAT, which contains only monologues). We explored two annotated subcorpora consisting of 321,504 tokens and 50,128 tokens respectively. The main results are as follows: 1) the extended frequency lists of PMs were formed; 2) PMs, that are frequently used in both types of speech, were identified (e.g., hesitation markers like tam ‘there’, tak ‘that way’), 3) the list of PMs, used primarily in monologue speech, was compiled (in this list there are such PMs as boundary ones znachit ‘well’, nu vot ‘well er’, vs’o ‘that’s all’); 4) the list of PMs, used primarily in dialogues, was made (among such PMs are, for example, “xeno”-markers takoj ‘like’, grit ‘says’ and meta-communicative markers like vidish’ ‘you know’, (ja) ne znaju ‘don’t know’). Particular attention was paid to the variability of pragmatic markers, as well as to complex cases of their identification. Finally, the most common models of pragmatic markers formation (for single-word and multi-word PMs) were revealed.
AB - The paper deals with new research findings on pragmatic markers (PMs) use in spoken Russian. The study is based on two speech corpora: “One Day of Speech” (ORD, which contains mainly dialogues), and “Balanced Annotated Collection of Texts” (SAT, which contains only monologues). We explored two annotated subcorpora consisting of 321,504 tokens and 50,128 tokens respectively. The main results are as follows: 1) the extended frequency lists of PMs were formed; 2) PMs, that are frequently used in both types of speech, were identified (e.g., hesitation markers like tam ‘there’, tak ‘that way’), 3) the list of PMs, used primarily in monologue speech, was compiled (in this list there are such PMs as boundary ones znachit ‘well’, nu vot ‘well er’, vs’o ‘that’s all’); 4) the list of PMs, used primarily in dialogues, was made (among such PMs are, for example, “xeno”-markers takoj ‘like’, grit ‘says’ and meta-communicative markers like vidish’ ‘you know’, (ja) ne znaju ‘don’t know’). Particular attention was paid to the variability of pragmatic markers, as well as to complex cases of their identification. Finally, the most common models of pragmatic markers formation (for single-word and multi-word PMs) were revealed.
KW - Corpus annotation
KW - Dialogue
KW - Monologue
KW - Pragmatic marker
KW - Russian everyday speech
KW - Speech corpus
UR - http://www.scopus.com/inward/record.url?scp=85092942126&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-60276-5_7
DO - 10.1007/978-3-030-60276-5_7
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85092942126
SN - 9783030602758
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 68
EP - 78
BT - Speech and Computer - 22nd International Conference, SPECOM 2020, Proceedings
A2 - Karpov, Alexey
A2 - Potapova, Rodmonga
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 7 October 2020 through 9 October 2020
ER -