Pragmatic Markers in Dialogue and Monologue: Difficulties of Identification and Typical Formation Models

Natalia Bogdanova-Beglarian, Olga Blinova, Tatiana Sherstinova*, Daria Gorbunova, Kristina Zaides, Tatiana Popova

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The paper deals with new research findings on pragmatic markers (PMs) use in spoken Russian. The study is based on two speech corpora: “One Day of Speech” (ORD, which contains mainly dialogues), and “Balanced Annotated Collection of Texts” (SAT, which contains only monologues). We explored two annotated subcorpora consisting of 321,504 tokens and 50,128 tokens respectively. The main results are as follows: 1) the extended frequency lists of PMs were formed; 2) PMs, that are frequently used in both types of speech, were identified (e.g., hesitation markers like tam ‘there’, tak ‘that way’), 3) the list of PMs, used primarily in monologue speech, was compiled (in this list there are such PMs as boundary ones znachit ‘well’, nu vot ‘well er’, vs’o ‘that’s all’); 4) the list of PMs, used primarily in dialogues, was made (among such PMs are, for example, “xeno”-markers takoj ‘like’, grit ‘says’ and meta-communicative markers like vidish’ ‘you know’, (ja) ne znaju ‘don’t know’). Particular attention was paid to the variability of pragmatic markers, as well as to complex cases of their identification. Finally, the most common models of pragmatic markers formation (for single-word and multi-word PMs) were revealed.

Original languageEnglish
Title of host publicationSpeech and Computer - 22nd International Conference, SPECOM 2020, Proceedings
EditorsAlexey Karpov, Rodmonga Potapova
PublisherSpringer Science and Business Media Deutschland GmbH
Pages68-78
Number of pages11
ISBN (Print)9783030602758
DOIs
StatePublished - 2020
Externally publishedYes
Event22nd International Conference on Speech and Computer, SPECOM 2020 - St. Petersburg, Russian Federation
Duration: 7 Oct 20209 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12335 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference22nd International Conference on Speech and Computer, SPECOM 2020
Country/TerritoryRussian Federation
CitySt. Petersburg
Period7/10/209/10/20

Keywords

  • Corpus annotation
  • Dialogue
  • Monologue
  • Pragmatic marker
  • Russian everyday speech
  • Speech corpus

Fingerprint

Dive into the research topics of 'Pragmatic Markers in Dialogue and Monologue: Difficulties of Identification and Typical Formation Models'. Together they form a unique fingerprint.

Cite this