Large vocabulary natural language continuous speech recognition

L. R. Bahl*, R. Bakis, J. Bellegarda, P. F. Brown, D. Burshtein, S. K. Das, P. V. de Souza, P. S. Gopalakrishnan, F. Jelinek, D. Kanevsky, R. L. Mercer, A. J. Nadas, D. Nahamoo, M. A. Picheny

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

28 Scopus citations

Abstract

A description is presented of the authors' current research on automatic speech recognition of continuously read sentences from a naturally-occurring corpus: office correspondence. The recognition system combines features from their current isolated-word recognition system and from their previously developed continuous-speech recognition systems. It consists of an acoustic processor, an acoustic channel model, a language model, and a linguistic decoder. Some new features in the recognizer relative to the isolated-word speech recognition system include the use of a fast match to prune rapidly to a manageable number the candidates considered by the detailed match, multiple pronunciations of all function words, and modeling of interphone coarticulatory behavior. The authors recorded training and test data from a set of ten male talkers. The perplexity of the test sentences was found to be 93; none of the sentences was part of the data used to generate the language model. Preliminary (speaker-dependent) recognition results on these talkers yielded an average word error rate of 11.0%.

Original languageEnglish
Pages (from-to)465-467
Number of pages3
JournalProceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
Volume1
StatePublished - 1989
Externally publishedYes
Event1989 International Conference on Acoustics, Speech, and Signal Processing - Glasgow, Scotland
Duration: 23 May 198926 May 1989

Fingerprint

Dive into the research topics of 'Large vocabulary natural language continuous speech recognition'. Together they form a unique fingerprint.

Cite this