Speech enhancement using a mixture-maximum model

David Burshtein*, Sharon Gannot

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

75 Scopus citations

Abstract

We present a spectral domain, speech enhancement algorithm. The new algorithm is based on a mixture model for the short time spectrum of the clean speech signal, and on a maximum assumption in the production of the noisy speech spectrum. In the past this model was used in the context of noise robust speech recognition. In this paper we show that this model is also effective for improving the quality of speech signals corrupted by additive noise. The computational requirements of the algorithm can be significantly reduced, essentially without paying performance penalties, by incorporating a dual codebook scheme with tied variances. Experiments, using recorded speech signals and actual noise sources, show that in spite of its low computational requirements, the algorithm shows improved performance compared to alternative speech enhancement algorithms.

Original languageEnglish
Pages (from-to)341-351
Number of pages11
JournalIEEE Transactions on Speech and Audio Processing
Volume10
Issue number6
DOIs
StatePublished - Sep 2002

Keywords

  • Gaussian mixture model
  • MIXMAX model
  • Speech enhancement

Fingerprint

Dive into the research topics of 'Speech enhancement using a mixture-maximum model'. Together they form a unique fingerprint.

Cite this