Abstract
A speech production model is considered in which the speech signal is the output of an all-pole filter driven either by a white noise sequence (unvoiced speech) or by the sum of an impulse sequence and a noise sequence (voiced speech). Approximate maximum-likelihood (ML) estimation algorithms for the unvoiced case are well known. In this work, the expectation-maximization (EM) algorithm is used to obtain the ML estimator of the parameters of the voiced speech model. These parameters comprise the parameters of the impulse sequence (pitch parameters) and the parameters of the filter (autoregressive parameters).
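As an illustration only, the signal model described in the abstract can be sketched as follows. This is a minimal sketch of the generative model, not the paper's EM estimator. The function names, the sign convention s[n] = e[n] - Σ a_k s[n-k], the periodic constant-gain impulse train, the Gaussian noise, and all numeric values are assumptions made for the example and are not taken from the paper.

```python
import random


def excitation(n_samples, noise_std, pitch_period=None, impulse_gain=1.0, seed=0):
    """Build the filter input (hypothetical helper).

    Unvoiced: white Gaussian noise only.
    Voiced:   the same noise plus an impulse every `pitch_period` samples.
    """
    rng = random.Random(seed)
    e = [rng.gauss(0.0, noise_std) for _ in range(n_samples)]
    if pitch_period is not None:
        for n in range(0, n_samples, pitch_period):
            e[n] += impulse_gain
    return e


def all_pole_filter(e, ar_coeffs):
    """Run e through an all-pole (autoregressive) filter.

    Assumed convention: s[n] = e[n] - sum_{k=1..p} a_k * s[n-k],
    with zero initial conditions.
    """
    s = []
    for n, x in enumerate(e):
        acc = x
        for k, a in enumerate(ar_coeffs, start=1):
            if n - k >= 0:
                acc -= a * s[n - k]
        s.append(acc)
    return s


# Illustrative second-order filter; its poles have modulus sqrt(0.8) < 1, so it is stable.
ar = [-1.3, 0.8]
voiced = all_pole_filter(excitation(400, 0.05, pitch_period=80), ar)
unvoiced = all_pole_filter(excitation(400, 0.05), ar)
```

In this sketch, the pitch parameters correspond to `pitch_period` and `impulse_gain`, and the autoregressive parameters correspond to `ar_coeffs`; an estimator such as the one the abstract describes would recover these from the observed signal.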
Original language | English |
---|---|
Pages (from-to) | 797-800 |
Number of pages | 4 |
Journal | Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing |
Volume | 2 |
State | Published - 1990 |
Externally published | Yes |
Event | 1990 International Conference on Acoustics, Speech, and Signal Processing: Speech Processing 2, VLSI, Audio and Electroacoustics Part 2 (of 5) - Albuquerque, New Mexico, USA; Duration: 3 Apr 1990 → 6 Apr 1990 |