论文信息 - A spectral model for nonstationary voiced speech

A spectral model for nonstationary voiced speech

Spectral modeling is a key step in several speech processing applications. This paper presents a novel model capable of accurately handling the short-time spectrum of nonstationary voiced speech. As is well known, locally stationary, i.e., periodic, voiced speech exhibits a well-defined spectral line structure. It is shown in this paper that locally nonstationary voiced speech, i.e., speech with variations of both pitch and vocal tract, within the analysis window, still exhibits, in a generalized form, a spectral line structure, each line being formed by a series of well-defined components. The applications of this model to several speech processing problems, namely those involving speech prediction and pitch detection are presented, along with supporting experimental results.

José M. Tribolet | Luís B. Almeida

[1] José M. Tribolet,et al. A model for short-time phase prediction of speech , 1981, ICASSP.

[2] Ronald W. Schafer,et al. Real-time digital hardware pitch detector , 1976 .

[3] Luís B. Almeida,et al. Harmonic coding: A low bit-rate, good-quality speech coding technique , 1982, ICASSP.