Frequency-varying sinusoidal modeling of speech

A sinusoidal model is presented where the nonstationary nature of speech is considered by using a time-varying frequency and amplitude for each sinusoid. The proposed model generalizes other sinusoidal models while still having an analytically tractable short-time spectrum. The estimation of the parameters of the sinusoids is done in the frequency domain by a suboptimal linear estimator. The experimental results obtained with the proposed model illustrate its ability to represent nonstationary speech frames. >

[1]  M. Portnoff Short-time Fourier analysis of sampled speech , 1981 .

[2]  Biing-Hwang Juang,et al.  Speech enhancement with harmonic synthesis , 1983, ICASSP.

[3]  Per Hedelin A tone oriented voice excited vocoder , 1981, ICASSP.

[4]  Dennis Gabor,et al.  Theory of communication , 1946 .

[5]  Anastasios N. Venetsanopoulos,et al.  Efficient realizations of two-dimensional quadratic digital filters , 1989, IEEE Trans. Acoust. Speech Signal Process..

[6]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[7]  Luís B. Almeida,et al.  Nonstationary spectral modeling of voiced speech , 1983 .

[8]  L. Almeida,et al.  A background for sinusoid based representation of voiced speech , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.