Harmonic coding at 4.8 kb/s

A harmonic coder designed for operation at 4.8 kb/s is presented. Sinusoidal modeling of speech is reviewed and the use of harmonically related frequencies is extended to unvoiced and transition regions. The estimation of the fundamental frequency with emphasis on the fine tuning of the pitch estimates is discussed. The structure of the 4.8-kbit/s harmonic coder is described, and a detailed discussion of phase quantization issues is presented. Experimental results are presented which reveal a reverberant character in many speech utterances and difficulties in some classes of sounds (plosives, liquids, and voiced fricatives). These distortions are mostly produced by the quantization of phase, but they are also explained by the difficulties of the sinusoidal analysis/synthesis in transition regions when the frame length is large.<<ETX>>

[1]  Luís B. Almeida,et al.  Frequency-varying sinusoidal modeling of speech , 1989, IEEE Trans. Acoust. Speech Signal Process..

[2]  Isabel Trancoso,et al.  Quantization Issues in Harmonic Coders , 1988 .

[3]  Luís B. Almeida,et al.  Sinusoidal modeling of voiced and unvoiced speech , 1989, EUROSPEECH.

[4]  Luís B. Almeida,et al.  Nonstationary spectral modeling of voiced speech , 1983 .

[5]  D. L. Thomson Parametric models of the magnitude/phase spectrum for harmonic speech coding , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[6]  Ronald W. Schafer,et al.  Real-time digital hardware pitch detector , 1976 .

[7]  Luís B. Almeida,et al.  Variable-frequency synthesis: An improved harmonic coding scheme , 1984, ICASSP.

[8]  Biing-Hwang Juang,et al.  Line spectrum pair (LSP) and speech data compression , 1984, ICASSP.

[9]  Biing-Hwang Juang,et al.  Speech enhancement with harmonic synthesis , 1983, ICASSP.

[10]  D. Griffin,et al.  A high quality 9.6 kbps speech coding system , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..