Paper information - An incremental speaker-adaptation technique for hybrid HMM-MLP recognizer

One of the problems of speaker-independent continuous speech recognition systems is their inability to cope with inter-speaker variability. When test speakers have characteristics different from those represented in the training pool, system performance degrades substantially. To overcome this problem, speaker-adaptation techniques may be used to provide near speaker-dependent accuracy. In this work we present a speaker-adaptation technique applied to a hybrid HMM-MLP system for large-vocabulary continuous speech recognition. The technique is based on an architecture that employs a trainable linear input network (LIN) to map speaker-specific input feature vectors to the speaker-independent system. It is evaluated on an incremental speaker-adaptation task using a Wall Street Journal (WSJ) database, in both supervised and unsupervised modes. The results show that speaker adaptation within the hybrid framework can substantially improve system performance.
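The LIN idea described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. Several details are assumptions made for illustration only: the tiny random `FrozenMLP` standing in for the speaker-independent network, the identity initialization of the LIN, the choice to keep the MLP weights fixed during adaptation, the synthetic "new speaker" distortion, and all names and hyperparameters. The sketch trains only the linear input layer by backpropagating the cross-entropy error through the fixed MLP.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # subtract row max for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(probs, targets):
    return -np.mean(np.sum(targets * np.log(probs + 1e-12), axis=1))

class FrozenMLP:
    """Hypothetical stand-in for the speaker-independent MLP; never updated here."""
    def __init__(self, n_in, n_hidden, n_out, rng):
        self.W1 = rng.normal(0.0, 0.5, (n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 0.5, (n_hidden, n_out))
        self.b2 = np.zeros(n_out)

    def forward(self, x):
        h = np.tanh(x @ self.W1 + self.b1)
        return h, softmax(h @ self.W2 + self.b2)

    def input_gradient(self, h, probs, targets):
        # Gradient of the mean cross-entropy with respect to the MLP input.
        d_logits = (probs - targets) / len(targets)
        d_pre = (d_logits @ self.W2.T) * (1.0 - h ** 2)
        return d_pre @ self.W1.T

class LinearInputNetwork:
    """Trainable affine map placed in front of the frozen MLP."""
    def __init__(self, n_in):
        self.A = np.eye(n_in)    # assumption: start as the identity mapping
        self.b = np.zeros(n_in)

    def forward(self, x):
        return x @ self.A + self.b

    def update(self, x, d_out, lr):
        self.A -= lr * (x.T @ d_out)
        self.b -= lr * d_out.sum(axis=0)

def adapt(lin, mlp, x, targets, lr=0.01, steps=200):
    """Gradient descent on the LIN parameters only; returns the loss per step."""
    losses = []
    for _ in range(steps):
        h, probs = mlp.forward(lin.forward(x))
        losses.append(cross_entropy(probs, targets))
        lin.update(x, mlp.input_gradient(h, probs, targets), lr)
    return losses

# Synthetic demonstration (all data here is made up for illustration).
rng = np.random.default_rng(0)
mlp = FrozenMLP(4, 8, 3, rng)
clean = rng.normal(size=(200, 4))
labels = mlp.forward(clean)[1].argmax(axis=1)   # classes the MLP assigns to clean input
targets = np.eye(3)[labels]                     # one-hot targets (supervised mode)
# Hypothetical new speaker: the same frames under a fixed affine distortion.
speaker = clean @ (np.eye(4) + 0.3 * rng.normal(size=(4, 4))) + 0.5

lin = LinearInputNetwork(4)
w1_before = mlp.W1.copy()
losses = adapt(lin, mlp, speaker, targets)
print(f"loss before adaptation: {losses[0]:.3f}, after: {losses[-1]:.3f}")
```

In this sketch the targets are known labels, which corresponds to the supervised mode. An unsupervised variant would presumably derive the targets from the recognizer's own output on the adaptation data instead. Incremental adaptation could be approximated by calling `adapt` again on the same `lin` object as further utterances from the speaker arrive.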

Ciro Martins | Luís B. Almeida | Joao P. Neto
