论文信息 - Statistical segmentation and word modeling techniques in isolated word recognition

Statistical segmentation and word modeling techniques in isolated word recognition

A speech recognition system is described using a combination of statistical segment and word modeling. Segment models are constructed by first segmenting training data automatically and then grouping the resultant segments into clusters. Mixtures of Gaussian densities are used to model each segment cluster. In order to integrate the segment models into word models, a generalization of the hidden Markov model approach is proposed. Experimental results on a multispeaker recognition system for alpha-digits demonstrate that the new approach improved the performance of conventional whole-word-based models. In particular, the word models show good discrimination abilities for differentiating phonetically similar words such as the E-set alphabet.<<ETX>>

[1] N. Sedgwick,et al. A method for segmenting acoustic patterns, with applications to automatic speech recognition , 1977 .

[2] Frank K. Soong. A phonetically labeled acoustic segment (PLAS) approach to speech analysis-synthesis , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[3] Lawrence R. Rabiner,et al. Some performance benchmarks for isolated work speech recognition systems , 1987 .

[4] L. R. Rabiner,et al. Recognition of isolated digits using hidden Markov models with continuous mixture densities , 1985, AT&T Technical Journal.

[5] Chin-Hui Lee. On the use of some robust modeling techniques for speech recognition , 1989 .

[6] Torbjørn Svendsen,et al. On the automatic segmentation of speech signals , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.