Paper information - Improving a statistical language model by modulating the effects of context words

Improving a statistical language model by modulating the effects of context words

We show how to improve a state-of-the-art neural network language model that converts the previous "context" words into feature vectors and combines these feature vectors to predict the feature vector of the next word. Significant improvements in predictive accuracy are achieved by using higher-level features to modulate the effects of the context words. This is more effective than using the higher-level features to directly predict the feature vector of the next word, but it is also possible to combine both methods.
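The mechanism described in the abstract can be illustrated with a small sketch. This is not the paper's parameterization: the gating form (one sigmoid gate per context position, driven by a hidden layer over the concatenated context features), the function `next_word_probs`, and the parameter names `R`, `C`, `H`, `G` are all assumptions made for illustration, and the weights are random rather than trained.

```python
import math
import random

def matvec(M, v):
    """Multiply matrix M (list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def softmax(scores):
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]

def next_word_probs(context, R, C, H, G):
    """Hypothetical model: context is a list of word ids, one per position.

    R: word feature vectors, C: one matrix per context position,
    H: context features -> higher-level features (assumed form),
    G: higher-level features -> one gate per context position (assumed form).
    """
    feats = [R[w] for w in context]  # context words -> feature vectors
    concat = [x for f in feats for x in f]
    hidden = [sigmoid(a) for a in matvec(H, concat)]  # higher-level features
    gates = [sigmoid(a) for a in matvec(G, hidden)]   # modulation strengths
    dim = len(R[0])
    predicted = [0.0] * dim
    for gate, C_i, f in zip(gates, C, feats):
        contrib = matvec(C_i, f)
        for k in range(dim):
            # Each context word's contribution is scaled by its gate.
            predicted[k] += gate * contrib[k]
    # Score every vocabulary word by similarity to the predicted vector.
    scores = [sum(p * r for p, r in zip(predicted, row)) for row in R]
    return softmax(scores)

rng = random.Random(0)
vocab, dim, n_ctx, n_hidden = 6, 4, 2, 3
R = rand_matrix(vocab, dim, rng)
C = [rand_matrix(dim, dim, rng) for _ in range(n_ctx)]
H = rand_matrix(n_hidden, n_ctx * dim, rng)
G = rand_matrix(n_ctx, n_hidden, rng)
probs = next_word_probs([1, 4], R, C, H, G)
```

Setting every gate to 1 in this sketch recovers a plain sum of per-position contributions, which is the unmodulated baseline the abstract starts from; the alternative mentioned in the abstract, where the higher-level features predict the next word's feature vector directly, would instead add a term computed from `hidden` to `predicted`.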

Geoffrey E. Hinton | Andriy Mnih | Zhang Yuecheng

[1] Stanley F. Chen, et al. An Empirical Study of Smoothing Techniques for Language Modeling, 1996, ACL.

[2] Andreas Stolcke, et al. SRILM - an extensible language modeling toolkit, 2002, INTERSPEECH.

[3] Yoshua Bengio, et al. A Neural Probabilistic Language Model, 2003, J. Mach. Learn. Res.

[4] John Blitzer, et al. Distributed Latent Variable Models of Lexical Co-occurrences, 2005, AISTATS.

[5] Geoffrey E. Hinton, et al. Three new graphical models for statistical language modelling, 2007, ICML '07.

[6] Jean-Luc Gauvain, et al. Training Neural Network Language Models on Very Large Corpora, 2005, HLT.

[7] John Blitzer, et al. Hierarchical Distributed Representations for Statistical Language Modeling, 2004, NIPS.