Two decades of statistical language modeling: where do we go from here?
暂无分享,去创建一个
[1] Claude E. Shannon,et al. Prediction and Entropy of Printed English , 1951 .
[2] I. Good. THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .
[3] E. Jaynes. Information Theory and Statistical Mechanics , 1957 .
[4] Karl Steinbuch,et al. Closing remarks , 1959, IFIP Congress.
[5] J. Darroch,et al. Generalized Iterative Scaling for Log-Linear Models , 1972 .
[6] F. Jelinek,et al. Perplexity—a measure of the difficulty of speech recognition tasks , 1977 .
[7] Thomas M. Cover,et al. A convergent gambling estimate of the entropy of English , 1978, IEEE Trans. Inf. Theory.
[8] J. Baker. Trainable grammars for speech recognition , 1979 .
[9] Frederick Jelinek,et al. Interpolated estimation of Markov source parameters from sparse data , 1980 .
[10] Gerard Salton,et al. Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.
[11] Slava M. Katz,et al. Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..
[12] Roland Kuhn,et al. Speech Recognition and the Frequency of Recently Used Words: A Modified Markov Model for Natural Language , 1988, COLING.
[13] Julian Kupiec,et al. Probabilistic Models of Short and Long Distance Word Dependencies in Running Text , 1989, HLT.
[14] Lalit R. Bahl,et al. A tree-based statistical language model for natural language speech recognition , 1989, IEEE Trans. Acoust. Speech Signal Process..
[15] Richard A. Harshman,et al. Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..
[16] Wayne H. Ward,et al. The CMU Air Travel Information Service: Understanding Spontaneous Speech , 1990, HLT.
[17] P. J. Price,et al. Evaluation of Spoken Language Systems: the ATIS Domain , 1990, HLT.
[18] John Cocke,et al. A Statistical Approach to Machine Translation , 1990, CL.
[19] Renato De Mori,et al. A Cache-Based Natural Language Model for Speech Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..
[20] Bernard Mérialdo,et al. A Dynamic Language Model for Speech Recognition , 1991, HLT.
[21] Robert L. Mercer,et al. A Statistical Approach to Sense Disambiguation in Machine Translation , 1991, HLT.
[22] Ian H. Witten,et al. The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.
[23] A. Nadas,et al. An iterative 'flip-flop' approximation of the most informative split in the construction of decision trees , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.
[24] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.
[25] Frederick Jelinek,et al. Basic Methods of Probabilistic Context Free Grammars , 1992 .
[26] Janet M. Baker,et al. The Design for the Wall Street Journal-based CSR Corpus , 1992, HLT.
[27] John Lafferty,et al. Grammatical Trigrams: A Probabilistic Model of Link Grammar , 1992 .
[28] John J. Godfrey,et al. SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[29] Renato De Mori,et al. Corrections to "A Cache-Based Language Model for Speech Recognition" , 1992, IEEE Trans. Pattern Anal. Mach. Intell..
[30] Robert L. Mercer,et al. Adaptive Language Modeling Using Minimum Discriminant Estimation , 1992, HLT.
[31] Glenn Carroll,et al. Two Experiments on Learning Probabilistic Dependency Grammars from Corpora , 1992 .
[32] Beatrice Santorini,et al. Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.
[33] Dana Ron,et al. The Power of Amnesia , 1993, NIPS.
[34] Daniel Dominic Sleator,et al. Parsing English with a Link Grammar , 1995, IWPT.
[35] Reinhard Kneser,et al. On the dynamic adaptation of stochastic language models , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[36] Ronald Rosenfeld,et al. Trigger-based language models: a maximum entropy approach , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[37] Hermann Ney,et al. Improved clustering techniques for class-based statistical language modelling , 1993, EUROSPEECH.
[38] Ronald Rosenfeld,et al. Adaptive Statistical Language Modeling; A Maximum Entropy Approach , 1994 .
[39] Hermann Ney,et al. On structuring probabilistic dependences in stochastic language modelling , 1994, Comput. Speech Lang..
[40] John D. Lafferty,et al. Cluster Expansions and Iterative Scaling for Maximum Entropy Language Models , 1995, ArXiv.
[41] Hermann Ney,et al. Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[42] Douglas E. Appelt,et al. Combining Linguistic and Statistical Knowledge Sources in Natural-Language Processing for ATIS , 1995 .
[43] Ronald Rosenfeld,et al. The CMU Statistical Language Modeling Toolkit and its use in the 1994 ARPA CSR Evaluation , 1995 .
[44] Ralf D. Brown,et al. Applying Statistical English Language Modelling to Symbolic Machine Translation , 1995, TMI.
[45] Isabelle Guyon,et al. Design of a linguistic postprocessor using variable memory length Markov models , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.
[46] Ronald Rosenfeld,et al. A maximum entropy approach to adaptive statistical language modelling , 1996, Comput. Speech Lang..
[47] Michael Collins,et al. A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.
[48] Mari Ostendorf,et al. Modeling long distance dependence in language: topic mixtures vs. dynamic cache models , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[49] Mari Ostendorf,et al. Modeling long distance dependence in language: topic mixtures versus dynamic cache models , 1996, IEEE Trans. Speech Audio Process..
[50] Reinhard Kneser,et al. Statistical language modeling using a variable context length , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[51] R. Rosenfeld,et al. ERROR ANALYSIS AND DISFLUENCY MODELING IN THE SWITCHBOARD DOMAIN , 1996 .
[52] Adam L. Berger,et al. A Maximum Entropy Approach to Natural Language Processing , 1996, CL.
[53] Frederick Jelinek,et al. Statistical methods for speech recognition , 1997 .
[54] John D. Lafferty,et al. A Model of Lexical Attraction and Repulsion , 1997, ACL.
[55] John D. Lafferty,et al. Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..
[56] Andreas Stolcke,et al. Structure and performance of a dependency language model , 1997, EUROSPEECH.
[57] Ronald Rosenfeld,et al. Using story topics for language model adaptation , 1997, EUROSPEECH.
[58] Ronald Rosenfeld,et al. Lattice based language models , 1997 .
[59] Roni Rosenfeld,et al. A whole sentence maximum entropy language model , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.
[60] Ronald Rosenfeld,et al. Statistical language modeling using the CMU-cambridge toolkit , 1997, EUROSPEECH.
[61] Ronald Rosenfeld,et al. Topic adaptation for language modeling using unnormalized exponential models , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[62] Stanley F. Chen,et al. Evaluation Metrics For Language Models , 1998 .
[63] Ronald Rosenfeld,et al. Nonlinear interpolation of topic models for language model adaptation , 1998, ICSLP.
[64] Jerome R. Bellegarda,et al. A multispan language modeling framework for large vocabulary speech recognition , 1998, IEEE Trans. Speech Audio Process..
[65] Eric Brill,et al. Beyond N-Grams: Can Linguistic Sophistication Improve Language Modeling? , 1998, COLING-ACL.
[66] John D. Lafferty,et al. Information retrieval as statistical translation , 1999, SIGIR '99.
[67] Ronald Rosenfeld,et al. Linguistic features for whole sentence maximum entropy language models , 1999, EUROSPEECH.
[68] Dietrich Klakow,et al. COMPACT MAXIMUM ENTROPY LANGUAGE MODELS , 1999 .
[69] Roni Rosenfeld,et al. Interactive Feature Induction and Logistic Regression for Whole Sentence Exponential Language , 1999 .
[70] Ronald Rosenfeld,et al. Efficient sampling and feature selection in whole sentence maximum entropy language models , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[71] Jun Wu,et al. A maximum entropy language model integrating N-grams and topic dependencies for conversational speech recognition , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[72] Jun Wu,et al. Combining nonlocal, syntactic and n-gram dependencies in language modeling , 1999, EUROSPEECH.
[73] Frederick Jelinek,et al. Recognition performance of a structured language model , 2000, EUROSPEECH.
[74] Thomas Niesler,et al. Variable-length categoryn-gram language models , 1999, Comput. Speech Lang..
[75] F ChenStanley,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.
[76] Frederick Jelinek,et al. Improved clustering techniques for class-based statistical language modeling , 1999 .
[77] Jerome R. Bellegarda. Large vocabulary speech recognition with multispan statistical language models , 2000, IEEE Trans. Speech Audio Process..
[78] Ronald Rosenfeld,et al. A survey of smoothing techniques for ME models , 2000, IEEE Trans. Speech Audio Process..
[79] Mari Ostendorf,et al. Variable n-grams and extensions for conversational speech language modeling , 2000, IEEE Trans. Speech Audio Process..
[80] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.
[81] Ronald Rosenfeld,et al. Whole-sentence exponential language models: a vehicle for linguistic-statistical integration , 2001, Comput. Speech Lang..