Beyond Word N-Grams
暂无分享,去创建一个
[1] R. Fisher,et al. The Relation Between the Number of Species and the Number of Individuals in a Random Sample of an Animal Population , 1943 .
[2] Claude E. Shannon,et al. Prediction and Entropy of Printed English , 1951 .
[3] I. Good. THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .
[4] Raphail E. Krichevsky,et al. The performance of universal encoding , 1981, IEEE Trans. Inf. Theory.
[5] J. Rissanen. A UNIVERSAL PRIOR FOR INTEGERS AND ESTIMATION BY MINIMUM DESCRIPTION LENGTH , 1983 .
[6] Robert E. Tarjan,et al. Self-adjusting binary search trees , 1985, JACM.
[7] Slava M. Katz,et al. Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..
[8] Alfredo De Santis,et al. Learning probabilistic prediction functions , 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.
[9] Donald Hindle,et al. Noun Classification From Predicate-Argument Structures , 1990, ACL.
[10] Ian H. Witten,et al. The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.
[11] Kenneth Ward Church,et al. A comparison of the enhanced Good-Turing and deleted estimation methods for estimating probabilities of English bigrams , 1991 .
[12] Philip Resnik,et al. WordNet and Distributional Analysis: A Class-based Approach to Lexical Discovery , 1992, AAAI 1992.
[13] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.
[14] David Haussler,et al. How to use expert advice , 1993, STOC.
[15] Naftali Tishby,et al. Distributional Clustering of English Words , 1993, ACL.
[16] Frans M. J. Willems,et al. The context-tree weighting method: basic properties , 1995, IEEE Trans. Inf. Theory.
[17] Kenneth Ward Church,et al. Inverse Document Frequency (IDF): A Measure of Deviations from Poisson , 1995, VLC@ACL.