Distribution of mutual information from complete and incomplete data
