Selective Classifiers Can Be Too Restrictive: A Case-Study in Oesophageal Cancer

Real-life datasets in biomedicine often include missing values. When learning a Bayesian network classifier from such a dataset, the missing values are typically filled in by means of an imputation method to arrive at a complete dataset. The thus completed dataset then is used for the classifier’s construction. When learning a selective classifier, also the selection of appropriate features is based upon the completed data. The resulting classifier, however, is likely to be used in the original real-life setting where it is again confronted with missing values. By means of a real-life dataset in the field of oesophageal cancer that includes a relatively large number of missing values, we argue that especially the wrapper approach to feature selection may result in classifiers that are too selective for such a setting and that, in fact, some redundancy is required to arrive at a reasonable classification accuracy in practice.

[1]  Enrique F. Castillo,et al.  Expert Systems and Probabilistic Network Models , 1996, Monographs in Computer Science.

[2]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[3]  Mehran Sahami,et al.  Learning Limited Dependence Bayesian Classifiers , 1996, KDD.

[4]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[5]  Leandro Pardo Llorente Teoría de la información estadística , 1993 .

[6]  Finn V. Jensen,et al.  Bayesian Networks and Decision Graphs , 2001, Statistics for Engineering and Information Science.

[7]  Michael J. Pazzani,et al.  Searching for Dependencies in Bayesian Classifiers , 1995, AISTATS.

[8]  G. Kalton,et al.  The treatment of missing survey data , 1986 .

[9]  D. Hand,et al.  Idiot's Bayes—Not So Stupid After All? , 2001 .

[10]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[11]  Pat Langley,et al.  Induction of Selective Bayesian Classifiers , 1994, UAI.

[12]  Marvin Minsky,et al.  Steps toward Artificial Intelligence , 1995, Proceedings of the IRE.

[13]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.