论文信息 - Comparing multi-objective and threshold-moving ROC curve generation for a prototype-based classifier

Comparing multi-objective and threshold-moving ROC curve generation for a prototype-based classifier

Receiver Operating Characteristics (ROC) curves represent the performance of a classifier for all possible operating conditions, i.e., for all preferences regarding the tradeoff between false positives and false negatives. The generation of a ROC curve generally involves the training of a single classifier for a given set of operating conditions, with the subsequent use of threshold-moving to obtain a complete ROC curve. Recent work has shown that the generation of ROC curves may also be formulated as a multi-objective optimization problem in ROC space: the goals to be minimized are the false positive and false negative rates. This technique also produces a single ROC curve, but the curve may derive from operating points for a number of different classifiers. This paper aims to provide an empirical comparison of the performance of both of the above approaches, for the specific case of prototype-based classifiers. Results on synthetic and real domains shows a performance advantage for the multi-objective approach.

Joshua D. Knowles | Ricardo Aler | Julia Handl

[1] Peter A. Flach,et al. Improving Accuracy and Cost of Two-class and Multi-class Probabilistic Classifiers Using ROC Curves , 2003, ICML.

[2] Robert C. Holte,et al. Cost curves: An improved method for visualizing classifier performance , 2006, Machine Learning.

[3] Ian H. Witten,et al. Weka: Practical machine learning tools and techniques with Java implementations , 1999 .

[4] Peter A. Flach,et al. On classification, ranking, and probability estimation , 2007, Probabilistic, Logical and Relational Learning - A Further Synthesis.

[5] Evan J. Hughes,et al. MSOPS-II: A general-purpose Many-Objective optimiser , 2007, 2007 IEEE Congress on Evolutionary Computation.

[6] Jonathan E. Fieldsend,et al. Multi-class ROC analysis from a multi-objective optimisation perspective , 2006, Pattern Recognit. Lett..

[7] Thomas Villmann,et al. Relevance LVQ versus SVM , 2004, ICAISC.

[8] Tom Fawcett,et al. Robust Classification for Imprecise Environments , 2000, Machine Learning.

[9] David W. Corne,et al. Approximating the Nondominated Front Using the Pareto Archived Evolution Strategy , 2000, Evolutionary Computation.

[10] Tom Fawcett,et al. An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[11] Richard A. Watson,et al. Reducing Local Optima in Single-Objective Problems by Multi-objectivization , 2001, EMO.

[12] Tom Fawcett,et al. ROC Graphs: Notes and Practical Considerations for Researchers , 2007 .

[13] Thomas G. Dietterich. Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[14] Atsushi Sato. Discriminative Dimensionality Reduction Based on Generalized LVQ , 2001, ICANN.

[15] Robert P. W. Duin,et al. Approximating the multiclass ROC by pairwise analysis , 2007, Pattern Recognit. Lett..

[16] Panu Somervuo,et al. Self-Organizing Maps and Learning Vector Quantization for Feature Sequences , 1999, Neural Processing Letters.

[17] Francisco Herrera,et al. Study on the Impact of Partition-Induced Dataset Shift on $k$-Fold Cross-Validation , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[18] Thomas Villmann,et al. Divergence-based classification in learning vector quantization , 2011, Neurocomputing.

[19] M. Anastasio,et al. Multiobjective genetic optimization of diagnostic classifiers with implications for generating receiver operating characteristic curves , 1999, IEEE Transactions on Medical Imaging.

[20] Bernhard Sendhoff,et al. Generalization Improvement in Multi-Objective Learning , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.