论文信息 - Co-Validation: Using Model Disagreement on Unlabeled Data to Validate Classification Algorithms

Co-Validation: Using Model Disagreement on Unlabeled Data to Validate Classification Algorithms

In the context of binary classification, we define disagreement as a measure of how often two independently-trained models differ in their classification of unlabeled data. We explore the use of disagreement for error estimation and model selection. We call the procedure co-validation, since the two models effectively (invalidate one another by comparing results on unlabeled data, which we assume is relatively cheap and plentiful compared to labeled data. We show that per-instance disagreement is an unbiased estimate of the variance of error for that instance. We also show that disagreement provides a lower bound on the prediction (generalization) error, and a tight upper bound on the "variance of prediction error", or the variance of the average error across instances, where variance is measured across training sets. We present experimental results on several data sets exploring co-validation for error estimation and model selection. The procedure is especially effective in active learning settings, where training sets are not drawn at random and cross validation overestimates error.

David M. Pennock | Gary William Flake | Omid Madani | G. Flake | Omid Madani

[1] Catherine Blake,et al. UCI Repository of machine learning databases , 1998 .

[2] Partha Niyogi,et al. Algorithmic stability and ensemble-based learning , 2002 .

[3] Anders Krogh,et al. Neural Network Ensembles, Cross Validation, and Active Learning , 1994, NIPS.

[4] Lars Kai Hansen,et al. Neural Network Ensembles , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Dale Schuurmans. A New Metric-Based Approach to Model Selection , 1997, AAAI/IAAI.

[6] Dale Schuurmans,et al. Metric-Based Methods for Adaptive Model Selection and Regularization , 2002, Machine Learning.

[7] Nicolas Chapados,et al. Extensions to Metric-Based Model Selection , 2003, J. Mach. Learn. Res..

[8] Joachim M. Buhmann,et al. Stability-Based Model Selection , 2002, NIPS.

[9] Yoshua Bengio,et al. Série Scientifique Scientific Series No Unbiased Estimator of the Variance of K-fold Cross-validation No Unbiased Estimator of the Variance of K-fold Cross-validation , 2022 .

[10] Partha Niyogi,et al. Almost-everywhere Algorithmic Stability and Generalization Error , 2002, UAI.

[11] Joachim M. Buhmann,et al. Stability-Based Validation of Clustering Solutions , 2004, Neural Computation.