Estimation of prediction error with known covariate shift

In supervised learning, the estimation of prediction error on unlabeled test data 1 is an important task. Existing methods are usually built on the assumption that 2 the training and test data are sampled from the same distribution, which is often 3 violated in practice. As a result, traditional estimators like cross-validation (CV) 4 will be biased and this may result in poor model selection. In this paper, we 5 assume that we have a test dataset in which the feature values are available but 6 not the outcome labels, and focus on a particular form of distributional shift of 7 covariate shift. We propose an alternative method based on parametric bootstrap of 8 the target of conditional error Err X [2]. Empirically our method outperforms CV 9 for both simulation and real data example across different modeling tasks, and is 10 comparable to state-of-the-art methods for image classification. 11

[1]  J. Steinhardt,et al.  Predicting Out-of-Distribution Error with the Projection Norm , 2022, ICML.

[2]  Mayee F. Chen,et al.  Mandoline: Model Evaluation under Distribution Shift , 2021, ICML.

[3]  R. Tibshirani,et al.  Cross-validation: what does it estimate and how well does it do it? , 2021, Journal of the American Statistical Association.

[4]  Insup Lee,et al.  Calibrated Prediction with Covariate Shift via Unsupervised Domain Adaptation , 2020, AISTATS.

[5]  Stefan Wager,et al.  Cross-Validation, Risk Estimation, and Model Selection: Comment on a Paper by Rosset and Tibshirani , 2020 .

[6]  Guangquan Zhang,et al.  Learning under Concept Drift: A Review , 2019, IEEE Transactions on Knowledge and Data Engineering.

[7]  Waleed A. Yousef,et al.  A Leisurely Look at Versions and Variants of the Cross Validation Estimator , 2019, ArXiv.

[8]  Emmanuel J. Candès,et al.  Conformal Prediction Under Covariate Shift , 2019, NeurIPS.

[9]  Jaime G. Carbonell,et al.  Low-Dimensional Density Ratio Estimation for Covariate Shift Correction , 2019, AISTATS.

[10]  Thomas G. Dietterich,et al.  Benchmarking Neural Network Robustness to Common Corruptions and Perturbations , 2018, ICLR.

[11]  Saharon Rosset,et al.  From Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation , 2017, Journal of the American Statistical Association.

[12]  Kevin Gimpel,et al.  A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks , 2016, ICLR.

[13]  Frank Hutter,et al.  SGDR: Stochastic Gradient Descent with Warm Restarts , 2016, ICLR.

[14]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  A. Bifet,et al.  A survey on concept drift adaptation , 2014, ACM Comput. Surv..

[16]  R. Tibshirani,et al.  A Study of Error Variance Estimation in Lasso Regression , 2013, 1311.5274.

[17]  R. Tibshirani,et al.  Degrees of freedom in lasso problems , 2011, 1111.0653.

[18]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[19]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[20]  Fei-Fei Li,et al.  ImageNet: A large-scale hierarchical image database , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Neil D. Lawrence,et al.  Dataset Shift in Machine Learning , 2009 .

[22]  B. Schölkopf,et al.  Covariate Shift by Kernel Mean Matching , 2009, NIPS 2009.

[23]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[24]  Jan de Leeuw,et al.  Journal of Statistical Software , 2009 .

[25]  Klaus-Robert Müller,et al.  Covariate Shift Adaptation by Importance Weighted Cross Validation , 2007, J. Mach. Learn. Res..

[26]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[27]  D. Hinkley Annals of Statistics , 2006 .

[28]  Fernando Paganini,et al.  IEEE Transactions on Automatic Control , 2006 .

[29]  K. Müller,et al.  Generalization Error Estimation under Covariate Shift , 2005 .

[30]  Jianming Ye On Measuring and Correcting the Effects of Data Mining and Model Selection , 1998 .

[31]  R. Tibshirani,et al.  Improvements on Cross-Validation: The 632+ Bootstrap Method , 1997 .

[32]  P. Zhang,et al.  Assessing prediction error in non-parametric regression , 1995 .

[33]  B. Efron Estimating the Error Rate of a Prediction Rule: Improvement on Cross-Validation , 1983 .

[34]  C. Stein Estimation of the Mean of a Multivariate Normal Distribution , 1981 .

[35]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[36]  H. Akaike A new look at the statistical model identification , 1974 .

[37]  C. L. Mallows Some comments on C_p , 1973 .

[38]  O. William Journal Of The American Statistical Association V-28 , 1932 .

[39]  Lang Wang,et al.  Statistica Sinica , 2022 .