论文信息 - Estimation of prediction error with known covariate shift

Estimation of prediction error with known covariate shift

In supervised learning, the estimation of prediction error on unlabeled test data 1 is an important task. Existing methods are usually built on the assumption that 2 the training and test data are sampled from the same distribution, which is often 3 violated in practice. As a result, traditional estimators like cross-validation (CV) 4 will be biased and this may result in poor model selection. In this paper, we 5 assume that we have a test dataset in which the feature values are available but 6 not the outcome labels, and focus on a particular form of distributional shift of 7 covariate shift. We propose an alternative method based on parametric bootstrap of 8 the target of conditional error Err X [2]. Empirically our method outperforms CV 9 for both simulation and real data example across different modeling tasks, and is 10 comparable to state-of-the-art methods for image classification. 11

R. Tibshirani | Hui Xu

[1] J. Steinhardt,et al. Predicting Out-of-Distribution Error with the Projection Norm , 2022, ICML.

[2] Mayee F. Chen,et al. Mandoline: Model Evaluation under Distribution Shift , 2021, ICML.

[3] R. Tibshirani,et al. Cross-validation: what does it estimate and how well does it do it? , 2021, Journal of the American Statistical Association.

[4] Insup Lee,et al. Calibrated Prediction with Covariate Shift via Unsupervised Domain Adaptation , 2020, AISTATS.

[5] Stefan Wager,et al. Cross-Validation, Risk Estimation, and Model Selection: Comment on a Paper by Rosset and Tibshirani , 2020 .

[6] Guangquan Zhang,et al. Learning under Concept Drift: A Review , 2019, IEEE Transactions on Knowledge and Data Engineering.

[7] Waleed A. Yousef,et al. A Leisurely Look at Versions and Variants of the Cross Validation Estimator , 2019, ArXiv.

[8] Emmanuel J. Candès,et al. Conformal Prediction Under Covariate Shift , 2019, NeurIPS.

[9] Jaime G. Carbonell,et al. Low-Dimensional Density Ratio Estimation for Covariate Shift Correction , 2019, AISTATS.

[10] Thomas G. Dietterich,et al. Benchmarking Neural Network Robustness to Common Corruptions and Perturbations , 2018, ICLR.

[11] Saharon Rosset,et al. From Fixed-X to Random-X Regression: Bias-Variance Decompositions, Covariance Penalties, and Prediction Error Estimation , 2017, Journal of the American Statistical Association.