论文信息 - Unsupervised Learning with Non-Ignorable Missing Data

Unsupervised Learning with Non-Ignorable Missing Data

In this paper we explore the topic of unsupervised learning in the presence of nonignorable missing data with an unknown missing data mechanism. We discuss several classes of missing data mechanisms for categorical data and develop learning and inference methods for two specific models. We present empirical results using synthetic data which show that these algorithms can recover both the unknown selection model parameters and the underlying data model parameters to a high degree of accuracy. We also apply the algorithms to real data from the domain of collaborative filtering, and report initial results.

[1] Thomas Hofmann,et al. Learning What People (Don't) Want , 2001, ECML.

[2] Esmeralda A. Ramalho,et al. Discrete choice models for nonignorable missing data , 2002 .

[3] Benjamin M. Marlin,et al. Modeling User Rating Profiles For Collaborative Filtering , 2003, NIPS.

[4] Nicole A. Lazar,et al. Statistical Analysis With Missing Data , 2003, Technometrics.

[5] Benjamin M. Marlin,et al. Collaborative Filtering: A Machine Learning Perspective , 2004 .

[6] Kenneth Y. Goldberg,et al. Eigentaste: A Constant Time Collaborative Filtering Algorithm , 2001, Information Retrieval.