Extending Self-Organizing Maps with uncertainty information of probabilistic PCA

We introduce a probabilistic version of the self-organizing map (SOM) where we model the uncertainty of both the model vectors and the data. While uncertainty information about the data is often not available, this property becomes very useful when the method is combined in a hierarchical manner with probabilistic principal component analysis (PCA), where we do estimate uncertainty of the principal components and the weights. We apply the hierarchical model to the domain of collaborative filtering, where probabilistic PCA is a popular approach due to its robustness for tackling many missing values in the data. The main focus in this paper is for recommendation systems about movies, where the movie rating data matrix of size people times movies is available, but contains lots of missing values. The matrix is first decomposed into a matrix product of people times features and features times movies by PCA. Then we apply the probabilistic SOM to both of those matrices separately. The uncertainty is large when a person (or a movie) has only a few ratings. The experiments with Movielens and Netflix data show an improvement over traditional SOM.

[1]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[2]  Grégoire Lefebvre,et al.  A probabilistic Self-Organizing Map for facial recognition , 2008, 2008 19th International Conference on Pattern Recognition.

[3]  Charles M. Bishop Variational principal components , 1999 .

[4]  Domingo López-Rodríguez,et al.  Probabilistic PCA Self-Organizing Maps , 2009, IEEE Transactions on Neural Networks.

[5]  Bong-Jin Yum,et al.  Collaborative filtering based on iterative principal component analysis , 2005, Expert Syst. Appl..

[6]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[7]  大西 仁,et al.  Pearl, J. (1988, second printing 1991). Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan-Kaufmann. , 1994 .

[8]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[9]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[10]  Hidetomo Ichihashi,et al.  Collaborative Filtering Using Principal Component Analysis and Fuzzy Clustering , 2001, Web Intelligence.

[11]  Juan J. Villanueva,et al.  Novel SOM-PCA network for face identification , 2001, SPIE Optics + Photonics.

[12]  Raymond J. Mooney,et al.  Content-boosted collaborative filtering for improved recommendations , 2002, AAAI/IAAI.

[13]  T. Raiko,et al.  Partially observed values , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[14]  Quan Yuan,et al.  Boosting collaborative filtering based on statistical prediction errors , 2008, RecSys '08.

[15]  Andrzej Cichocki,et al.  Adaptive Blind Signal and Image Processing - Learning Algorithms and Applications , 2002 .

[16]  Ezequiel López-Rubio,et al.  The Principal Components Analysis Self-Organizing Map , 2002, ICANN.

[17]  Hsin-Min Wang,et al.  Model-Based Clustering by Probabilistic Self-Organizing Maps , 2009, IEEE Transactions on Neural Networks.

[18]  Alexander Ilin,et al.  Bayesian PCA for reconstruction of historical sea surface temperatures , 2009, 2009 International Joint Conference on Neural Networks.

[19]  A. Hasman,et al.  Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[20]  Juha Karhunen,et al.  Principal Component Analysis for Large Scale Problems with Lots of Missing Values , 2007, ECML.