Non-Linear Canonical Correlation Analysis Using Alpha-Beta Divergence

We propose a generalization of canonical correlation analysis based on the Alpha-Beta divergence, called AB-canonical analysis (ABCA). From observations of two random variables, x ∈ R^P and y ∈ R^Q, ABCA finds directions w_x ∈ R^P and w_y ∈ R^Q such that the AB-divergence between the joint distribution of (w_x^T x, w_y^T y) and the product of their marginal distributions is maximized. The number of significant non-zero canonical coefficients is determined using a sequential permutation test. The advantage of our method over standard canonical correlation analysis (CCA) is that it can recover the hidden non-linear relationship between w_x^T x and w_y^T y, and that it is robust against outliers. We extend ABCA to data observed as tensors, and we further generalize the method by imposing sparseness constraints. An extensive simulation study is performed to justify our approach.
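The objective described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the histogram density estimate, the function names (`ab_divergence`, `dependence_score`, `permutation_pvalue`), the bin count, and the fixed directions `w_x` and `w_y` are all assumptions made for illustration. The divergence uses the standard discrete AB-divergence form for α, β, α+β ≠ 0, and the search over directions, the sequential version of the test, and the tensor and sparse extensions are omitted.

```python
import numpy as np

def ab_divergence(p, q, alpha=1.0, beta=1.0):
    """Discrete AB-divergence between nonnegative arrays p and q.

    Covers only the generic case alpha, beta, alpha + beta != 0.
    Positive exponents keep empty histogram bins harmless.
    """
    p = np.asarray(p, dtype=float).ravel()
    q = np.asarray(q, dtype=float).ravel()
    s = alpha + beta
    if alpha == 0 or beta == 0 or s == 0:
        raise ValueError("sketch covers only alpha, beta, alpha+beta != 0")
    terms = p**alpha * q**beta - (alpha / s) * p**s - (beta / s) * q**s
    return -terms.sum() / (alpha * beta)

def dependence_score(u, v, bins=10, alpha=1.0, beta=1.0):
    """Divergence between the joint histogram of (u, v) and the
    product of its marginals (assumed estimator, not the paper's)."""
    joint, _, _ = np.histogram2d(u, v, bins=bins)
    joint /= joint.sum()
    product = np.outer(joint.sum(axis=1), joint.sum(axis=0))
    return ab_divergence(joint, product, alpha, beta)

def permutation_pvalue(u, v, n_perm=200, seed=0, **kwargs):
    """Single permutation test: shuffling v breaks any dependence on u."""
    rng = np.random.default_rng(seed)
    observed = dependence_score(u, v, **kwargs)
    null = np.array([dependence_score(u, rng.permutation(v), **kwargs)
                     for _ in range(n_perm)])
    return (1 + np.sum(null >= observed)) / (1 + n_perm)

# Toy data with a purely non-linear (quadratic) link between x and y.
rng = np.random.default_rng(1)
x = rng.normal(size=(500, 3))                      # x in R^3
y = np.column_stack([x[:, 0] ** 2 + 0.1 * rng.normal(size=500),
                     rng.normal(size=500)])        # y in R^2

w_x = np.array([1.0, 0.0, 0.0])                    # hypothetical fixed direction
w_y = np.array([1.0, 0.0])                         # hypothetical fixed direction
u, v = x @ w_x, y @ w_y                            # projections w_x^T x, w_y^T y

score = dependence_score(u, v)
p_value = permutation_pvalue(u, v)
print(score, p_value)
```

In this toy setup the linear correlation between the two projections is close to zero, so a divergence-based score is the kind of criterion that can still register the dependence. A full method would maximize the score over `w_x` and `w_y` rather than fixing them.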
