Information Geometry of Positive Measures and Positive-Definite Matrices: Decomposable Dually Flat Structure

Information geometry studies the dually flat structure of a manifold, highlighted by the generalized Pythagorean theorem. The present paper studies a class of Bregman divergences called the (ρ,τ)-divergence. A (ρ,τ)-divergence generates a dually flat structure on the manifold of positive measures, as well as on the manifold of positive-definite matrices. The class is composed of decomposable divergences, which are written as a sum of componentwise divergences. Conversely, a decomposable dually flat divergence is shown to be a (ρ,τ)-divergence. A (ρ,τ)-divergence is determined by two monotone scalar functions, ρ and τ. The class includes the KL-divergence and the α-, β- and (α,β)-divergences as special cases. For a decomposable divergence, the transformation between an affine parameter and its dual is easily calculated, because it acts on each component separately. Such a divergence is therefore useful for obtaining the center of a cluster of points, which will be applied to classification and information retrieval in vision. For the manifold of positive-definite matrices, in addition to dual flatness and decomposability, we require invariance under linear transformations, in particular under orthogonal transformations. This opens a way to define a new class of divergences, called the (ρ,τ)-structure, on the manifold of positive-definite matrices.
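As a rough illustration of why decomposability makes the cluster center easy to compute, the following sketch treats only the KL special case for positive measures. It assumes, as a labeled assumption rather than a statement taken from the abstract, that this case corresponds to ρ(m) = log m and τ(m) = m, so the two affine coordinates are θ = log m and η = m. The function names (`kl_positive`, `eta_center`, `theta_center`, `total_cost`) are hypothetical, and the argument order of the divergence is a convention chosen here, not necessarily the paper's.

```python
import math

def kl_positive(m, n):
    """Extended KL divergence between positive measures m and n.

    Decomposable: it is a sum of one scalar term per component.
    """
    return sum(a * math.log(a / b) - a + b for a, b in zip(m, n))

def eta_center(points):
    """Minimizer c of sum_i KL(p_i, c): componentwise arithmetic mean.

    Assumption: eta = tau(m) = m, so averaging in eta averages m itself.
    """
    k = len(points)
    return [sum(col) / k for col in zip(*points)]

def theta_center(points):
    """Minimizer c of sum_i KL(c, p_i): componentwise geometric mean.

    Assumption: theta = rho(m) = log m, so we average the logs and
    map back with exp, one component at a time.
    """
    k = len(points)
    return [math.exp(sum(math.log(v) for v in col) / k) for col in zip(*points)]

def total_cost(center, points, center_first=False):
    """Sum of divergences between a candidate center and all points."""
    if center_first:
        return sum(kl_positive(center, p) for p in points)
    return sum(kl_positive(p, center) for p in points)
```

Each center is obtained by mapping every component to one affine coordinate, averaging, and mapping back, with no iterative optimization. For other choices of ρ and τ the same pattern would presumably apply with different scalar maps, but that is not checked here.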
