论文信息 - Learning Flat Latent Manifolds with VAEs

Learning Flat Latent Manifolds with VAEs

Measuring the similarity between data points often requires domain knowledge, which can in parts be compensated by relying on unsupervised methods such as latent-variable models, where similarity/distance is estimated in a more compact latent space. Prevalent is the use of the Euclidean metric, which has the drawback of ignoring information about similarity of data stored in the decoder, as captured by the framework of Riemannian geometry. We propose an extension to the framework of variational auto-encoders allows learning flat latent manifolds, where the Euclidean metric is a proxy for the similarity between data points. This is achieved by defining the latent space as a Riemannian manifold and by regularising the metric tensor to be a scaled identity matrix. Additionally, we replace the compact prior typically used in variational auto-encoders with a recently presented, more expressive hierarchical one---and formulate the learning problem as a constrained optimisation problem. We evaluate our method on a range of data-sets, including a video-tracking benchmark, where the performance of our unsupervised approach nears that of state-of-the-art supervised approaches, while retaining the computational efficiency of straight-line-based approaches.

Justin Bayer | Patrick van der Smagt | Francesco Ferroni | Nutan Chen | Alexej Klushyn

[1] Tao Li,et al. The Relationships Among Various Nonnegative Matrix Factorization Methods for Clustering , 2006, Sixth International Conference on Data Mining (ICDM'06).

[2] Xueyan Jiang,et al. Metrics for Deep Generative Models , 2017, AISTATS.

[3] Raja Giryes,et al. Improving DNN Robustness to Adversarial Attacks using Jacobian Regularization , 2018, ECCV.

[4] Max Welling,et al. VAE with a VampPrior , 2017, AISTATS.

[5] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[6] Yu Liu,et al. POI: Multiple Object Tracking with High Performance Detection and Appearance Feature , 2016, ECCV Workshops.

[7] Pascal Vincent,et al. Higher Order Contractive Auto-Encoder , 2011, ECML/PKDD.

[8] Justin Bayer,et al. Fast Approximate Geodesics for Deep Generative Models , 2018, ICANN.

[9] David J. Fleet,et al. This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Gaussian Process Dynamical Model , 2007 .

[10] Pierre Vandergheynst,et al. Geometric Deep Learning: Going beyond Euclidean data , 2016, IEEE Signal Process. Mag..

[11] Christopher Burgess,et al. beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework , 2016, ICLR 2016.

[12] Ole Winther,et al. Ladder Variational Autoencoders , 2016, NIPS.

[13] Asahi Ushio,et al. Latent Space Cartography: Generalised Metric-Inspired Measures and Measure-Based Transformations for Generative Models , 2019, ArXiv.

[14] N. Altman. An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression , 1992 .

[15] Alexander A. Alemi,et al. Fixing a Broken ELBO , 2017, ICML.

[16] Dietrich Paulus,et al. Simple online and realtime tracking with a deep association metric , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[17] Justin Bayer,et al. Efficient movement representation by embedding Dynamic Movement Primitives in deep autoencoders , 2015, 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids).

[18] Serge J. Belongie,et al. Bayesian representation learning with oracle constraints , 2015, ICLR 2016.

[19] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[20] Neil D. Lawrence,et al. Metrics for Probabilistic Geometries , 2014, UAI.

[21] Pascal Vincent,et al. The Manifold Tangent Classifier , 2011, NIPS.

[22] Ankit B. Patel,et al. Towards a Better Understanding and Regularization of GAN Training Dynamics , 2018, UAI.

[23] Nir Ailon,et al. Deep Metric Learning Using Triplet Network , 2014, SIMBAD.

[24] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[25] David J. Fleet,et al. Erratum: "Gaussian process dynamical models for human motion" (IEEE Transactions on Pattern analysis and Machine Intelligenc (292)) , 2008 .

[26] Ruslan Salakhutdinov,et al. Importance Weighted Autoencoders , 2015, ICLR.

[27] Patrick van der Smagt,et al. Learning Hierarchical Priors in VAEs , 2019, NeurIPS.

[28] Michael I. Jordan,et al. Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[29] Ioannis Mitliagkas,et al. Manifold Mixup: Better Representations by Interpolating Hidden States , 2018, ICML.

[30] Geoffrey E. Hinton,et al. Neighbourhood Components Analysis , 2004, NIPS.

[31] R Devon Hjelm,et al. On Adversarial Mixup Resynthesis , 2019, NeurIPS.

[32] Francesco Solera,et al. Performance Measures and a Data Set for Multi-target, Multi-camera Tracking , 2016, ECCV Workshops.

[33] Stefan Roth,et al. MOT16: A Benchmark for Multi-Object Tracking , 2016, ArXiv.

[34] Ieee Xplore,et al. IEEE Transactions on Pattern Analysis and Machine Intelligence Information for Authors , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35] Frank Nielsen,et al. GLSR-VAE: Geodesic latent space regularization for variational autoencoder architectures , 2017, 2017 IEEE Symposium Series on Computational Intelligence (SSCI).

[36] Max Welling,et al. Improved Variational Inference with Inverse Autoregressive Flow , 2016, NIPS 2016.

[37] Hugo Larochelle,et al. The Neural Autoregressive Distribution Estimator , 2011, AISTATS.

[38] Bernhard Schölkopf,et al. Kernel Principal Component Analysis , 1997, ICANN.

[39] Lars Kai Hansen,et al. Latent Space Oddity: on the Curvature of Deep Generative Models , 2017, ICLR.

[40] Christopher K. I. Williams,et al. Magnification factors for the SOM and GTM algorithms , 1997 .

[41] Lorenzo Livi,et al. Learning Graph Embeddings on Constant-Curvature Manifolds for Change Detection in Graph Streams , 2018, ArXiv.

[42] Yee Whye Teh,et al. Hierarchical Representations with Poincaré Variational Auto-Encoders , 2019, ArXiv.

[43] John M. Lee. Riemannian Manifolds: An Introduction to Curvature , 1997 .

[44] Samy Bengio,et al. Generating Sentences from a Continuous Space , 2015, CoNLL.

[45] Lorenzo Livi,et al. Change Detection in Graph Streams by Learning Graph Embeddings on Constant-Curvature Manifolds , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[46] Alex Bewley,et al. Deep Cosine Metric Learning for Person Re-identification , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[47] Maximilian Karl,et al. Dynamic movement primitives in latent space of time-dependent variational autoencoders , 2016, 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids).

[48] Patrick van der Smagt,et al. Active Learning based on Data Uncertainty and Model Sensitivity , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[49] Hongyi Zhang,et al. mixup: Beyond Empirical Risk Minimization , 2017, ICLR.

[50] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[51] Fabio Tozeto Ramos,et al. Simple online and realtime tracking , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[52] Kilian Q. Weinberger,et al. Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.