Unsupervised Learning of Structured Representations via Closed-Loop Transcription

This paper proposes an unsupervised method for learning a unified representation that serves both discriminative and generative purposes. While most existing unsupervised learning approaches focus on a representation for only one of these two goals, we show that a unified representation can enjoy the mutual benefits of having both. Such a representation is attainable by generalizing the recently proposed closed-loop transcription framework, known as CTRL, to the unsupervised setting. This entails solving a constrained maximin game over a rate reduction objective that expands features of all samples while compressing features of augmentations of each sample. Through this process, we see discriminative low-dimensional structures emerge in the resulting representations. Under comparable experimental conditions and network complexities, we demonstrate that these structured representations enable classification performance close to that of state-of-the-art unsupervised discriminative representations, and conditionally generated image quality significantly higher than that of state-of-the-art unsupervised generative models. Source code can be found at https://github.com/Delay-Xili/uCTRL.
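The rate reduction objective mentioned above builds on the coding-rate function from the maximal coding rate reduction (MCR²) line of work: the rate of the whole feature set is expanded while the rates of grouped features (here, augmentations of the same sample) are compressed. Below is a minimal NumPy sketch of just these two rate terms, not the full closed-loop maximin game; the function names and the choice of `eps` are illustrative, not from the paper's released code.

```python
import numpy as np

def coding_rate(Z, eps=0.5):
    """Rate-distortion estimate R(Z) = (1/2) logdet(I + d/(n*eps^2) * Z Z^T)
    for a d x n matrix of features Z (one column per sample)."""
    d, n = Z.shape
    # slogdet is numerically safer than log(det(...)) for near-singular matrices
    sign, logdet = np.linalg.slogdet(np.eye(d) + (d / (n * eps**2)) * Z @ Z.T)
    return 0.5 * logdet

def rate_reduction(Z, groups, eps=0.5):
    """Delta R = R(Z) - sum_j (n_j / n) * R(Z_j): expand the rate of all
    features while compressing the rate within each group of columns."""
    n = Z.shape[1]
    compress = 0.0
    for idx in groups:
        Zj = Z[:, idx]
        compress += (Zj.shape[1] / n) * coding_rate(Zj, eps)
    return coding_rate(Z, eps) - compress
```

For features where each group lies in its own low-dimensional subspace (e.g. two groups aligned with orthogonal axes), the compressive term is small relative to the expansive term, so the rate reduction is strictly positive; this is the quantity the encoder seeks to maximize in the maximin game.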
