Sparse Coding with Multi-Layer Decoders using Variance Regularization

Sparse representations of images are useful in many computer vision applications. Sparse coding with an $l_1$ penalty and a learned linear dictionary requires regularization of the dictionary to prevent a collapse in the $l_1$ norms of the codes; typically, this regularization entails bounding the Euclidean norms of the dictionary's elements. In this work, we propose a novel sparse coding protocol that prevents a collapse in the codes without the need to regularize the decoder. Our method regularizes the codes directly, so that each latent code component has variance greater than a fixed threshold over a set of sparse representations for a given set of inputs. Furthermore, we explore ways to train sparse coding systems with multi-layer decoders effectively, since such decoders can model more complex relationships than linear dictionaries. In experiments on MNIST and natural image patches, we show that decoders learned with our approach have interpretable features in both the linear and multi-layer cases. Moreover, sparse autoencoders with multi-layer decoders trained using our variance regularization method produce higher-quality reconstructions with sparser representations than autoencoders with linear dictionaries. Finally, sparse representations obtained with our variance regularization approach are useful in the downstream tasks of denoising and classification in the low-data regime.
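To make the core idea concrete, the sketch below shows one way the per-component variance constraint could be expressed as a hinge penalty on a batch of codes, in the spirit of variance regularization terms used in self-supervised learning. This is a minimal illustration, not the paper's implementation: the function names, the hinge form, the threshold value, and the loss weights are all assumptions.

```python
import torch

def variance_penalty(z, threshold=1.0, eps=1e-4):
    """Hinge penalty on the per-component standard deviation of a code batch.

    z: (batch_size, code_dim) tensor of sparse codes.
    Components whose std over the batch falls below `threshold` are penalized,
    discouraging any code dimension from collapsing to a constant.
    """
    std = torch.sqrt(z.var(dim=0) + eps)   # std of each code component over the batch
    return torch.relu(threshold - std).mean()

def sparse_coding_loss(x, x_hat, z, l1_weight=1.0, var_weight=1.0):
    """Reconstruction + l1 sparsity + variance penalty (illustrative weights)."""
    recon = ((x - x_hat) ** 2).sum(dim=1).mean()   # squared reconstruction error
    sparsity = z.abs().sum(dim=1).mean()           # l1 penalty on the codes
    return recon + l1_weight * sparsity + var_weight * variance_penalty(z)
```

Because the penalty acts on the codes rather than on the decoder, the decoder (linear or multi-layer) can be left unconstrained; the variance floor plays the role that bounding the dictionary elements' norms plays in classical sparse coding.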
