论文信息 - Efficient Dictionary Learning with Sparseness-Enforcing Projections

Efficient Dictionary Learning with Sparseness-Enforcing Projections

Learning dictionaries suitable for sparse coding instead of using engineered bases has proven effective in a variety of image processing tasks. This paper studies the optimization of dictionaries on image data where the representation is enforced to be explicitly sparse with respect to a smooth, normalized sparseness measure. This involves the computation of Euclidean projections onto level sets of the sparseness measure. While previous algorithms for this optimization problem had at least quasi-linear time complexity, here the first algorithm with linear time complexity and constant space complexity is proposed. The key for this is the mathematically rigorous derivation of a characterization of the projection’s result based on a soft-shrinkage function. This theory is applied in an original algorithm called Easy Dictionary Learning (EZDL), which learns dictionaries with a simple and fast-to-compute Hebbian-like learning rule. The new algorithm is efficient, expressive and particularly simple to implement. It is demonstrated that despite its simplicity, the proposed learning algorithm is able to generate a rich variety of dictionaries, in particular a topographic organization of atoms or separable atoms. Further, the dictionaries are as expressive as those of benchmark learning algorithms in terms of the reproduction quality on entire images, and result in an equivalent denoising performance. EZDL learns approximately 30 % faster than the already very efficient Online Dictionary Learning algorithm, and is therefore eligible for rapid data set analysis and problems with vast quantities of learning samples.

[1] Thomas P. Hayes,et al. Block Coordinate Descent for Sparse NMF , 2013, ICLR.

[2] Aapo Hyvärinen,et al. Emergence of Phase- and Shift-Invariant Features by Decomposition of Natural Images into Independent Feature Subspaces , 2000, Neural Computation.

[3] Bruno A. Olshausen,et al. Learning sparse, overcomplete representations of time-varying natural images , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[4] Toshihisa Tanaka,et al. First results on uniqueness of sparse non-negative matrix factorization , 2005, 2005 13th European Signal Processing Conference.

[5] Aapo Hyvärinen,et al. Natural Image Statistics - A Probabilistic Approach to Early Computational Vision , 2009, Computational Imaging and Vision.

[6] Guillermo Sapiro,et al. Learning to Sense Sparse Signals: Simultaneous Sensing Matrix and Sparsifying Dictionary Optimization , 2009, IEEE Transactions on Image Processing.

[7] A. Bruckstein,et al. K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[8] Teuvo Kohonen,et al. The self-organizing map , 1990, Neurocomputing.

[9] William H. Press,et al. Numerical Recipes 3rd Edition: The Art of Scientific Computing , 2007 .

[10] John A. Nelder,et al. A Simplex Method for Function Minimization , 1965, Comput. J..

[11] W. Press,et al. Numerical Recipes: The Art of Scientific Computing , 1987 .

[12] Bruno A. Olshausen,et al. Learning Sparse Representations of Depth , 2010, IEEE Journal of Selected Topics in Signal Processing.

[13] Kjersti Engan,et al. Image compression using learned dictionaries by RLS-DLA and compared with K-SVD , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14] Richard G. Baraniuk,et al. Sparse Coding via Thresholding and Local Competition in Neural Circuits , 2008, Neural Computation.

[15] D. Ruderman,et al. Independent component analysis of natural image sequences yields spatio-temporal filters similar to simple cells in primary visual cortex , 1998, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[16] Michael Elad,et al. Why Simple Shrinkage Is Still Relevant for Redundant Representations? , 2006, IEEE Transactions on Information Theory.

[17] S.G. Hoggar. Mathematics of Digital Images: Creation, Compression, Restoration, Recognition (Hoggar, S.G.; 2006) [Book Review] , 2008, IEEE Signal Processing Magazine.

[18] Roland Memisevic,et al. Feature grouping from spatially constrained multiplicative interaction , 2013, ICLR.

[19] Lei Zhang,et al. Image Deblurring and Super-Resolution by Adaptive Sparse Domain Selection and Adaptive Regularization , 2010, IEEE Transactions on Image Processing.

[20] R. Fergus,et al. Learning invariant features through topographic filter maps , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[21] David J. Field,et al. Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[22] J. Traub. Iterative Methods for the Solution of Equations , 1982 .

[23] S. G. Hoggar. Mathematics of Digital Images: Frontmatter , 2006 .

[24] S. G. Hoggar. Mathematics of Digital Images: List of symbols , 2006 .

[25] Alan C. Bovik,et al. Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures , 2009, IEEE Signal Processing Magazine.

[26] Scott T. Rickard,et al. Comparing Measures of Sparsity , 2008, IEEE Transactions on Information Theory.

[27] Terrence J Sejnowski,et al. Communication in Neuronal Networks , 2003, Science.

[28] D. Donoho. For most large underdetermined systems of linear equations the minimal 𝓁1‐norm solution is also the sparsest solution , 2006 .

[29] Kjersti Engan,et al. Recursive Least Squares Dictionary Learning Algorithm , 2010, IEEE Transactions on Signal Processing.

[30] Miles E. Lopes. Estimating Unknown Sparsity in Compressed Sensing , 2013 .

[31] J. P. Jones,et al. An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[32] Andrew B. Watson,et al. Image Compression Using the Discrete Cosine Transform , 1994 .

[33] I. Horev,et al. Adaptive image compression using sparse dictionaries , 2012, 2012 19th International Conference on Systems, Signals and Image Processing (IWSSIP).

[34] P O Hoyer,et al. Independent component analysis applied to feature extraction from colour and stereo images , 2000, Network.

[35] J. Rodgers,et al. Thirteen ways to look at the correlation coefficient , 1988 .

[36] Aapo Hyvärinen,et al. Topographic Independent Component Analysis , 2001, Neural Computation.

[37] K. Bredies,et al. Linear Convergence of Iterative Soft-Thresholding , 2007, 0709.1598.

[38] Terrence J. Sejnowski,et al. The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[39] Aapo Hyvärinen,et al. Sparse Code Shrinkage: Denoising of Nongaussian Data by Maximum Likelihood Estimation , 1999, Neural Computation.

[40] Brian Gough,et al. GNU Scientific Library Reference Manual - Third Edition , 2003 .

[41] Yann LeCun,et al. Large Scale Online Learning , 2003, NIPS.

[42] Karin Schwab,et al. Best Approximation In Inner Product Spaces , 2016 .

[43] Günther Palm,et al. Sparse activity and sparse connectivity in supervised learning , 2016, J. Mach. Learn. Res..

[44] D. Ringach. Spatial structure and symmetry of simple-cell receptive fields in macaque primary visual cortex. , 2002, Journal of neurophysiology.

[45] Holger Rauhut,et al. A Mathematical Introduction to Compressive Sensing , 2013, Applied and Numerical Harmonic Analysis.

[46] Thomas S. Huang,et al. Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.

[47] Martin Kleinsteuber,et al. Separable Dictionary Learning , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[48] Tony R. Martinez,et al. The general inefficiency of batch training for gradient descent learning , 2003, Neural Networks.

[49] Christopher M. Bishop,et al. Neural networks for pattern recognition , 1995 .

[50] Joseph F. Murray,et al. Dictionary Learning Algorithms for Sparse Representation , 2003, Neural Computation.

[51] Jun Liu,et al. Efficient Euclidean projections in linear time , 2009, ICML '09.

[52] Vincent Lepetit,et al. Learning Separable Filters , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[53] Dimitri P. Bertsekas,et al. Nonlinear Programming , 1997 .

[54] C. Eckart,et al. The approximation of one matrix by another of lower rank , 1936 .

[55] H. Neudecker. Some Theorems on Matrix Differentiation with Special Reference to Kronecker Matrix Products , 1969 .

[56] Guillermo Sapiro,et al. Online dictionary learning for sparse coding , 2009, ICML '09.

[57] Andriana Olmos,et al. A biologically inspired algorithm for the recovery of shading and reflectance images , 2004 .

[58] J. Demmel,et al. Sun Microsystems , 1996 .

[59] Andrew Y. Ng,et al. The Importance of Encoding Versus Training with Sparse Coding and Vector Quantization , 2011, ICML.

[60] Guillermo Sapiro,et al. Non-local sparse models for image restoration , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[61] Thomas S. Huang,et al. A fast orthogonal matching pursuit algorithm , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[62] David L. Donoho,et al. De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[63] D. Hubel,et al. Receptive fields of single neurones in the cat's striate cortex , 1959, The Journal of physiology.

[64] Yonina C. Eldar,et al. Dictionary Optimization for Block-Sparse Representations , 2010, IEEE Transactions on Signal Processing.

[65] Patrik O. Hoyer,et al. Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..

[66] Thomas S. Huang,et al. Coupled Dictionary Training for Image Super-Resolution , 2012, IEEE Transactions on Image Processing.

[67] David J. Field,et al. Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[68] D. Tolhurst,et al. Characterizing the sparseness of neural codes , 2001, Network.

[69] M. Elad,et al. $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.