What Is the Goal of Sensory Coding?

A number of recent attempts have been made to describe early sensory coding in terms of a general information processing strategy. In this paper, two strategies are contrasted. Both strategies take advantage of the redundancy in the environment to produce more effective representations. The first is described as a compact coding scheme. A compact code performs a transform that allows the input to be represented with a reduced number of vectors (cells) with minimal RMS error. This approach has recently become popular in the neural network literature and is related to a process called Principal Components Analysis (PCA). A number of recent papers have suggested that the optimal compact code for representing natural scenes will have units with receptive field profiles much like those found in the retina and primary visual cortex. However, in this paper, it is proposed that compact coding schemes are insufficient to account for the receptive field properties of cells in the mammalian visual pathway. In contrast, it is proposed that the visual system is near to optimal in representing natural scenes only if optimality is defined in terms of sparse distributed coding. In a sparse distributed code, all cells in the code have an equal response probability across the class of images but have a low response probability for any single image. In such a code, the dimensionality is not reduced. Rather, the redundancy of the input is transformed into the redundancy of the firing pattern of cells. It is proposed that the signature for a sparse code is found in the fourth moment of the response distribution (i.e., the kurtosis). In measurements with 55 calibrated natural scenes, the kurtosis was found to peak when the bandwidths of the visual code matched those of cells in the mammalian visual cortex. 
Codes resembling wavelet transforms are proposed to be effective because the response histograms of such codes are sparse (i.e., show high kurtosis) when presented with natural scenes. It is proposed that the structure of the image that allows sparse coding is found in the phase spectrum of the image. It is suggested that natural scenes, to a first approximation, can be considered as a sum of self-similar local functions (i.e., the inverse of a wavelet transform). Possible reasons why sensory systems would evolve toward sparse coding are presented.
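
As a rough illustration of why the fourth moment can serve as a signature of sparseness, the sketch below compares the kurtosis of two synthetic response distributions with equal mean. It is not the measurement reported in the paper: the samples are random draws rather than filter responses to natural scenes, the helper name `kurtosis` is hypothetical, and the convention of subtracting 3 (so that a Gaussian scores zero) is an assumption made here for readability.

```python
import numpy as np

def kurtosis(responses):
    """Fourth standardized moment minus 3 (assumed convention: Gaussian -> 0)."""
    r = np.asarray(responses, dtype=float)
    centered = r - r.mean()
    variance = np.mean(centered ** 2)
    return np.mean(centered ** 4) / variance ** 2 - 3.0

# Synthetic stand-ins for cell responses (assumption: not real image data).
rng = np.random.default_rng(0)
n = 200_000
dense = rng.normal(size=n)     # Gaussian: responses spread evenly around the mean
sparse = rng.laplace(size=n)   # heavy-tailed: mostly near zero, occasionally large

k_dense = kurtosis(dense)
k_sparse = kurtosis(sparse)
print(f"kurtosis, Gaussian responses:     {k_dense:.2f}")
print(f"kurtosis, heavy-tailed responses: {k_sparse:.2f}")
```

A distribution in which most responses sit near zero and a few are large has a pronounced peak and heavy tails, which is what raises the kurtosis relative to the Gaussian case.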
