Perception as an Inference Problem

Although the idea of thinking of perception as in inference problem goes back to Helmholtz, it is only recently that we have seen the emergence of neural models of perception that embrace this idea. Here I describe why inferential computations are necessary for perception, and how they go beyond traditional computational approaches based on deductive processes such as feature detection and classification. Neural models of perceptual inference rely heavily upon recurrent computation in which information propagates both within and between levels of representation in a bi- directional manner. The inferential framework shifts us away from thinking of 'receptive fields' and 'tuning' of individual neurons, and instead toward how populations of neurons interact via horizontal and top-down feedback connections to perform collective computations.

[1]  Tai Sing Lee,et al.  Hierarchical Bayesian inference in the visual cortex. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[2]  Lawrence G. Roberts,et al.  Machine Perception of Three-Dimensional Solids , 1963, Outstanding Dissertations in the Computer Sciences.

[3]  H. Barrow,et al.  RECOVERING INTRINSIC SCENE CHARACTERISTICS FROM IMAGES , 1978 .

[4]  D. V. van Essen,et al.  A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[5]  D. Perrett,et al.  Time course of neural responses discriminating different views of the face and head. , 1992, Journal of neurophysiology.

[6]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[7]  H. Barlow Vision: A computational investigation into the human representation and processing of visual information: David Marr. San Francisco: W. H. Freeman, 1982. pp. xvi + 397 , 1983 .

[8]  David W. Arathorn,et al.  Map-Seeking Circuits in Visual Cognition: A Computational Mechanism for Biological and Machine Vision , 2002 .

[9]  Richard G. Baraniuk,et al.  Sparse Coding via Thresholding and Local Competition in Neural Circuits , 2008, Neural Computation.

[10]  Denis Fize,et al.  Speed of processing in the human visual system , 1996, Nature.

[11]  A. Yuille,et al.  Opinion TRENDS in Cognitive Sciences Vol.10 No.7 July 2006 Special Issue: Probabilistic models of cognition Vision as Bayesian inference: analysis by synthesis? , 2022 .

[12]  Patrick Henry Winston,et al.  The psychology of computer vision , 1976, Pattern Recognit..

[13]  J. Bullier,et al.  Reaching beyond the classical receptive field of V1 neurons: horizontal or feedback axons? , 2003, Journal of Physiology-Paris.

[14]  D. Nilsson,et al.  A pessimistic estimate of the time required for an eye to evolve , 1994, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[15]  Rajesh P. N. Rao,et al.  Probabilistic Models of the Brain: Perception and Neural Function , 2002 .

[16]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[17]  Wei Ji Ma,et al.  Bayesian inference with probabilistic population codes , 2006, Nature Neuroscience.

[18]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[19]  Michael S. Lewicki,et al.  Emergence of complex cell properties by learning to generalize in natural scenes , 2009, Nature.

[20]  Brendan J. Frey,et al.  Learning flexible sprites in video layers , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[21]  Bruno A. Olshausen,et al.  Scene analysis in the natural environment , 2014, Front. Psychol..

[22]  Bruno A. Olshausen,et al.  Group Sparse Coding with a Laplacian Scale Mixture Prior , 2010, NIPS.

[23]  E. Rolls,et al.  INVARIANT FACE AND OBJECT RECOGNITION IN THE VISUAL SYSTEM , 1997, Progress in Neurobiology.

[24]  A. Yuille,et al.  Object perception as Bayesian inference. , 2004, Annual review of psychology.

[25]  Adam Binch,et al.  Perception as Bayesian Inference , 2014 .

[26]  M. Gazzaniga The new cognitive neurosciences, 2nd ed. , 2000 .

[27]  Farran Briggs,et al.  A Fast, Reciprocal Pathway between the Lateral Geniculate Nucleus and Visual Cortex in the Macaque Monkey , 2007, The Journal of Neuroscience.

[28]  B. Julesz Textons, the elements of texture perception, and their interactions , 1981, Nature.

[29]  David D. Cox,et al.  Opinion TRENDS in Cognitive Sciences Vol.11 No.8 Untangling invariant object recognition , 2022 .

[30]  Chris Eliasmith,et al.  Neural Engineering: Computation, Representation, and Dynamics in Neurobiological Systems , 2004, IEEE Transactions on Neural Networks.

[31]  David L. Waltz,et al.  Understanding Line drawings of Scenes with Shadows , 1975 .

[32]  A. Sillito,et al.  Corticothalamic feedback enhances stimulus response precision in the visual system , 2007, Proceedings of the National Academy of Sciences.

[33]  P. Lennie Single Units and Visual Cortical Organization , 1998, Perception.

[34]  Shinsuke Shimojo,et al.  Visual surface representation: a critical link between lower-level and higher level vision , 1995 .

[35]  Paul Schrater,et al.  Shape perception reduces activity in human primary visual cortex , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[36]  E. Adelson Lightness Perception and Lightness Illusions , 1999 .

[37]  David Arathorn Computation in the higher visual cortices: map-seeking circuit theory and application to machine vision , 2004, 33rd Applied Imagery Pattern Recognition Workshop (AIPR'04).

[38]  Tomaso Poggio,et al.  Fast Readout of Object Identity from Macaque Inferior Temporal Cortex , 2005, Science.

[39]  Richard E. Turner,et al.  Occlusive Components Analysis , 2009, NIPS.

[40]  Daniel Kersten,et al.  Bayesian models of object perception , 2003, Current Opinion in Neurobiology.

[41]  Bruno A. Olshausen,et al.  Highly overcomplete sparse coding , 2013, Electronic Imaging.

[42]  Allen R. Hanson,et al.  Computer Vision Systems , 1978 .

[43]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[44]  R. Guillery,et al.  Branched thalamic afferents: What are the messages that they relay to the cortex? , 2011, Brain Research Reviews.

[45]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[46]  R. von der Heydt,et al.  Coding of Border Ownership in Monkey Visual Cortex , 2000, The Journal of Neuroscience.

[47]  P Girard,et al.  Feedback connections act on the early part of the responses in monkey visual cortex. , 2001, Journal of neurophysiology.

[48]  Nicolas Le Roux,et al.  Learning a Generative Model of Images by Factoring Appearance and Shape , 2011, Neural Computation.

[49]  Christopher J. Rozell,et al.  Visual Nonclassical Receptive Field Effects Emerge from Sparse Coding in a Dynamical System , 2013, PLoS Comput. Biol..

[50]  József Fiser,et al.  Spontaneous Cortical Activity Reveals Hallmarks of an Optimal Internal Model of the Environment , 2011, Science.

[51]  Michael S. Lewicki,et al.  A Hierarchical Bayesian Model for Learning Nonlinear Statistical Regularities in Nonstationary Natural Signals , 2005, Neural Computation.

[52]  Jitendra Malik,et al.  Shape, albedo, and illumination from a single image of an unknown object , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[53]  Bruno A. Olshausen,et al.  Learning Intermediate-Level Representations of Form and Motion from Natural Movies , 2012, Neural Computation.