Computational Models of Object Recognition in Cortex: A Review

Understanding how biological visual systems perform object recognition is one of the ultimate goals in computational neuroscience. Among the biological models of recognition the main distinctions are between feedforward and feedback and between object-centered and view-centered. From a computational viewpoint the different recognition tasks — for instance categorization and identification — are very similar, representing different trade-offs between specificity and invariance. Thus the different tasks do not strictly require different classes of models. The focus of the review is on feedforward, view-based models that are supported by psychophysical and physiological data.

[1]  Keiji Tanaka,et al.  Inferotemporal cortex and object vision. , 1996, Annual review of neuroscience.

[2]  B. C. Motter,et al.  Neural correlates of feature selective memory and pop-out in extrastriate area V4 , 1994, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[3]  Yali Amit,et al.  A Computational Model for Visual Selection , 1999, Neural Computation.

[4]  I. Biederman,et al.  Dynamic binding in a neural network for shape recognition. , 1992, Psychological review.

[5]  T. Poggio,et al.  Hierarchical models of object recognition in cortex September 23 , 1999 , 1999 .

[6]  Peter Földiák,et al.  Learning Invariance from Transformation Sequences , 1991, Neural Comput..

[7]  Geoffrey E. Hinton,et al.  The "wake-sleep" algorithm for unsupervised neural networks. , 1995, Science.

[8]  R. Desimone,et al.  Neural Mechanisms of Visual Working Memory in Prefrontal Cortex of the Macaque , 1996, The Journal of Neuroscience.

[9]  Tomaso A. Poggio,et al.  Linear Object Classes and Image Synthesis From a Single Example Image , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  M. Tarr Rotating objects to recognize them: A case study on the role of viewpoint dependency in the recognition of three-dimensional objects , 1995, Psychonomic bulletin & review.

[11]  Tomaso Poggio,et al.  A Note on Object Class Representation and Categorical Perception , 1999 .

[12]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[13]  S. Harnad Categorical Perception: The Groundwork of Cognition , 1990 .

[14]  David I. Perrett,et al.  Neurophysiology of shape processing , 1993, Image Vis. Comput..

[15]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[16]  David L. Sheinberg,et al.  Visual object recognition. , 1996, Annual review of neuroscience.

[17]  Geoffrey E. Hinton,et al.  The Helmholtz Machine , 1995, Neural Computation.

[18]  G Tononi,et al.  Modeling perceptual grouping and figure-ground segregation by means of active reentrant connections. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[19]  R. Desimone,et al.  Visual properties of neurons in a polysensory area in superior temporal sulcus of the macaque. , 1981, Journal of neurophysiology.

[20]  Bartlett W. Mel SEEMORE: Combining Color, Shape, and Texture Histogramming in a Neurally Inspired Approach to Visual Object Recognition , 1997, Neural Computation.

[21]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[22]  M. Tarr News On Views: Pandemonium Revisited , 1999, Nature Neuroscience.

[23]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[24]  Isabel Gauthier,et al.  Three-dimensional object recognition is viewpoint dependent , 1998, Nature Neuroscience.

[25]  E. Rolls,et al.  INVARIANT FACE AND OBJECT RECOGNITION IN THE VISUAL SYSTEM , 1997, Progress in Neurobiology.

[26]  E. Rolls High-level vision: Object recognition and visual cognition, Shimon Ullman. MIT Press, Bradford (1996), ISBN 0 262 21013 4 , 1997 .

[27]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[28]  J. Fuster Inferotemporal units in selective visual attention and short-term memory. , 1990, Journal of neurophysiology.

[29]  Shimon Ullman,et al.  Object Classification Using a Fragment-Based Representation , 2000, Biologically Motivated Computer Vision.

[30]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[31]  M. Young,et al.  Sparse population coding of faces in the inferotemporal cortex. , 1992, Science.

[32]  N. Logothetis,et al.  View-dependent object recognition by monkeys , 1994, Current Biology.

[33]  S. Ullman,et al.  Generalization to Novel Images in Upright and Inverted Faces , 1993, Perception.

[34]  H H Bülthoff,et al.  Psychophysical support for a two-dimensional view interpolation theory of object recognition. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[35]  R. Desimone,et al.  Responses of Neurons in Inferior Temporal Cortex during Memory- Guided Visual Search , 1998 .

[36]  Aapo Hyvärinen,et al.  Emergence of Phase- and Shift-Invariant Features by Decomposition of Natural Images into Independent Feature Subspaces , 2000, Neural Computation.

[37]  M. Tarr,et al.  Becoming a “Greeble” Expert: Exploring Mechanisms for Face Recognition , 1997, Vision Research.

[38]  T. Poggio,et al.  Are Cortical Models Really Bound by the “Binding Problem”? , 1999, Neuron.

[39]  Edmund T. Rolls,et al.  A Model of Invariant Object Recognition in the Visual System: Learning Rules, Activation Functions, Lateral Inhibition, and Information-Based Performance Measures , 2000, Neural Computation.

[40]  S. Thorpe,et al.  Speed of processing in the human visual system , 1996, Nature.

[41]  D. V. van Essen,et al.  A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[42]  T Poggio,et al.  View-based models of 3D object recognition: invariance to imaging transformations. , 1995, Cerebral cortex.

[43]  N. Logothetis,et al.  Shape representation in the inferior temporal cortex of monkeys , 1995, Current Biology.

[44]  Tomaso Poggio,et al.  The Individual is Nothing, the Class Everything: Psychophysics and Modeling of Recognition in Obect Classes , 2000 .

[45]  Rajesh P. N. Rao,et al.  Dynamic Model of Visual Recognition Predicts Neural Response Properties in the Visual Cortex , 1997, Neural Computation.

[46]  Heinrich H Bülthoff,et al.  Image-based object recognition in man, monkey and machine , 1998, Cognition.

[47]  D C Van Essen,et al.  Shifter circuits: a computational strategy for dynamic aspects of visual processing. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Kenji Kawano,et al.  Global and fine information coded by single neurons in the temporal visual cortex , 1999, Nature.

[49]  D Mumford,et al.  On the computational architecture of the neocortex. II. The role of cortico-cortical loops. , 1992, Biological cybernetics.

[50]  Shimon Edelman,et al.  Representation and recognition in vision , 1999 .

[51]  E. T. Rolls,et al.  Activity of neurones in the inferotemporal cortex of the alert monkey , 1977, Brain Research.

[52]  Keiji Tanaka,et al.  Optical Imaging of Functional Organization in the Monkey Inferotemporal Cortex , 1996, Science.

[53]  R. Desimone Face-Selective Cells in the Temporal Cortex of Monkeys , 1991, Journal of Cognitive Neuroscience.

[54]  P M Gochin Properties of simulated neurons from a model of primate inferior temporal cortex. , 1994, Cerebral cortex.

[55]  S. Ullman High-Level Vision: Object Recognition and Visual Cognition , 1996 .