Are Cortical Models Really Bound Review by the "Binding Problem"?

of models to perform higher-level visual processing such temporal cortex (IT) described neurons tuned to views of as viewpoint-invariant object recognition in cluttered complex objects such as faces; i.e., the cells discharged scenes has been questioned in recent years by several strongly to a face seen from a specific viewpoint but researchers, who in turn proposed an alternative class very little or not at all to other objects. A key property of models based on the synchronization of large assem- of these cells was their scale and translation invariance, blies of cells, within and across cortical areas. The main i.e., the robustness of their firing to stimulus transformaimplicit argument for this novel and controversial view tions such as changes in size or position in the visual was the assumption that hierarchical models cannot field. deal with the computational requirements of high-level These findings inspired various models of visual obvision and suffer from the so-called “binding problem.” ject recognition such as Fukushima’s Neocognitron Here, we review the present situation and discuss theo- (1980) or, later, Perrett and Oram’s (1993) outline of a retical and experimental evidence showing that the per- model of shape processing and Wallis and Rolls’ VisNet ceived weaknesses of hierarchical models are unsub- (1997), all of which share the basic idea of the visual stantiated. In particular, we show here that recognition system as a feedforward processing hierarchy where of multiple objects in cluttered scenes, arguably among invariance ranges and complexity of preferred features the most difficult tasks in vision, can be done in a hierar- grow as one ascends through the levels. chical feedforward model. Two problems in particular Models of this type prompted von der Malsburg (1981) make object recognition difficult: to formulate the binding problem. His claim was that 1. The segmentation problem. Visual scenes normally visual representations based on spatially invariant feacontain multiple objects. To recognize individual obture detectors (to achieve invariant recognition) were jects, features must be isolated from the surrounding ambiguous: “As generalizations are performed indepenclutter and extracted from the image, and the feature dently for each feature, information about neighborhood set must be parsed so that the different features are relations and relative position, size, and orientation is assigned to the correct object. The latter problem is lost. This lack of information can lead to the inability to

[1]  M. Wertheimer Laws of organization in perceptual forms. , 1938 .

[2]  Marvin Minsky,et al.  Steps toward Artificial Intelligence , 1995, Proceedings of the IRE.

[3]  Frank Rosenblatt,et al.  PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .

[4]  I. Rock,et al.  GROUPING BASED ON PHENOMENAL PROXIMITY. , 1964, Journal of experimental psychology.

[5]  D H HUBEL,et al.  RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE IN TWO NONSTRIATE VISUAL AREAS (18 AND 19) OF THE CAT. , 1965, Journal of neurophysiology.

[6]  W. Modell On the significance of significant , 1981, The New England journal of medicine.

[7]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[8]  W. Köhler The task of Gestalt psychology , 1969 .

[9]  Marilyn Bohl,et al.  Information processing , 1971 .

[10]  J. Jonides,et al.  A conceptual category effect in visual search: O as letter or as digit , 1972 .

[11]  J. Szentágothai Synaptology of the Visual Cortex , 1973 .

[12]  P. Milner A model for visual shape recognition. , 1974, Psychological review.

[13]  M. Potter Meaning in visual search. , 1975, Science.

[14]  J. A. Deutsch,et al.  On the category effect in visual search , 1977 .

[15]  V. Braitenberg Cell Assemblies in the Cerebral Cortex , 1978 .

[16]  S. Zeki Functional specialisation in the visual cortex of the rhesus monkey , 1978, Nature.

[17]  S Marcelja,et al.  Mathematical description of the responses of simple cortical cells. , 1980, Journal of the Optical Society of America.

[18]  K. Tanaka,et al.  Organization of cat visual cortex as investigated by cross-correlation technique. , 1981, Journal of neurophysiology.

[19]  W Prinzmetal,et al.  Principles of feature integration in visual perception , 1981, Perception & psychophysics.

[20]  P. Lennie The physiological basis of variations in visual latency , 1981, Vision Research.

[21]  J. Lund,et al.  Widespread periodic intrinsic connections in the tree shrew visual cortex. , 1982, Science.

[22]  S. Zeki The distribution of wavelength and orientation selective cells in different areas of monkey visual cortex , 1983, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[23]  M. Mozer Letter migration in word perception. , 1983, Journal of experimental psychology. Human perception and performance.

[24]  B Julesz,et al.  "Where" and "what" in vision. , 1985, Science.

[25]  D. McCormick,et al.  Comparative electrophysiology of pyramidal and sparsely spiny stellate neurons of the neocortex. , 1985, Journal of neurophysiology.

[26]  Ken Nakayama,et al.  Serial and parallel processing of visual feature conjunctions , 1986, Nature.

[27]  T. Wiesel,et al.  Relationships between horizontal interactions and functional architecture in cat striate cortex as revealed by cross-correlation analysis , 1986, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[28]  A. Treisman,et al.  Illusory words: the roles of attention and of top-down constraints in conjoining letters to form words. , 1986, Journal of experimental psychology. Human perception and performance.

[29]  M. Posner,et al.  Does attention affect visual feature integration? , 1986, Journal of experimental psychology. Human perception and performance.

[30]  I. Ohzawa,et al.  The effects of contrast on visual orientation and spatial frequency discrimination: a comparison of single cells and behavior. , 1987, Journal of neurophysiology.

[31]  Christoph von der Malsburg,et al.  Pattern recognition by labeled graph matching , 1988, Neural Networks.

[32]  A Treisman,et al.  Feature analysis in early vision: evidence from search asymmetries. , 1988, Psychological review.

[33]  S. Shipp,et al.  The functional logic of cortical connections , 1988, Nature.

[34]  D. Ts'o,et al.  The organization of chromatic and spatial interactions in the primate striate cortex , 1988, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[35]  R. T. Bottle,et al.  References for reviews , 1989, J. Inf. Sci..

[36]  M. Stryker Is grandmother an oscillation? , 1989, Nature.

[37]  M. Silverman,et al.  Spatial-frequency organization in primate striate cortex. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[38]  William A. Phillips,et al.  On the acquisition of object concepts from sensory data , 1989 .

[39]  K. H. Britten,et al.  Neuronal correlates of a perceptual decision , 1989, Nature.

[40]  S. Hochstein,et al.  How Serial is Serial Processing in Vision? , 1989, Perception.

[41]  R. Desimone,et al.  Spectral properties of V4 neurons in the macaque , 1990, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[42]  D. Purves,et al.  Number of `blobs' in the primary visual cortex of neonatal and adult monkeys , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[43]  T. D. Albright,et al.  Transparency and coherence in human motion perception , 1990, Nature.

[44]  W. Pitts,et al.  A Logical Calculus of the Ideas Immanent in Nervous Activity (1943) , 2021, Ideas That Created the Future.

[45]  John K. Tsotsos Analyzing vision at the complexity level , 1990, Behavioral and Brain Sciences.

[46]  A. Treisman,et al.  Conjunction search revisited. , 1990, Journal of experimental psychology. Human perception and performance.

[47]  J. Storm Potassium currents in hippocampal pyramidal cells. , 1990, Progress in brain research.

[48]  P. McLeod,et al.  Filtering by movement in visual search. , 1991, Journal of experimental psychology. Human perception and performance.

[49]  K. Stratford,et al.  Synaptic transmission between individual pyramidal neurons of the rat visual cortex in vitro , 1991, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[50]  Michael C. Mozer,et al.  The perception of multiple objects , 1991 .

[51]  W. Singer,et al.  Selection of intrinsic horizontal connections in the visual cortex by correlated neuronal activity. , 1992, Science.

[52]  Ad Aertsen,et al.  Coding and Computation in the Cortex: Single-Neuron Activity and Cooperative Phenomena , 1992 .

[53]  M. Young,et al.  On oscillating neuronal responses in the visual cortex of the monkey. , 1992, Journal of neurophysiology.

[54]  I. Biederman,et al.  Dynamic binding in a neural network for shape recognition. , 1992, Psychological review.

[55]  M. Young,et al.  Sparse population coding of faces in the inferotemporal cortex. , 1992, Science.

[56]  J. Maunsell,et al.  Visual response latencies in striate cortex of the macaque monkey. , 1992, Journal of neurophysiology.

[57]  M. Tovée,et al.  Oscillatory activity is not evident in the primate temporal visual cortex with static stimuli , 1992, Neuroreport.

[58]  A. Peters,et al.  Neuronal organization in area 17 of cat visual cortex. , 1993, Cerebral cortex.

[59]  Marvin M. Chun,et al.  Making use of texton gradients: visual search and perceptual grouping exploit the same parallel processes in different ways , 1995 .

[60]  E. Adelson Perceptual organization and the judgment of brightness. , 1993, Science.

[61]  J. Donoghue,et al.  Oscillations in local field potentials of the primate motor cortex during voluntary movement. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[62]  K. Reinikainen,et al.  Selective attention enhances the auditory 40-Hz transient response in humans , 1993, Nature.

[63]  H. Nothdurft Faces and Facial Expressions do not Pop Out , 1993, Perception.

[64]  K Tanaka,et al.  Neuronal mechanisms of object recognition. , 1993, Science.

[65]  B L McNaughton,et al.  Dynamics of the hippocampal ensemble code for space. , 1993, Science.

[66]  M. Goodale,et al.  Chapter 28 Visual pathways to perception and action , 1993 .

[67]  A. Treisman Visual Attention and the Perception of Features and Objects , 1994 .

[68]  Wolfgang Konen,et al.  A fast dynamic link matching algorithm for invariant pattern recognition , 1994, Neural Networks.

[69]  Charles M. Gray,et al.  Simulations of Intrinsically Bursting Neocortical Pyramidal Neurons , 1994, Neural Computation.

[70]  J. Wolfe Visual search in continuous, naturalistic stimuli , 1994, Vision Research.

[71]  J. B. Levitt,et al.  Receptive fields and functional architecture of macaque V2. , 1994, Journal of neurophysiology.

[72]  Christoph von der Malsburg,et al.  The Correlation Theory of Brain Function , 1994 .

[73]  D. V. Essen,et al.  Neural mechanisms of form and motion processing in the primate visual system , 1994, Neuron.

[74]  Ehud Zohary,et al.  Correlated neuronal discharge rate and its implications for psychophysical performance , 1994, Nature.

[75]  William T Newsome,et al.  Is there a signal in the noise? , 1995, Current Opinion in Neurobiology.

[76]  B. Sakmann,et al.  Amplification of EPSPs by axosomatic sodium channels in neocortical pyramidal neurons , 1995, Neuron.

[77]  A. Treisman Modularity and attention: Is the binding problem real? , 1995 .

[78]  John K. Tsotsos Toward a computational model of visual attention , 1995 .

[79]  P. Cavanagh,et al.  FACIAL ORGANIZATION BLOCKS ACCESS TO LOW-LEVEL FEATURES: AN OBJECT INFERIORITY EFFECT , 1995 .

[80]  R. Tootell,et al.  Anatomical evidence for MT and additional cortical visual areas in humans. , 1995, Cerebral cortex.

[81]  T. Sejnowski,et al.  A selection model for motion processing in area MT of primates , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[82]  Wolf Singer,et al.  Development and Plasticity of Cortical Processing Architectures , 1995, Science.

[83]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[84]  G. Buzsáki,et al.  Sharp wave-associated high-frequency oscillation (200 Hz) in the intact hippocampus: network and intracellular mechanisms , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[85]  W Singer,et al.  Visual feature integration and the temporal correlation hypothesis. , 1995, Annual review of neuroscience.

[86]  F. Varela Resonant cell assemblies: a new approach to cognitive functions and neuronal synchrony. , 1995, Biological research.

[87]  P A Salin,et al.  Corticocortical connections in the visual system: structure and function. , 1995, Physiological reviews.

[88]  A. Treisman The binding problem , 1996, Current Opinion in Neurobiology.

[89]  A. Dale,et al.  New images from human visual cortex , 1996, Trends in Neurosciences.

[90]  A. C. Yu,et al.  Temporal Hierarchical Control of Singing in Birds , 1996, Science.

[91]  Victor A. F. Lamme,et al.  Contextual Modulation in Primary Visual Cortex , 1996, The Journal of Neuroscience.

[92]  D. Tank,et al.  Dendritic Integration in Mammalian Neurons, a Century after Cajal , 1996, Neuron.

[93]  R. Traub,et al.  A mechanism for generation of long-range synchronous fast oscillations in the cortex , 1996, Nature.

[94]  W. Singer,et al.  Stimulus-dependent synchronization of neuronal responses in the visual cortex of the awake macaque monkey , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[95]  Laurenz Wiskott,et al.  Face recognition by dynamic link matching , 1996 .

[96]  E. Fetz,et al.  Synchronization of neurons during local field potential oscillations in sensorimotor cortex of awake monkeys. , 1996, Journal of neurophysiology.

[97]  J. B. Levitt,et al.  Relation between patterns of intrinsic lateral connectivity, ocular dominance, and cytochrome oxidase-reactive regions in macaque monkey striate cortex. , 1996, Cerebral cortex.

[98]  W. Singer,et al.  In search of common foundations for cortical computation , 1997, Behavioral and Brain Sciences.

[99]  C. Koch,et al.  On the relationship between synaptic input and spike output jitter in individual neurons. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[100]  D. Johnston,et al.  A Synaptically Controlled, Associative Signal for Hebbian Plasticity in Hippocampal Neurons , 1997, Science.

[101]  Tomaso A. Poggio,et al.  Just One View: Invariances in Inferotemporal Cell Tuning , 1997, NIPS.

[102]  A. Treisman,et al.  The Interaction of Spatial and Object Pathways: Evidence from Balint's Syndrome , 1997, Journal of Cognitive Neuroscience.

[103]  J. Lisman Bursts as a unit of neural information: making unreliable synapses reliable , 1997, Trends in Neurosciences.

[104]  A. Aertsen,et al.  Spike synchronization and rate modulation differentially involved in motor cortical function. , 1997, Science.

[105]  J. Wolfe,et al.  Preattentive Object Files: Shapeless Bundles of Basic Features , 1997, Vision Research.

[106]  T J Sejnowski,et al.  A new view of hemineglect based on the response properties of parietal neurones. , 1997, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[107]  M. Margulis,et al.  Temporal integration can readily switch between sublinear and supralinear summation. , 1998, Journal of neurophysiology.

[108]  Catherine Tallon-Baudry,et al.  Induced γ-Band Activity during the Delay of a Visual Short-Term Memory Task in Humans , 1998, The Journal of Neuroscience.

[109]  W. Singer,et al.  The response of cat visual cortex to flicker stimuli of variable frequency , 1998, The European journal of neuroscience.

[110]  A Treisman,et al.  Feature binding, attention and object perception. , 1998, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[111]  Victor A. F. Lamme,et al.  Neuronal synchrony does not represent texture segregation , 1998, Nature.

[112]  Marisa Carrasco,et al.  Attention improves or impairs visual performance by enhancing spatial resolution , 1998, Nature.

[113]  D. McCormick,et al.  H-Current Properties of a Neuronal and Network Pacemaker , 1998, Neuron.

[114]  W. Newsome,et al.  The Variable Discharge of Cortical Neurons: Implications for Connectivity, Computation, and Information Coding , 1998, The Journal of Neuroscience.

[115]  G. Edelman,et al.  Consciousness and Complexity , 1998 .

[116]  A. Treisman,et al.  Perceiving visually presented objets: recognition, awareness, and modularity , 1998, Current Opinion in Neurobiology.

[117]  Li I. Zhang,et al.  A critical window for cooperation and competition among developing retinotectal synapses , 1998, Nature.

[118]  Hartmut Neven,et al.  The Bochum/USC Face Recognition System And How it Fared in the FERET Phase III Test , 1998 .

[119]  N. Logothetis Single units and conscious vision. , 1998, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[120]  Steffen Schmalz,et al.  Combining Multiple Views and Temporal Associations for 3-D object Recognition , 1998, ECCV.

[121]  R. Desimone,et al.  Competitive Mechanisms Subserve Attention in Macaque Areas V2 and V4 , 1999, The Journal of Neuroscience.

[122]  B. Sakmann,et al.  A new cellular mechanism for coupling inputs arriving at different cortical layers , 1999, Nature.

[123]  Christoph von der Malsburg,et al.  The What and Why of Binding The Modeler’s Perspective , 1999, Neuron.

[124]  J. Wolfe,et al.  The Psychophysical Evidence for a Binding Problem in Human Vision , 1999, Neuron.

[125]  Laurenz Wiskott,et al.  The role of topographical constraints in face recognition , 1999, Pattern Recognition Letters.

[126]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[127]  X. Wang Fast burst firing and short-term synaptic plasticity: A model of neocortical chattering neurons , 1999, Neuroscience.

[128]  Leif H. Finkel,et al.  Perceptual grouping in striate cortical networks mediated by synchronization and desynchronization , 1999, Neurocomputing.

[129]  Bartlett W. Mel,et al.  Minimizing Binding Errors Using Learned Conjunctive Features , 2000, Neural Computation.

[130]  D. Wilkin,et al.  Neuron , 2001, Brain Research.

[131]  Jessica Lowell Neural Network , 2001 .