Examining high level neural representations of cluttered scenes

Humans and other primates can rapidly categorize objects even when they are embedded in complex visual scenes (Thorpe et al., 1996; Fabre-Thorpe et al., 1998). Studies by Serre et al., 2007 have shown that the ability of humans to detect animals in brief presentations of natural images decreases as the size of the target animal decreases and the amount of clutter increases, and additionally, that a feedforward computational model of the ventral visual system, originally developed to account for physiological properties of neurons, shows a similar pattern of performance. Motivated by these studies, we recorded single and multi unit neural spiking activity from macaque superior temporal sulcus (STS) and anterior inferior temporal cortex (AIT), as a monkey passively viewed images of natural scenes. The stimuli consisted of 600 images of animals in natural scenes, and 600 images of natural scenes without animals in them, captured at four different viewing distances, and were the same images used by Serre et al. to allow for a direct comparison between human psychophysics, computational models, and neural data. To analyze the data, we applied population ‘readout‘ techniques (Hung et al., 2005; Meyers et al., 2008) to decode from the neural activity whether an image contained an animal or not. The decoding results showed a similar pattern of degraded decoding performance with increasing clutter as was seen in the human psychophysics and computational model results. However, overall the decoding accuracies from the neural data lower were than that seen in the computational model, and the latencies of information in IT were long (~125ms) relative to behavioral measures obtained from primates in other studies. Additional tests also showed that the responses of the model units were not capturing several properties of the neural responses, and that detecting animals in cluttered scenes using simple model units based on V1 cells worked almost as well as using more complex model units that were designed to model the responses of IT neurons. While these results suggest AIT might not be the primary brain region involved in this form of rapid categorization, additional studies are needed before drawing strong conclusions.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Nicolas Pinto,et al.  How far can you get with a modern face recognition test set using only simple features? , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Li Fei-Fei,et al.  Neural mechanisms of rapid natural scene categorization in human visual cortex , 2009, Nature.

[4]  David J. Freedman,et al.  Dynamic population coding of category information in inferior temporal and prefrontal cortex. , 2008, Journal of neurophysiology.

[5]  R. Vogels,et al.  Effects of Category Learning on the Stimulus Selectivity of Macaque References , 2022 .

[6]  Sidney R Lehky,et al.  Shape selectivity in primate frontal eye field. , 2008, Journal of neurophysiology.

[7]  N. Kanwisher,et al.  A stable topography of selectivity for unfamiliar shape classes in monkey inferior temporal cortex. , 2008, Cerebral cortex.

[8]  G. Orban,et al.  Coding of Shape and Position in Macaque Lateral Intraparietal Area , 2008, The Journal of Neuroscience.

[9]  P. Girard,et al.  Ultra-rapid categorisation in non-human primates , 2008, Animal Cognition.

[10]  T. Poggio,et al.  A model of V4 shape selectivity and invariance. , 2007, Journal of neurophysiology.

[11]  Keiji Tanaka,et al.  Object category structure in response patterns of neuronal population in monkey inferior temporal cortex. , 2007, Journal of neurophysiology.

[12]  Thomas Serre,et al.  A feedforward architecture accounts for rapid categorization , 2007, Proceedings of the National Academy of Sciences.

[13]  J. Tanji,et al.  Categorization of behavioural sequences in the prefrontal cortex , 2007, Nature.

[14]  S. R. Lehky,et al.  Comparison of shape encoding in primate dorsal and ventral visual pathways. , 2007, Journal of neurophysiology.

[15]  David J. Freedman,et al.  Experience-dependent representation of visual categories in parietal cortex , 2006, Nature.

[16]  J. Maunsell,et al.  Feature-based attention in visual cortex , 2006, Trends in Neurosciences.

[17]  Simon J. Thorpe,et al.  Ultra-rapid object detection with saccadic eye movements: Visual processing speed revisited , 2006, Vision Research.

[18]  Thomas Serre,et al.  A Theory of Object Recognition: Computations and Circuits in the Feedforward Path of the Ventral Stream in Primate Visual Cortex , 2005 .

[19]  Tomaso Poggio,et al.  Fast Readout of Object Identity from Macaque Inferior Temporal Cortex , 2005, Science.

[20]  Michèle Fabre-Thorpe,et al.  Rapid categorization of natural scenes in monkeys: target predictability and processing speed , 2005, Neuroreport.

[21]  Detlef Wegener,et al.  The Influence of Sustained Selective Attention on Stimulus Selectivity in Macaque Visual Area MT , 2004, The Journal of Neuroscience.

[22]  Michèle Fabre-Thorpe,et al.  Interaction of top-down and bottom-up processing in the fast visual analysis of natural scenes. , 2004, Brain research. Cognitive brain research.

[23]  Polina Golland,et al.  Permutation Tests for Classification: Towards Statistical Significance in Image-Based Studies , 2003, IPMI.

[24]  David J. Freedman,et al.  A Comparison of Primate Prefrontal and Inferior Temporal Cortices during Visual Categorization , 2003, The Journal of Neuroscience.

[25]  Edmund T Rolls,et al.  The Receptive Fields of Inferior Temporal Cortex Neurons in Natural Scenes , 2003, The Journal of Neuroscience.

[26]  Guillaume A. Rousselet,et al.  Parallel processing in high-level categorization of natural images , 2002, Nature Neuroscience.

[27]  P. Perona,et al.  Rapid natural scene categorization in the near absence of attention , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[28]  N. Sigala,et al.  Visual categorization shapes feature selectivity in the primate temporal cortex , 2002, Nature.

[29]  Y. Yamane,et al.  Complex objects are represented in macaque inferotemporal cortex by the combination of feature columns , 2001, Nature Neuroscience.

[30]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[31]  David L. Sheinberg,et al.  Noticing Familiar Objects in Real World Scenes: The Role of Temporal Cortical Neurons in Natural Vision , 2001, The Journal of Neuroscience.

[32]  David J. Freedman,et al.  Categorical representation of visual stimuli in the primate prefrontal cortex. , 2001, Science.

[33]  P. Fldik,et al.  The Speed of Sight , 2001, Journal of Cognitive Neuroscience.

[34]  G Richard,et al.  Ultra-rapid categorisation of natural scenes does not rely on colour cues: a study in monkeys and humans , 2000, Vision Research.

[35]  J. Maunsell,et al.  Shape selectivity in primate lateral intraparietal cortex , 1998, Nature.

[36]  Keiji Tanaka,et al.  Functional architecture in monkey inferotemporal cortex revealed by in vivo optical imaging , 1998, Neuroscience Research.

[37]  S. Thorpe,et al.  Rapid categorization of natural images by rhesus monkeys , 1998, Neuroreport.

[38]  Vladimir Cherkassky,et al.  The Nature Of Statistical Learning Theory , 1997, IEEE Trans. Neural Networks.

[39]  A. Treves,et al.  The representational capacity of the distributed encoding of information provided by populations of neurons in primate temporal visual cortex , 1997, Experimental Brain Research.

[40]  Jean Bullier,et al.  The Timing of Information Transfer in the Visual System , 1997 .

[41]  Denis Fize,et al.  Speed of processing in the human visual system , 1996, Nature.

[42]  B L McNaughton,et al.  Dynamics of the hippocampal ensemble code for space. , 1993, Science.

[43]  P. C. Murphy,et al.  Cerebral Cortex , 2017, Cerebral Cortex.