Separating Figure from Ground with a Parallel Network

The differentiation of figure from ground plays an important role in the perceptual organization of visual stimuli. The rapidity with which we can discriminate the inside from the outside of a figure suggests that at least this step in the process may be performed in visual cortex by a large number of neurons in several different areas working together in parallel. We have attempted to simulate this collective computation by designing a network of simple processing units that receives two types of information: bottom-up input from the image containing the outlines of a figure, which may be incomplete, and a top-down attentional input that biases one part of the image to be the inside of the figure. No presegmentation of the image was assumed. Two methods for performing the computation were explored: gradient descent, which seeks locally optimal states, and simulated annealing, which attempts to find globally optimal states by introducing noise into the computation. For complete outlines, gradient descent was faster, but the range of input parameters leading to successful performance was very narrow. In contrast, simulated annealing was more robust: it worked over a wider range of attention parameters and a wider range of outlines, including incomplete ones. Our network model is too simplified to serve as a model of human performance, but it does demonstrate that one global property of outlines can be computed through local interactions in a parallel network. Some features of the model, such as the role of noise in escaping from nonglobal optima, may generalize to more realistic models.

[1]  N. Metropolis,et al.  The Monte Carlo method. , 1949 .

[2]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[3]  I. Z. Fisher MONTE CARLO METHOD IN STATISTICAL PHYSICS , 1959 .

[4]  H. E. Derksen,et al.  Fluctuation phenomena in nerve membrane , 1968 .

[5]  J. Gibson,et al.  The Senses Considered As Perceptual Systems , 1967 .

[6]  G. Sperling Binocular Vision: A Physical and a Neural Theory , 1970 .

[7]  R. Gregory The intelligent eye , 1970 .

[8]  B. Julesz Foundations of Cyclopean Perception , 1971 .

[9]  David L. Waltz,et al.  Understanding Line drawings of Scenes with Shadows , 1975 .

[10]  Parvati Dev,et al.  Perception of Depth Surfaces in Random-Dot Stereograms: A Neural Model , 1975, Int. J. Man Mach. Stud..

[11]  D Marr,et al.  Cooperative computation of stereo disparity. , 1976, Science.

[12]  A. Holden Models of the stochastic activity of neurones , 1976 .

[13]  V. Mountcastle,et al.  An organizing principle for cerebral function : the unit module and the distributed system , 1978 .

[14]  H. Barrow,et al.  RECOVERING INTRINSIC SCENE CHARACTERISTICS FROM IMAGES , 1978 .

[15]  D Marr,et al.  A computational theory of human stereo vision. , 1979, Proceedings of the Royal Society of London. Series B, Biological sciences.

[16]  R. Hummel,et al.  Toward a low-level description of dot clusters: Labeling edge, interior, and noise points , 1979 .

[17]  John M. Prager,et al.  Extracting and Labeling Boundary Segments in Natural Scenes , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Shmuel Peleg,et al.  A New Probabilistic Relaxation Scheme , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Tj Sejnowski,et al.  Skeleton filters in the brain , 2014 .

[20]  Azriel Rosenfeld,et al.  Blob Detection by Relaxation , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Geoffrey E. Hinton,et al.  OPTIMAL PERCEPTUAL INFERENCE , 1983 .

[23]  Steven W. Zucker,et al.  On the Foundations of Relaxation Labeling Processes , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  I. Rock The Logic of Perception , 1983 .

[25]  Paul Smolensky,et al.  Schema Selection and Stochastic Inference in Modular Environments , 1983, AAAI.

[26]  Geoffrey E. Hinton,et al.  Massively Parallel Architectures for AI: NETL, Thistle, and Boltzmann Machines , 1983, AAAI.

[27]  Geoffrey E. Hinton,et al.  Parallel visual computation , 1983, Nature.

[28]  Michael S. Landy,et al.  Image processing in perception and cognition , 1983 .

[29]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[30]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  K. Binder Applications of the Monte Carlo Method in Statistical Physics , 1984 .

[32]  S Grossberg,et al.  Neural dynamics of brightness perception: Features, boundaries, diffusion, and resonance , 1984, Perception & Psychophysics.

[33]  S. Ullman Visual routines , 1984, Cognition.

[34]  Demetri Terzopoulos,et al.  Multiresolution computation of visible-surface representations , 1984 .

[35]  J. A. Anderson,et al.  A neural network model of multistable perception. , 1985, Acta psychologica.

[36]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[37]  Terrence J. Sejnowski,et al.  Open questions about computation in cerebral cortex , 1986 .

[38]  G. Sperling,et al.  Tradeoffs between stereopsis and proximity luminance covariance as determinants of perceived 3D structure , 1986, Vision Research.

[39]  Geoffrey E. Hinton,et al.  Learning symmetry groups with hidden units: beyond the perceptron , 1986 .

[40]  Bir Bhanu,et al.  Segmentation of natural scenes , 1987, Pattern Recognit..

[41]  Jerome A. Feldman,et al.  Connectionist Models and Their Properties , 1982, Cogn. Sci..