A computational theory of human stereo vision

An algorithm is proposed for solving the stereoscopic matching problem. The algorithm consists of five steps: (1) Each image is filtered at different orientations with bar masks of four sizes that increase with eccentricity; the equivalent filters are one or two octaves wide. (2) Zero-crossings in the filtered images, which roughly correspond to edges, are localized. Positions of the ends of lines and edges are also found. (3) For each mask orientation and size, matching takes place between pairs of zero-crossings or terminations of the same sign in the two images, for a range of disparities up to about the width of the mask’s central region. (4) Wide masks can control vergence movements, thus causing small masks to come into correspondence. (5) When a correspondence is achieved, it is stored in a dynamic buffer, called the 2½-D sketch. It is shown that this proposal provides a theoretical framework for most existing psychophysical and neurophysiological data about stereopsis. Several critical experimental predictions are also made, for instance about the size of Panum’s area under various conditions. The results of such experiments would tell us whether, for example, co-operativity is necessary for the matching process.

[1]  S. Rice Mathematical analysis of random noise , 1944 .

[2]  L. Riggs,et al.  Eye Movements Recorded During Convergence and Divergence , 1960 .

[3]  B. Julesz Binocular depth perception of computer-generated patterns , 1960 .

[4]  W. H. Ittelson Visual space perception , 1961 .

[5]  G WESTHEIMER,et al.  Independence of conjugate and disjunctive eye movements , 1961, The Journal of physiology.

[6]  G. Westheimer,et al.  Disjunctive eye movements , 1961, The Journal of physiology.

[7]  Béla Julesz,et al.  Towards the Automation of Binocular Depth Perception , 1962, IFIP Congress.

[8]  M. Longuet-Higgins The distribution of intervals between zeros of a stationary random function , 1962, Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences.

[9]  L. Kaufman ON THE NATURE OF BINOCULAR DISPARITY. , 1964, The American journal of psychology.

[10]  D. Mitchell Retinal disparity and diplopia. , 1966, Vision research.

[11]  C. Blakemore,et al.  The neural mechanism of binocular depth discrimination , 1967, The Journal of physiology.

[12]  B. Julesz,et al.  Extension of Panum's fusional area in binocularly stabilized vision. , 1967, Journal of the Optical Society of America.

[13]  J. Robson,et al.  Application of fourier analysis to the visibility of gratings , 1968, The Journal of physiology.

[14]  C Blakemore,et al.  On the existence of neurones in the human visual system selectively sensitive to the orientation and size of retinal images , 1969, The Journal of physiology.

[15]  G Westheimer,et al.  The sensory stimulus for disjunctive eye movements. , 1969, Vision research.

[16]  M. R. Leadbetter On the Distributions of the Times between Events in a Stationary Stream of Events , 1969 .

[17]  G. Sperling Binocular Vision: A Physical and a Neural Theory , 1970 .

[18]  W. Richards,et al.  Anomalous stereoscopic depth perception. , 1971, Journal of the Optical Society of America.

[19]  B. Julesz Foundations of Cyclopean Perception , 1971 .

[20]  P. O. Bishop,et al.  Binocular interaction fields of single units in the cat striate cortex , 1971, The Journal of physiology.

[21]  R. A. Smith,et al.  Disparity processing of spatial frequencies in man , 1972, The Journal of physiology.

[22]  S Blomfield Implicit features and stereoscopy. , 1973, Nature: New biology.

[23]  R. Harriss,et al.  Methylmercury in Estuarine Sediments , 1973, Nature.

[24]  D Regan,et al.  A stereo field map with implications for disparity processing. , 1973, Investigative ophthalmology.

[25]  D. Hubel,et al.  Sequence regularity and geometry of orientation columns in the monkey striate cortex , 1974, The Journal of comparative neurology.

[26]  T. H. Applebaum,et al.  Stereopsis with large disparities: Discrimination and depth magnitude , 1975, Vision Research.

[27]  J I Nelson,et al.  Globality and stereoscopic fusion in binocular vision. , 1975, Journal of theoretical biology.

[28]  B. Julesz,et al.  Independent Spatial-Frequency-Tuned Channels in Binocular Fusion and Rivalry , 1975 .

[29]  David L. Waltz,et al.  Understanding Line drawings of Scenes with Shadows , 1975 .

[30]  John P. Frisby,et al.  Learning to See Complex Random-Dot Stereograms , 1975 .

[31]  M. Georgeson,et al.  Contrast constancy: deblurring in human vision by spatial frequency channels. , 1975, The Journal of physiology.

[32]  J. Frisby,et al.  The Role of Monocularly Conspicuous Features in Facilitating Stereopsis from Random-Dot Stereograms , 1975 .

[33]  Parvati Dev,et al.  Perception of Depth Surfaces in Random-Dot Stereograms: A Neural Model , 1975, Int. J. Man Mach. Stud..

[34]  J. Mayhew,et al.  Rivalrous texture stereograms , 1976, Nature.

[35]  P. Schiller,et al.  Quantitative studies of single-cell properties in monkey striate cortex. III. Spatial frequency. , 1976, Journal of neurophysiology.

[36]  D. Whitteridge,et al.  Binocular visual mechanisms in cortical areas I and II of the sheep. , 1976, The Journal of physiology.

[37]  D Marr,et al.  Cooperative computation of stereo disparity. , 1976, Science.

[38]  Michael L. Hines Line spread function variation near the fovea , 1976, Vision Research.

[39]  D Marr,et al.  Early processing of visual information. , 1976, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[40]  David Marr,et al.  Three-dimensional shape representation (A) , 1977 .

[41]  H. Wilson,et al.  Threshold visibility of frequency gradient patterns , 1977, Vision Research.

[42]  G. Poggio,et al.  Binocular interaction and depth sensitivity in striate and prestriate cortex of behaving rhesus monkey. , 1977, Journal of neurophysiology.

[43]  Tomaso Poggio,et al.  A Theory of Human Stereo Vision , 1977 .

[44]  Rick A. Williams,et al.  The synchrony of binocular saccadic eye movements , 1977, Vision Research.

[45]  David Marr,et al.  Representing Visual Information , 1977 .

[46]  Whitman Richards Stereopsis with and without monocular contours , 1977, Vision Research.

[47]  P. O. Bishop,et al.  Discrimination of orientation and position disparities by binocularly activated neurons in cat straite cortex. , 1977, Journal of neurophysiology.

[48]  H. Wilson Quantitative characterization of two types of line-spread function near the fovea , 1978, Vision Research.

[49]  H. Wilson Quantitative prediction of line spread function measurements: Implications for channel bandwidths , 1978, Vision Research.

[50]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[51]  H. Wilson,et al.  Spatial probability summation and disinhibition in psychophysically measured line-spread functions , 1979, Vision Research.

[52]  J. E. W. MAYHEW,et al.  Texture contours can facilitate stereopsis by initiating vergence eye movements , 1979, Nature.

[53]  J. Bergen,et al.  A four mechanism model for threshold spatial vision , 1979, Vision Research.

[54]  J. Mayhew,et al.  Convergent disparity discriminations in narrow-band-filtered random-dot stereograms , 1979, Vision Research.

[55]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[56]  A. Papoulis Systems and transforms with applications in optics , 1981 .

[57]  W E Grimson,et al.  A computer implementation of a theory of human stereo vision. , 1981, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.