Representing visual schemas in neural networks for scene analysis

Using object recognition in simple scenes as the task, two fundamental problems in neural network systems are addressed: (1) processing large amounts of input with limited resources, and (2) the representation and use of structured knowledge. The solution to the first problem is to process a small amount of the input in parallel and successively focus on other parts of the input. This strategy requires that the system maintain structured knowledge for describing and interpreting successively gathered information. The proposed system, VISOR (Visual Schemas for Object Representation), consists of two main modules. The low-level visual module extracts featural and positional information from the visual input. The schema module encodes structured knowledge about possible objects, and provides top-down information for the low-level visual module to focus attention on different parts of the scene. Working cooperatively with the low-level visual module, it builds a globally consistent interpretation of successively gathered visual information.
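The cooperative loop described above, in which a schema module directs attention and accumulates evidence from a low-level visual module across successive fixations, can be sketched in miniature. This is purely an illustrative assumption about the control flow; the function names, the dictionary-based scene and schema representations, and the matching score are hypothetical and not taken from the paper.

```python
# Hypothetical sketch of a two-module attention loop in the spirit of VISOR.
# All names and data structures here are illustrative assumptions.

def extract_features(scene, location):
    """Low-level visual module: featural and positional information
    gathered at a single fixation point."""
    return {"location": location, "feature": scene[location]}

def interpret(scene, schemas, fixations):
    """Schema module: directs attention to successive parts of the scene,
    accumulates the evidence, and returns the object schema most
    consistent with all observations gathered so far."""
    evidence = [extract_features(scene, loc) for loc in fixations]

    def score(schema):
        # Count fixations whose observed feature matches the schema's
        # expectation at that location.
        return sum(1 for e in evidence
                   if schema.get(e["location"]) == e["feature"])

    return max(schemas, key=lambda name: score(schemas[name]))

# Toy scene: one feature per location.
scene = {"top": "triangle", "bottom": "square"}
schemas = {
    "house": {"top": "triangle", "bottom": "square"},
    "tower": {"top": "square", "bottom": "square"},
}
print(interpret(scene, schemas, ["top", "bottom"]))  # → house
```

The key property the sketch preserves is that only one fixation's worth of input is processed at a time, while the schema module maintains the structured knowledge needed to combine the fixations into a globally consistent interpretation.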