Delayed reinforcement learning for closed-loop object recognition

Object recognition is a multi-level process requiring a sequence of algorithms at low, intermediate and high levels. Generally, such systems are open loop with no feedback between levels and assuring their robustness is a key challenge in computer vision research. A robust closed-loop system based on "delayed" reinforcement learning is introduced in this paper. The parameters of a multi-level system employed for model-based object recognition are learned. The method improves recognition results over time by using the output at the highest level as feedback for the learning system. It has been experimentally validated by learning the parameters of image segmentation and feature extraction and thereby recognizing 2D objects. The approach systematically controls feedback in a multi-level vision system and provides a potential solution to a long-standing problem in the field of computer vision.

[1]  Steven A. Shafer,et al.  The Phoenix Image Segmentation System: Description and Evaluation , 1982 .

[2]  Theodosios Pavlidis,et al.  Structural pattern recognition , 1977 .

[3]  Bir Bhanu,et al.  Closed-loop object recognition using reinforcement learning , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  Pascal Fua,et al.  Computational strategies for object recognition , 1992, CSUR.

[5]  Bir Bhanu,et al.  Recognition of occluded objects: A cluster-structure algorithm , 1987, Pattern Recognit..

[6]  Jing Peng,et al.  Incremental multi-step Q-learning , 1994, Machine-mediated learning.

[7]  PengJing,et al.  Incremental multi-step Q-learning , 1996 .