论文信息 - Efiective Online Detection of Task-Independent Landmarks

Efiective Online Detection of Task-Independent Landmarks

One of the key problems in building adaptive autonomous agents is landmark detection. Landmarks can be used for efficient navigation as well as for developing hierarchical cognitive structures. Previous approaches to landmark detection often simply chose landmarks as the agent’s location after fixed intervals of time. Other approaches to landmark detection have focused on the reliability and ease of detection of landmarks. However, systems that use landmarks for hierarchy formations rely on a set of landmarks that provides a means for a concise and effective decomposition of the environment. We believe that such a decomposition is achieved most effectively by identifying transitions that partition the environment into relatively independent sub-regions. Using notions of surprise and consolidation via continued novelty, implemented by relatively simple statistics on the sensory inputs, we introduce an online landmark detection mechanism that reliably identifies landmarks that correspond to such transitions. Since the detected landmarks partition the environment into relatively independent subspaces, the resulting set of landmarks should be very useful for the formation of an online adaptive hierarchical problem decomposition enabling efficient hierarchical adaptation and cognition.

[1] Annie S. Wu,et al. Mobile robot exploration and navigation of indoor spaces using sonar and vision , 1994 .

[2] Russell Greiner,et al. Learning to Select Useful Landmarks , 1994, AAAI.

[3] Benjamin Kuipers,et al. Learning to Explore and Build Maps , 1994, AAAI.

[4] Shumeet Baluja,et al. Using the Representation in a Neural Network's Hidden Layer for Task-Specific Focus of Attention , 1995, IJCAI.

[5] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[6] Randall D. Beer,et al. Spatial learning for navigation in dynamic environments , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[7] Matteo Golfarelli,et al. A Hierarchical Approach to Sonar-Based Landmark Detection in Mobile Robots , 1997 .

[8] Benjamin Kuipers,et al. A Hierarchy of Qualitative Representations for Space , 1998, Spatial Cognition.

[9] Tom Duckett,et al. Mobile robot self-localisation and measurement of performance in middle-scale environments , 1998, Robotics Auton. Syst..

[10] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..

[11] Carme Torras,et al. Detection of natural landmarks through multiscale opponent features , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[12] P. Lanzi,et al. Adaptive Agents with Reinforcement Learning and Internal Memory , 2000 .

[13] C. Torras,et al. Color constancy for landmark detection in outdoor environments , 2001 .

[14] Stephen R. Marsland,et al. Learning to select distinctive landmarks for mobile robot navigation , 2001, Robotics Auton. Syst..

[15] Ralf Möller,et al. Insects could exploit UV-green contrast for Landmark navigation. , 2002, Journal of theoretical biology.

[16] David E. Goldberg,et al. The Design of Innovation: Lessons from and for Competent Genetic Algorithms , 2002 .

[17] Martin V. Butz,et al. Anticipations Control Behavior: Animal Behavior in an Anticipatory Learning Classifier System , 2002, Adapt. Behav..

[18] Martin V. Butz,et al. Anticipatory Learning Classifier Systems , 2002, Genetic Algorithms and Evolutionary Computation.

[19] Stephen Marsland,et al. Learning to autonomously select landmarks for navigation and communication , 2002 .

[20] Stephen R. Marsland,et al. Sensory Anticipation for Autonomous Selection of Robot Landmarks , 2003, ABiALS.

[21] Sebastian Thrun,et al. Bayesian Landmark Learning for Mobile Robot Localization , 1998, Machine Learning.