Attention to multiple local critics in decision making and control

Dealing with uncertainties and lack of knowledge about problems and situations, there is a perpetual difficulty to evaluate the situations and action values in all time steps. On the other hand, the design of critics which delicately guide the agent even with reinforcement rewards and punishments in these complicated or blurred environments is laborious and cumbersome. In this study, we propose a framework for concurrent learning of control of attention to the sensory space, attention to the various critics to evaluate the selected motor actions, and the motor actions themselves. Previous works include the implementation of the control of attention for selecting the most important parts of data and/or reducing the dimensionality of the input space. However, decision making can depend not only on objective sensory data, but also upon mental states or subjective inputs as well. Specifically, we examine attention for the evaluations of selected action by various local critics, as well as sensory inputs. Each local critic evaluates the agent's actions regarding to its standpoints on task domain. Our agent tries to learn the degree of importance of each local critic's assessment, using sparse super-critic punishing revisions. So the agent learns the way of combining or even disregarding some of local critics while it learns to focus on the appropriate subset of features, and learns physical actions concurrently. By discovering the effective combination of local critics, the agent does not need any prior accurately designed critic. The mathematical formulation of proposed learning method is developed. Also, in order to evaluate the proposed method, two benchmarks are discussed. The effect of using attention control on robustness is analyzed via Monte Carlo analysis. The experimental results show the efficiency of proposed formulation in presence of uncertainties.

[1]  Sridhar Mahadevan,et al.  A reinforcement learning model of selective visual attention , 2001, AGENTS '01.

[2]  Caro Lucas,et al.  Introducing Belbic: Brain Emotional Learning Based Intelligent Controller , 2004, Intell. Autom. Soft Comput..

[3]  Christian Balkenius,et al.  Attention as selection-for-action: a scheme for active perception , 1999, 1999 Third European Workshop on Advanced Mobile Robots (Eurobot'99). Proceedings (Cat. No.99EX355).

[4]  M. Corbetta,et al.  Control of goal-directed and stimulus-driven attention in the brain , 2002, Nature Reviews Neuroscience.

[5]  Bernhard Sendhoff,et al.  Pareto-Based Multiobjective Machine Learning: An Overview and Case Studies , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[6]  Antonio Fernández-Caballero,et al.  Algorithmic lateral inhibition method in dynamic and selective visual attention task: Application to moving objects detection and labelling , 2006, Expert Syst. Appl..

[7]  Arash Arami,et al.  Emotion on FPGA: Model driven approach , 2009, Expert Syst. Appl..

[8]  Majid Nili Ahmadabadi,et al.  Biologically Inspired Framework for Learning and Abstract Representation of Attention Control , 2008, WAPCV.

[9]  C. Lucas,et al.  A fast model free intelligent controller based on fused emotions: A practical case implementation , 2008, 2008 16th Mediterranean Conference on Control and Automation.

[10]  Matthew H Tong,et al.  Information Attracts Attention: A Probabilistic Account of the Cross-Race Advantage in Visual Search , 2007 .

[11]  Lucas Paletta,et al.  Cascaded Sequential Attention for Object Recognition with Informative Local Descriptors and Q-learning of Grouping Strategies , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[12]  Kalyanmoy Deb,et al.  Dynamic multiobjective optimization problems: test cases, approximations, and applications , 2004, IEEE Transactions on Evolutionary Computation.

[13]  Katsuhiko Ogata,et al.  Modern Control Engineering , 1970 .

[14]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[15]  Charles Curtis Eriksen,et al.  The extent of processing of noise elements during selective encoding from visual displays , 1973 .

[16]  Babak Nadjar Araabi,et al.  Implementation of Emotional Controller for Interior Permanent Magnet Synchronous Motor Drive , 2006, Conference Record of the 2006 IEEE Industry Applications Conference Forty-First IAS Annual Meeting.

[17]  Caro Lucas,et al.  Real Time Emotional Control for Anti-Swing and Positioning Control of SIMO Overhead Traveling Crane , 2008 .

[18]  L. Itti,et al.  Modeling the influence of task on attention , 2005, Vision Research.

[19]  George Sperling,et al.  The information available in brief visual presentations. , 1960 .

[20]  Caro Lucas,et al.  Emotional control of inverted pendulum system: A soft switching from imitative to emotional learning , 2000, 2009 4th International Conference on Autonomous Robots and Agents.

[21]  Oliver Brock,et al.  Cascaded filter approach to multi-objective control , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[22]  Babak Nadjar Araabi,et al.  Brain emotional learning based intelligent controller applied to neurofuzzy model of micro-heat exchanger , 2007, Expert Syst. Appl..

[23]  Gordon B. Moskowitz,et al.  Preconscious effects of temporary goals on attention , 2002 .

[24]  Mohammad Javad Yazdanpanah,et al.  Applying brain emotional learning algorithm for multivariable control of HVAC systems , 2006, J. Intell. Fuzzy Syst..