A cooperative learning approach to Mixed Performance Controller design: a behavioural viewpoint

Based on the idea of cooperative learning in multiagent systems, two approaches to design Mixed Performance Controllers (MPCs) are presented. Each controller is assumed as an expert agent. These agents share their knowledge to construct a MPC with mixture of initial controllers' properties. In the first approach, the knowledge of each agent is extracted and presented in a Q-table like tabular form. These tables are combined, and with greedy action selection policy the MPC is found. In the second approach, controllers are combined from a behavioural viewpoint. Stability of the MPC is formalised in a theorem with a sufficient condition. Simulation studies indicate that the combined controller shows a mixture of desired properties associated with individual controller agents.

[1]  Mark Klein,et al.  Towards a Systematic Repository of Knowledge about Managing Collaborative Design Conflicts , 2000, AID.

[2]  M. Wells,et al.  Learning with delayed rewards in Octopus , 1968, Zeitschrift für vergleichende Physiologie.

[3]  Majid Nili Ahmadabadi,et al.  Cooperative Q-learning: the knowledge sharing issue , 2001, Adv. Robotics.

[4]  Jakob Stoustrup,et al.  Multiobjective Control for Multivariable Systems with Mixed-sensitivity Specifications , 1997 .

[5]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[6]  Majid Nili Ahmadabadi,et al.  A Cooperative Learning Approach to Qualitative Mixed Performance Controllers , 2005, IC-AI.

[7]  Mark Klein,et al.  Supporting conflict resolution in cooperative design systems , 1991, IEEE Trans. Syst. Man Cybern..

[8]  Jeffrey B. Burl,et al.  Linear Optimal Control: H(2) and H (Infinity) Methods , 1998 .

[9]  Donald E. Kirk,et al.  Optimal Control Theory , 1970 .

[10]  Cooperative Q-Learning in a Team of Agents with Different Skills and Expertness , 2002 .

[11]  Bor-Sen Chen,et al.  Mixed H2/H∞ fuzzy output feedback control design for nonlinear dynamic systems: an LMI approach , 2000, IEEE Trans. Fuzzy Syst..

[12]  Majid Nili Ahmadabadi,et al.  On Stability Analysis of Behavioural Mixed Performance Controllers , 2005, IC-AI.

[13]  Bor-Sen Chen,et al.  Nonlinear mixed H 2/H ∞ control for robust tracking design of robotic systems , 1997 .

[14]  Mark Klein,et al.  Using Domain-Independent Exception Handling Services to Enable Robust Open Multi-Agent Systems: The Case of Agent Death , 2003, Autonomous Agents and Multi-Agent Systems.

[15]  M. N. Ahmadabadi,et al.  An extension of weighted strategy sharing in cooperative Q-learning for specialized agents , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[16]  José del R. Millán,et al.  Specialization in multi-agent systems through learning , 1997, Biological Cybernetics.