Paper Information - Skill Characterization Based on Betweenness

Skill Characterization Based on Betweenness

We present a characterization of a useful class of skills based on a graphical representation of an agent's interaction with its environment. Our characterization uses betweenness, a measure of centrality on graphs. It captures and generalizes (at least intuitively) the bottleneck concept, which has inspired many of the existing skill-discovery algorithms. Our characterization may be used directly to form a set of skills suitable for a given task. More importantly, it serves as a useful guide for developing incremental skill-discovery algorithms that do not rely on knowing or representing the interaction graph in its entirety.
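As an illustration of how betweenness can single out a bottleneck, here is a minimal sketch in Python. It makes several assumptions that do not come from the abstract: the interaction graph is a small, hypothetical, undirected and unweighted "two rooms" graph joined by a single `door` node, and scores are computed with standard (unweighted) betweenness using Brandes' algorithm. The names `betweenness` and `two_rooms` are made up for this example, and the sketch is not the authors' procedure.

```python
from collections import deque

def betweenness(adj):
    """Unweighted betweenness via Brandes' algorithm.

    adj maps each node to a list of its neighbours. Scores count ordered
    (source, target) pairs, so in an undirected graph every unordered pair
    contributes twice.
    """
    score = {v: 0.0 for v in adj}
    for s in adj:
        # Breadth-first search from s: count shortest paths (sigma) and
        # record each node's predecessors on those paths.
        sigma = {v: 0 for v in adj}
        dist = {v: -1 for v in adj}
        preds = {v: [] for v in adj}
        sigma[s], dist[s] = 1, 0
        order = []
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:              # w reached for the first time
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:   # v lies on a shortest path to w
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Accumulate dependencies, visiting nodes farthest from s first.
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1.0 + delta[w])
            if w != s:
                score[w] += delta[w]
    return score

# Hypothetical interaction graph: two fully connected "rooms" of three
# states each, linked only through a single doorway state.
two_rooms = {
    "a1": ["a2", "a3"],
    "a2": ["a1", "a3"],
    "a3": ["a1", "a2", "door"],
    "door": ["a3", "b1"],
    "b1": ["door", "b2", "b3"],
    "b2": ["b1", "b3"],
    "b3": ["b1", "b2"],
}

scores = betweenness(two_rooms)
bottleneck = max(scores, key=scores.get)  # node with the highest betweenness
```

In this toy graph every shortest path between the two rooms passes through `door`, so it receives the highest score, which matches the intuitive notion of a bottleneck. Whether such a node would make a useful subgoal for a particular task depends on details that the abstract does not spell out.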

Andrew G. Barto | Özgür Simsek

[1] Shie Mannor, et al. Q-Cut - Dynamic Discovery of Sub-goals in Reinforcement Learning, 2002, ECML.

[2] Alicia P. Wolfe, et al. Identifying useful subgoals in reinforcement learning by local graph partitioning, 2005, ICML.

[3] Automated Discovery of Options in Reinforcement Learning, 2003.

[4] Shie Mannor, et al. Dynamic abstraction in reinforcement learning via clustering, 2004, ICML.

[5] Andrew G. Barto, et al. Automatic Discovery of Subgoals in Reinforcement Learning using Diverse Density, 2001, ICML.

[6] U. Brandes. A faster algorithm for betweenness centrality, 2001.

[7] Saul Amarel, et al. On representations of problems of reasoning about actions, 1968.

[8] L. Freeman. Centrality in social networks conceptual clarification, 1978.

[9] Andrew G. Barto, et al. Using relative novelty to identify useful temporal abstractions in reinforcement learning, 2004, ICML.

[10] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition, 1999, J. Artif. Intell. Res.

[11] Doina Precup, et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning, 1999, Artif. Intell.

[12] Nuttapong Chentanez, et al. Intrinsically Motivated Reinforcement Learning, 2004, NIPS.

[13] Doina Precup, et al. Learning Options in Reinforcement Learning, 2002, SARA.

[14] Nuttapong Chentanez, et al. Intrinsically Motivated Learning of Hierarchical Collections of Skills, 2004.

[15] Richard S. Sutton, et al. Roles of Macro-Actions in Accelerating Reinforcement Learning, 1998.

[16] Doina Precup, et al. Temporal abstraction in reinforcement learning, 2000, ICML.

[17] Linton C. Freeman. A set of measures of centrality based upon betweenness, 1977.