论文信息 - Retaining Learned Behavior During Real-Time Neuroevolution

Retaining Learned Behavior During Real-Time Neuroevolution

Creating software-controlled agents in videogames who can learn and adapt to player behavior is a difficult task. Using the real-time NeuroEvolution of Augmenting Topologies (rtNEAT) method for evolving increasingly complex artificial neural networks in real-time has been shown to be an effective way of achieving behaviors beyond simple scripted character behavior. In NERO, a videogame built to showcase the features of rtNEAT, agents are trained in various tasks, including shooting enemies, avoiding enemies, and navigating around obstacles. Training the neural networks to perform a series of distinct tasks can be problematic: the longer they train in a new task, the more likely it is that they will forget their skills. This paper investigates a technique for increasing the probability that a population will remember old skills as they learn new ones. By setting aside the most fit individuals at a time when a skill has been learned and then occasionally introducing their offspring into the population, the skill is retained. How large to make this milestone pool of individuals and how often to insert the offspring of the milestone pool into the general population is the primary focus of this paper.

[1] M. Mateas,et al. Architecture, Authorial Idioms and Early Observations of the Interactive Drama Facade , 2002 .

[2] Stefan Schaal,et al. Arm and Hand Movement Control , 2002 .

[3] Risto Miikkulainen,et al. Evolving a Roving Eye for Go , 2004, GECCO.

[4] Risto Miikkulainen,et al. Evolving Complex Othello Strategies Using Marker-Based Genetic Encoding ofNeural Networks , 1993 .

[5] Stacy Marsella,et al. Interactive pedagogical drama , 2000, AGENTS '00.

[6] Dona Cooper. Writing Great Screenplays for Film and TV , 1994 .

[7] Kenneth O. Stanley,et al. The NERO Real-time Video Game , 2004 .

[8] Michael I. Jordan,et al. Boltzmann Chains and Hidden Markov Models , 1994, NIPS.

[9] Fred Charles,et al. Sex, lies, and video games: an interactive storytelling prototype , 2002, AAAI 2002.

[10] A. Rollett,et al. The Monte Carlo Method , 2004 .

[11] Roberto Pieraccini,et al. A stochastic model of computer-human interaction for learning dialogue strategies , 1997, EUROSPEECH.

[12] J. William Murdock. Semi-Formal Functional Software Modeling with TMK , 2000 .

[13] W. S. Reilly,et al. Believable Social and Emotional Agents. , 1996 .

[14] Risto Miikkulainen,et al. Efficient evolution of neural networks through complexification , 2004 .

[15] Eric A. Hansen,et al. Space-Efficient Memory-Based Heuristics , 2004, AAAI.

[16] Jean-Claude Latombe,et al. Interactive manipulation planning for animated characters , 2000, Proceedings the Eighth Pacific Conference on Computer Graphics and Applications.

[17] Robert Michael Young,et al. Managing interaction between users and agents in a multi-agent storytelling environment , 2003, AAMAS '03.

[18] Benjamin Geisler,et al. An Empirical Study of Machine Learning Algorithms Applied to Modeling Player Behavior in a "First Person Shooter" Video Game , 2002 .

[19] David E. Goldberg,et al. Genetic Algorithms with Sharing for Multimodalfunction Optimization , 1987, ICGA.

[20] Steven M. LaValle,et al. RRT-connect: An efficient approach to single-query path planning , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[21] Joelle Pineau,et al. Spoken Dialogue Management Using Probabilistic Reasoning , 2000, ACL.

[22] Andrew Stern,et al. Natural Language Understanding in Façade: Surface-Text Processing , 2004, TIDSE.

[23] Norman I. Badler,et al. Real-Time Inverse Kinematics of the Human Arm , 1996, Presence: Teleoperators & Virtual Environments.

[24] Emmanuel Roche,et al. Finite-State Language Processing , 1997 .

[25] Robert Zubek,et al. Character Participation in Social Interaction , 2004 .

[26] Christopher W. Geib,et al. SodaJack: An Architecture For Agents That Search For And Manipulate Objects , 1994 .

[27] Glorianna Davenport,et al. Narrative guidance of interactivity , 1995 .

[28] N. Magnenat-Thalmann,et al. Interaction with 3-D Objects , 2004 .

[29] Curt Bererton,et al. State Estimation for Game AI Using Particle Filters , 2004 .

[30] Daniel Thalmann,et al. PII: S0925-7721(99)00046-2 , 2000 .

[31] Joseph Bates,et al. Guiding interactive drama , 1997 .

[32] Aaron Bryan Loyall,et al. Believable agents: building interactive personalities , 1997 .

[33] Mark H. Overmars,et al. A Comparative Study of Probabilistic Roadmap Planners , 2002, WAFR.

[34] F. Sebastian Grassia,et al. Practical Parameterization of Rotations Using the Exponential Map , 1998, J. Graphics, GPU, & Game Tools.

[35] Michael F. McTear,et al. Modelling spoken dialogues with state transition diagrams: experiences with the CSLU toolkit , 1998, ICSLP.

[36] Kenneth O. Stanley and Bobby D. Bryant and Risto Miikkulainen,et al. The NERO Real-time Video Game , 2004 .

[37] Andreas Stolcke,et al. Finding consensus among words: lattice-based word error minimization , 1999, EUROSPEECH.

[38] Martin Shubik,et al. Games that people play , 1992, Nature.

[39] Michael Lebowitz. Story-telling as planning and learning , 1985 .

[40] David B. Fogel,et al. Evolution, neural networks, games, and intelligence , 1999, Proc. IEEE.

[41] Michael van Lent,et al. An Explainable Artificial Intelligence System for Small-unit Tactical Behavior , 2004, AAAI.

[42] Jean-Claude Latombe,et al. Planning motions with intentions , 1994, SIGGRAPH.

[43] David Blair,et al. Tools for an Interactive Virtual Cinema , 1997, Creating Personalities for Synthetic Actors.

[44] Wucius Wong,et al. Principles of Form and Design , 1993 .

[45] J. Murray. Hamlet on the Holodeck , 1997 .

[46] Richard E. Korf,et al. Real-Time Heuristic Search , 1990, Artif. Intell..

[47] Masayuki Nakajima,et al. Database guided computer animation of human grasping using forward and inverse kinematics , 1999, Comput. Graph..

[48] Ken Perlin,et al. A platform for affective agent research , 2004 .

[49] Herbert A. Simon,et al. The Sciences of the Artificial , 1970 .

[50] Joelle Pineau,et al. A Hierarchical Approach to POMDP Planning and Execution , 2004 .

[51] Justine Cassell,et al. Avatar-augmented online conversation , 2003 .

[52] Brian Magerko,et al. AI Characters and Directors for Interactive Computer Games , 2004, AAAI.

[53] Sebastian Thrun,et al. Decentralized Sensor Fusion with Distributed Particle Filters , 2002, UAI.

[54] John E. Laird,et al. Creating Human-like Synthetic Characters with Multiple Skill Levels: A Case Study using the Soar Quakebot , 2001 .

[55] James H. Martin,et al. Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[56] Mark Collins,et al. AI Techniques for Game Programming , 2002 .

[57] Dan Klein,et al. Accurate Unlexicalized Parsing , 2003, ACL.

[58] E. J.,et al. ON THE COMPLEXITY OF MOTION PLANNING FOR MULTIPLE INDEPENDENT OBJECTS ; PSPACE HARDNESS OF THE " WAREHOUSEMAN ' S PROBLEM " . * * ) , 2022 .

[59] Risto Miikkulainen,et al. Real-time neuroevolution in the NERO video game , 2005, IEEE Transactions on Evolutionary Computation.

[60] Robert McKee,et al. Story: Substance, Structure, Style, and the Principles of Screenwriting , 1997 .

[61] Andrew Stern,et al. A Behavior Language: Joint Action and Behavioral Idioms , 2004, Life-like characters.

[62] Patrick Pérez,et al. Maintaining multimodality through mixture tracking , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[63] Dana S. Nau,et al. New Advances in GraphHTN: Identifying Independent Subproblems in Large HTN Domains , 2000, AIPS.

[64] T. Başar,et al. A New Approach to Linear Filtering and Prediction Problems , 2001 .

[65] Gary Izzo,et al. The Art of Play: The New Genre of Interactive Theatre , 1997 .

[66] A. Mukerjee,et al. Metrics for evaluation of behaviour-based robotic systems , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[67] Johanna D. Moore,et al. Decomposition and Causality in Partial-order Planning , 1994, AIPS.

[68] Mitul Saha,et al. Exact Collision Checking of Robot Paths , 2002, WAFR.

[69] Dilek Z. Hakkani-Tür,et al. A general algorithm for word graph matrix decomposition , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[70] Risto Miikkulainen,et al. Evolving Neural Networks through Augmenting Topologies , 2002, Evolutionary Computation.

[71] S. Field. Screenplay: The Foundations of Screenwriting , 1979 .

[72] Andrew Stern,et al. A Behavior Language for Story-Based Believable Agents , 2002, IEEE Intell. Syst..

[73] Daniel S. Weld,et al. UCPOP: A Sound, Complete, Partial Order Planner for ADL , 1992, KR.

[74] S. LaValle. Rapidly-exploring random trees : a new tool for path planning , 1998 .

[75] Norman I. Badler,et al. Real-time reach planning for animated characters using hardware acceleration , 2003, Proceedings 11th IEEE International Workshop on Program Comprehension.

[76] Michael D. Alder,et al. Introducing MegaHAL , 1998, CoNLL.

[77] Jonathan Schaeffer,et al. Efficiently Searching the 15-Puzzle , 1994 .

[78] Robert Michael Young,et al. An intent-driven planner for multi-agent story generation , 2004, Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, 2004. AAMAS 2004..

[79] Michael P. Wellman,et al. Probabilistic State-Dependent Grammars for Plan Recognition , 2000, UAI.

[80] Michael L. Mauldin,et al. CHATTERBOTS, TINYMUDS, and the Turing Test: Entering the Loebner Prize Competition , 1994, AAAI.

[81] David W. Aha,et al. Integrating Learning in Interactive Gaming Simulators , 2004 .

[82] R. Michael Young,et al. An architecture for integrating plan-based behavior generation with interactive game environments , 2004, J. Game Dev..

[83] Damian A. Isla,et al. Creature Smarts: The Art and Architecture of a Virtual Brain , 2001 .

[84] Jordan B. Pollack,et al. Coevolution of a Backgammon Player , 1996 .

[85] Richard Fikes,et al. STRIPS: A New Approach to the Application of Theorem Proving to Problem Solving , 1971, IJCAI.

[86] Stephen Young. Probabilistic methods in spoken–dialogue systems , 2000, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[87] Ronan Boulic,et al. Task-priority formulations for the kinematic control of highly redundant articulated structures , 1998, Proceedings. 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, Practice and Applications (Cat. No.98CH36190).

[88] Maja J. Mataric,et al. Coordinating mobile robot group behavior using a model of interaction dynamics , 1999, AGENTS '99.

[89] Craig A. Knoblock. Generating Parallel Execution Plans with a Partial-order Planner , 1994, AIPS.

[90] Brenda Kay Laurel. Toward the design of a computer-based interactive fantasy system / , 1986 .

[91] Arnav Jhala,et al. Branching Storylines in Virtual Reality Environments for Leadership Development , 2004, AAAI.

[92] Daniel Thalmann,et al. Planning Collision‐Free Reaching Motions for Interactive Object Manipulation and Grasping , 2003, Comput. Graph. Forum.

[93] James C. Lester,et al. Increasing believability in animated pedagogical agents , 1997, AGENTS '97.

[94] Paul J. Bailey. Searching for Storiness: Story-Generation from a Reader's Perspective , 1999 .

[95] Bruce Blumberg,et al. Multi-level Control for Animated Autonomous Agents: Do the Right Thing...Oh, Not That , 1997, Creating Personalities for Synthetic Actors.

[96] W. Lewis Johnson. Interaction tactics for socially intelligent pedagogical agents , 2003, IUI '03.

[97] Hoon Kim,et al. Monte Carlo Statistical Methods , 2000, Technometrics.

[98] Jaime G. Carbonell,et al. Interactive drama, art and artificial intelligence , 2002 .

[99] Jay Earley,et al. An efficient context-free parsing algorithm , 1970, Commun. ACM.