Paper information - Policy Generation for Continuous-time Stochastic Domains with Concurrency

We adopt the framework of Younes, Musliner, & Simmons for planning with concurrency in continuous-time stochastic domains. Our contribution is a set of concrete techniques for policy generation, failure analysis, and repair. These techniques have been implemented in TEMPASTIC, a novel temporal probabilistic planner, and we demonstrate the performance of the planner on two variations of a transportation domain with concurrent actions and exogenous events. TEMPASTIC makes use of a deterministic temporal planner to generate initial policies. Policies are represented using decision trees, and we use incremental decision tree induction to efficiently incorporate changes suggested by the failure analysis.

Håkan L. S. Younes | Reid G. Simmons

[1] Carlos Guestrin, et al. Multiagent Planning with Factored MDPs, 2001, NIPS.

[2] Håkan L. S. Younes, et al. On the Role of Ground Actions in Refinement Planning, 2002, AIPS.

[3] David J. Musliner, et al. Toward Decision-Theoretic CIRCA with Application to Real-Time Computer Security Control, 2002.

[4] Håkan L. S. Younes. Extending PDDL to Model Stochastic Decision Processes, 2003.

[5] Reid G. Simmons, et al. A Theory of Debugging Plans and Interpretations, 1988, AAAI.

[6] David E. Smith, et al. Planning Under Continuous Time and Resource Uncertainty: A Challenge for AI, 2002, AIPS Workshop on Planning for Temporal Domains.

[7] John L. Bresina, et al. Anytime Synthetic Projection: Maximizing the Probability of Goal Satisfaction, 1990, AAAI.

[8] J. Ross Quinlan, et al. Induction of Decision Trees, 1986, Machine Learning.

[9] Michael L. Littman, et al. Exact Solutions to Time-Dependent MDPs, 2000, NIPS.

[10] Blai Bonet, et al. A Robust and Fast Action Selection Mechanism for Planning, 1997, AAAI/IAAI.

[11] P. Glynn. A GSMP formalism for discrete event systems, 1989, Proc. IEEE.