论文信息 - Robot trajectory optimization using approximate inference

Robot trajectory optimization using approximate inference

The general stochastic optimal control (SOC) problem in robotics scenarios is often too complex to be solved exactly and in near real time. A classical approximate solution is to first compute an optimal (deterministic) trajectory and then solve a local linear-quadratic-gaussian (LQG) perturbation model to handle the system stochasticity. We present a new algorithm for this approach which improves upon previous algorithms like iLQG. We consider a probabilistic model for which the maximum likelihood (ML) trajectory coincides with the optimal trajectory and which, in the LQG case, reproduces the classical SOC solution. The algorithm then utilizes approximate inference methods (similar to expectation propagation) that efficiently generalize to non-LQG systems. We demonstrate the algorithm on a simulated 39-DoF humanoid robot.

Marc Toussaint | Marc Toussaint

[1] Arthur E. Bryson,et al. Applied Optimal Control , 1969 .

[2] Bonaventure Intercontinental,et al. ON DECISION AND CONTROL , 1985 .

[3] R. Stengel. Stochastic Optimal Control: Theory and Application , 1986 .

[4] Ross D. Shachter. Probabilistic Inference and Influence Diagrams , 1988, Oper. Res..

[5] M. K rn,et al. Stochastic Optimal Control , 1988 .

[6] Gregory F. Cooper,et al. A Method for Using Belief Networks as Influence Diagrams , 2013, UAI 1988.

[7] Yao-Chon Chen. Solving robot trajectory planning problems with uniform cubic B‐splines , 1991 .

[8] Ross D. Shachter,et al. Decision Making Using Probabilistic Inference Methods , 1992, UAI.

[9] Jianwei Zhang,et al. An Enhanced Optimization Approach for Generating Smooth Robot Trajectories in the Presence of Obstacles , 1995 .

[10] Geoffrey E. Hinton,et al. Using Expectation-Maximization for Reinforcement Learning , 1997, Neural Computation.

[11] Maximilian Schlemmer,et al. Real-Time Collision- Free Trajectory Optimization of Robot Manipulators via Semi-Infinite Parameter Optimization , 1998, Int. J. Robotics Res..