论文信息 - AirCapRL: Autonomous Aerial Human Motion Capture Using Deep Reinforcement Learning

AirCapRL: Autonomous Aerial Human Motion Capture Using Deep Reinforcement Learning

In this letter, we introduce a deep reinforcement learning (DRL) based multi-robot formation controller for the task of autonomous aerial human motion capture (MoCap). We focus on vision-based MoCap, where the objective is to estimate the trajectory of body pose, and shape of a single moving person using multiple micro aerial vehicles. State-of-the-art solutions to this problem are based on classical control methods, which depend on hand-crafted system, and observation models. Such models are difficult to derive, and generalize across different systems. Moreover, the non-linearities, and non-convexities of these models lead to sub-optimal controls. In our work, we formulate this problem as a sequential decision making task to achieve the vision-based motion capture objectives, and solve it using a deep neural network-based RL method. We leverage proximal policy optimization (PPO) to train a stochastic decentralized control policy for formation control. The neural network is trained in a parallelized setup in synthetic environments. We performed extensive simulation experiments to validate our approach. Finally, real-robot experiments demonstrate that our policies generalize to real world conditions.

[1] Jonathan P. How,et al. Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2] Alexander Domahidi,et al. Real-Time Motion Planning for Aerial Videography With Real-Time With Dynamic Obstacle Avoidance and Viewpoint Optimization , 2017, IEEE Robotics and Automation Letters.

[3] Sebastian Scherer,et al. Autonomous aerial cinematography in unstructured environments with learned artistic decision‐making , 2019, J. Field Robotics.

[4] Evangelos A. Theodorou,et al. Aggressive Perception-Aware Navigation Using Deep Optical Flow Dynamics and PixelMPC , 2020, IEEE Robotics and Automation Letters.

[5] Jia Pan,et al. Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios , 2018, ArXiv.

[6] Sergey Levine,et al. Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[7] Michael J. Black,et al. Markerless Outdoor Human Motion Capture Using Multiple Autonomous Micro Aerial Vehicles , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[8] Davide Scaramuzza,et al. PAMPC: Perception-Aware Model Predictive Control for Quadrotors , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9] Heinrich H. Bülthoff,et al. Active Perception Based Formation Control for Multiple Aerial Vehicles , 2019, IEEE Robotics and Automation Letters.

[10] Ming C. Lin,et al. Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11] H. Jin Kim,et al. Online Trajectory Generation of a MAV for Chasing a Moving Target in 3D Dense Environments , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12] Qionghai Dai,et al. FlyCap: Markerless Motion Capture Using Multiple Autonomous Flying Cameras , 2016, IEEE Transactions on Visualization and Computer Graphics.

[13] Javier Alonso-Mora,et al. Flycon: real-time environment-independent multi-view human pose estimation with aerial vehicles , 2019, ACM Trans. Graph..

[14] Jia Pan,et al. Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios , 2020, Int. J. Robotics Res..

[15] Heinrich H. Bülthoff,et al. Deep Neural Network-Based Cooperative Visual Tracking Through Multiple Micro Aerial Vehicles , 2018, IEEE Robotics and Automation Letters.

[16] Kamalakar Karlapalem,et al. Motion Planning for Multi-Mobile-Manipulator Payload Transport Systems , 2019, 2019 IEEE 15th International Conference on Automation Science and Engineering (CASE).

[17] Michael J. Black,et al. Learning to Reconstruct 3D Human Pose and Shape via Model-Fitting in the Loop , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[18] Yaser Sheikh,et al. OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19] Simon J. D. Prince,et al. Computer Vision: Models, Learning, and Inference , 2012 .

[20] Taeyoung Lee,et al. Geometric tracking control of a quadrotor UAV on SE(3) , 2010, 49th IEEE Conference on Decision and Control (CDC).