AirCapRL: Autonomous Aerial Human Motion Capture Using Deep Reinforcement Learning

In this letter, we introduce a deep reinforcement learning (DRL) based multi-robot formation controller for the task of autonomous aerial human motion capture (MoCap). We focus on vision-based MoCap, where the objective is to estimate the trajectory of body pose, and shape of a single moving person using multiple micro aerial vehicles. State-of-the-art solutions to this problem are based on classical control methods, which depend on hand-crafted system, and observation models. Such models are difficult to derive, and generalize across different systems. Moreover, the non-linearities, and non-convexities of these models lead to sub-optimal controls. In our work, we formulate this problem as a sequential decision making task to achieve the vision-based motion capture objectives, and solve it using a deep neural network-based RL method. We leverage proximal policy optimization (PPO) to train a stochastic decentralized control policy for formation control. The neural network is trained in a parallelized setup in synthetic environments. We performed extensive simulation experiments to validate our approach. Finally, real-robot experiments demonstrate that our policies generalize to real world conditions.

[1]  Jonathan P. How,et al.  Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  Alexander Domahidi,et al.  Real-Time Motion Planning for Aerial Videography With Real-Time With Dynamic Obstacle Avoidance and Viewpoint Optimization , 2017, IEEE Robotics and Automation Letters.

[3]  Sebastian Scherer,et al.  Autonomous aerial cinematography in unstructured environments with learned artistic decision‐making , 2019, J. Field Robotics.

[4]  Evangelos A. Theodorou,et al.  Aggressive Perception-Aware Navigation Using Deep Optical Flow Dynamics and PixelMPC , 2020, IEEE Robotics and Automation Letters.

[5]  Jia Pan,et al.  Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios , 2018, ArXiv.

[6]  Sergey Levine,et al.  Learning deep control policies for autonomous aerial vehicles with MPC-guided policy search , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Michael J. Black,et al.  Markerless Outdoor Human Motion Capture Using Multiple Autonomous Micro Aerial Vehicles , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[8]  Davide Scaramuzza,et al.  PAMPC: Perception-Aware Model Predictive Control for Quadrotors , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[9]  Heinrich H. Bülthoff,et al.  Active Perception Based Formation Control for Multiple Aerial Vehicles , 2019, IEEE Robotics and Automation Letters.

[10]  Ming C. Lin,et al.  Shape-Aware Human Pose and Shape Reconstruction Using Multi-View Images , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  H. Jin Kim,et al.  Online Trajectory Generation of a MAV for Chasing a Moving Target in 3D Dense Environments , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12]  Qionghai Dai,et al.  FlyCap: Markerless Motion Capture Using Multiple Autonomous Flying Cameras , 2016, IEEE Transactions on Visualization and Computer Graphics.

[13]  Javier Alonso-Mora,et al.  Flycon: real-time environment-independent multi-view human pose estimation with aerial vehicles , 2019, ACM Trans. Graph..

[14]  Jia Pan,et al.  Distributed multi-robot collision avoidance via deep reinforcement learning for navigation in complex scenarios , 2020, Int. J. Robotics Res..

[15]  Heinrich H. Bülthoff,et al.  Deep Neural Network-Based Cooperative Visual Tracking Through Multiple Micro Aerial Vehicles , 2018, IEEE Robotics and Automation Letters.

[16]  Kamalakar Karlapalem,et al.  Motion Planning for Multi-Mobile-Manipulator Payload Transport Systems , 2019, 2019 IEEE 15th International Conference on Automation Science and Engineering (CASE).

[17]  Michael J. Black,et al.  Learning to Reconstruct 3D Human Pose and Shape via Model-Fitting in the Loop , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[18]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Simon J. D. Prince,et al.  Computer Vision: Models, Learning, and Inference , 2012 .

[20]  Taeyoung Lee,et al.  Geometric tracking control of a quadrotor UAV on SE(3) , 2010, 49th IEEE Conference on Decision and Control (CDC).