VisDrone-MOT2019: The Vision Meets Drone Multiple Object Tracking Challenge Results

The Vision Meets Drone Multiple Object Tracking (MOT) Challenge 2019 is the second annual event focused on evaluating multi-object tracking algorithms on drones, held in conjunction with the 17th International Conference on Computer Vision (ICCV 2019). We present the results of 12 submitted MOT algorithms on the collected drone-based dataset. We also report the results of 6 state-of-the-art MOT algorithms and provide a comprehensive analysis and discussion of all results. The results of all submissions are publicly available at http://www.aiskyeye.com/. The challenge results show that MOT on drones is far from solved. We believe the challenge can substantially advance research and development in MOT on drone platforms.

Yong Wang | Martin Lauer | Dong Wang | Qingming Huang | Guna Seetharaman | Xinyu Zhang | Xin Chen | Guorong Li | Hailin Shi | Long Chen | Noor M. Al-Shakarji | Kannappan Palaniappan | Brejesh Lall | Vinay Kaushik | Haibin Ling | Longyin Wen | Dawei Du | Yanting Zhang | Robert Laganiere | Mikael Nilsson | Filiz Bunyak | Qinghua Hu | Yanyun Zhao | Huchuan Lu | Chang Liu | Liefeng Bo | Zhaotang Chen | Lu Ding | Haotian Zhang | Yue Zhang | Rui Zhu | Guizhong Liu | Zhipeng Luo | Feng Ni | Chunhui Zhang | Pengfei Zhu | Shuhao Chen | Jenq-Neng Hwang | Tao Peng | Jiayu Zheng | Håkan Ardö | Jiatong Mu | Yuehan Yao | Wei Tian | Siyang Pan | Gaoang Wang | Zhihang Tong | Prerana Mukherjee | Weiqiang Li | Xiao Bian | Wei Shi | Zhuojin Sun | Xinyao Wang | Jinrong Hu | Yuduo Song | Ajit Jadhav | Bing Dong | Hongyang Yu | Zhenyu Xu | Zhibin Xiao
