TLCS-Anchor: a new anchor strategy for detecting small-scale unmanned aerial vehicle

Faster R-CNN is a general-purpose detection algorithm that performs well in most cases. However, Faster R-CNN performs poorly on detecting small-scale UAVs. In order to improve the detection performance for small-scale UAVs, a new anchor strategy (TLCS-Anchor) which could be adopted by Faster R-CNN is proposed in this paper. Firstly, the anchor templates are designed to be suitable for the UAV dataset by using the clustering method so that the aspect ratios and scales for anchors are more targeted to UAVs. Then, a new compensation strategy of anchors is proposed to help detect small-scale UAVs in this paper, which could not only improve the number of anchors matched with the UAVs, but also alleviate the problem that small-scale UAVs can’t match with enough anchors to some extent. Experimental results show that TLCS-Anchor can help improve the detection performance for UAVs, especially for small-scale UAVs. In theory, TLCS-Anchor can also be used to detect other small-scale targets.

[1]  Р Ю Чуйков,et al.  Обнаружение транспортных средств на изображениях загородных шоссе на основе метода Single shot multibox Detector , 2017 .

[2]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Sergio Guadarrama,et al.  Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Leszek Gasieniec,et al.  Proceedings of the eighteenth annual ACM-SIAM symposium on discrete algorithms , 2007, SODA 2007.

[6]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[9]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[10]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.