Paper information - Improved generic categorical object detection fusing depth cue with 2D appearance and shape features


We propose a novel 3D depth cue-based generic categorical object detection model, which extends our previous 2D feature-based detection method to handle objects under severe occlusion. Because the model integrates a complementary 3D depth cue with 2D appearance and shape features, it significantly improves the detection performance and robustness of the existing 2D-based object detection system. The depth cue is derived from a disparity map, which is obtained via stereo matching of input image pairs. The disparity map is clustered into different layers; appearance and shape features are then extracted at each layer and matched against the learnt 2D codebooks. Finally, the detection hypotheses from all layers are merged to generate the final detection result. Experimental results show that the 3D depth cue-based model achieves a 2.57% gain in average recall rate over the 2D feature-based method on our collected stereo car-side dataset.
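The layering step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which clustering algorithm is used, so the code below assumes a plain 1D k-means over disparity values as a stand-in. The function name `cluster_disparity_layers`, its parameters, and the convention that non-positive disparities mark invalid pixels are all hypothetical, chosen only for illustration.

```python
def cluster_disparity_layers(disparity, n_layers=3, n_iters=20):
    """Label each pixel of a 2D disparity map (list of rows) with a layer index.

    Illustrative stand-in, not the paper's method: 1D k-means on disparity
    values. Pixels with non-positive (or NaN) disparity are treated as
    invalid and labelled -1. Layer 0 holds the smallest disparities.
    """
    values = sorted(d for row in disparity for d in row if d > 0)
    if not values:
        return [[-1 for _ in row] for row in disparity], []

    # Initialise centers at evenly spaced positions in the sorted values.
    if n_layers == 1:
        centers = [values[len(values) // 2]]
    else:
        last = len(values) - 1
        centers = [values[round(k * last / (n_layers - 1))]
                   for k in range(n_layers)]

    def nearest(d):
        # Index of the center closest to disparity d (ties go to the lower index).
        return min(range(n_layers), key=lambda k: abs(d - centers[k]))

    for _ in range(n_iters):
        groups = [[] for _ in range(n_layers)]
        for d in values:
            groups[nearest(d)].append(d)
        # Empty clusters keep their previous center.
        updated = [sum(g) / len(g) if g else c
                   for g, c in zip(groups, centers)]
        if updated == centers:
            break
        centers = updated

    labels = [[nearest(d) if d > 0 else -1 for d in row] for row in disparity]
    return labels, centers


if __name__ == "__main__":
    toy_map = [[5, 5, 20, 20, 60, 60],
               [5, 6, 21, 20, 61, 0]]
    layer_labels, layer_centers = cluster_disparity_layers(toy_map, n_layers=3)
    for row in layer_labels:
        print(row)
```

In a pipeline like the one the abstract outlines, each resulting label mask would select the image region passed to per-layer feature extraction and codebook matching; those later stages are not sketched here because the abstract gives no detail about them.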

Si-Yu Xia | A. Kai Qin | Hong Pan | Yaping Zhu
