Multi-class fruit detection based on image region selection and improved object proposals

Abstract This paper proposes a novel approach for multi-class fruit detection using effective image region selection and improved object proposals. Five complementary features, namely local binary patterns (LBPs), histograms of oriented gradient (HOGs), LBP based on magnitude of Gabor feature (GaborLBP), global color histograms, and global shape features, are utilized to improve the detection accuracy. An optimal combination of regions (i.e., features) is selected using an image region selection method based on feature similarity and cross-validation accuracy. To combine the strength of the five complementary features, a weighted score-level feature fusion approach based on the average confidence coefficient is used. Moreover, during detection, an object proposal method, “EdgeBoxes,” is improved by calibrating scores considering the image region similarity to generate windows that are likely to contain fruits and to speed up detection. The experimental results show that the image region selection method can select an effective and optimal combination of regions, which exhibits better recognition accuracy than the method without image region selection. This proposed method demonstrates a low miss rate (0.0377) at 0.0682 false positives per image (FPPI) and outperforms some baselines: multi-class fruit detection using the traditional sliding window mechanism, the well-known deformable parts model (DPM) method, convolutional neural networks features (CNN) with support vector machine (SVM) for classification (CNN + SVM), cascade detection framework and faster RCNN in terms of the miss rate vs. FPPI and precision vs. recall curves. The proposed multi-class fruit detection can detect multi-class and multiple fruits in a variety of sizes, backgrounds, angles, locations, and image conditions.

[1]  Yudong Zhang,et al.  Fruit classification using computer vision and feedforward neural network , 2014 .

[2]  Luc Van Gool,et al.  The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[3]  Seyed Hadi Mirisaee,et al.  A new method for fruits recognition system , 2009, 2009 International Conference on Electrical Engineering and Informatics.

[4]  Yanwen Chong,et al.  MutualCascade method for pedestrian detection , 2014, Neurocomputing.

[5]  C. Lawrence Zitnick,et al.  Edge Boxes: Locating Object Proposals from Edges , 2014, ECCV.

[6]  Christoph H. Lampert,et al.  Beyond sliding windows: Object localization by efficient subwindow search , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[8]  R. N. Shebiah,et al.  Fruit Recognition using Color and Texture Features , 2010 .

[9]  Yudong Zhang,et al.  Classification of Fruits Using Computer Vision and a Multiclass Support Vector Machine , 2012, Sensors.

[10]  Yael Edan,et al.  Computer vision for fruit harvesting robots - state of the art and challenges ahead , 2012, Int. J. Comput. Vis. Robotics.

[11]  Anand Singh Jalal,et al.  Robust Approach for Fruit and Vegetable Classification , 2012 .

[12]  Thomas Deselaers,et al.  Measuring the Objectness of Image Windows , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  L. Yang,et al.  A fruit recognition method for automatic harvesting , 2007, 2007 14th International Conference on Mechatronics and Machine Vision in Practice.

[14]  Syed Khaleel Ahmed,et al.  Classification of fruits using Probabilistic Neural Networks - Improvement using color features , 2011, TENCON 2011 - 2011 IEEE Region 10 Conference.

[15]  Andrew Zisserman,et al.  Multiple kernels for object detection , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[16]  Vinod Chandran,et al.  Random Gabor based templates for facial expression recognition in images with facial occlusion , 2014, Neurocomputing.

[17]  Stefan Carlsson,et al.  CNN Features Off-the-Shelf: An Astounding Baseline for Recognition , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[18]  Nikolaos Mavridis,et al.  Image-based date fruit classification , 2012, 2012 IV International Congress on Ultra Modern Telecommunications and Control Systems.

[19]  Mei-Chen Yeh,et al.  Fast Human Detection Using a Cascade of Histograms of Oriented Gradients , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[20]  Esa Rahtu,et al.  Generating Object Segmentation Proposals Using Global and Local Search , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Philip H. S. Torr,et al.  BING: Binarized normed gradients for objectness estimation at 300fps , 2014, Computational Visual Media.

[23]  Manjunath V. Joshi,et al.  Fruit Detection using Improved Multiple Features based Algorithm , 2011 .

[24]  Harsh S Holalad,et al.  An FPGA based Efficient Fruit Recognition System Using Minimum Distance Classifier , 2012 .

[25]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[26]  Weiming Shen,et al.  A new pedestrian detection method based on combined HOG and LSS features , 2015, Neurocomputing.

[27]  Amir Alipasandi,et al.  Classification of three varieties of peach fruit using artificial neural network assisted with image processing techniques. , 2013 .

[28]  Bhanu Pratap,et al.  Development of ANN based Efficient Fruit Recognition Technique , 2014 .

[29]  Cordelia Schmid,et al.  Combining efficient object localization and image classification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[30]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Bernt Schiele,et al.  How good are detection proposals, really? , 2014, BMVC.

[32]  Matti Pietikäinen,et al.  Face Recognition with Local Binary Patterns , 2004, ECCV.

[33]  Jianping Yin,et al.  Local binary pattern (LBP) and local phase quantization (LBQ) based on Gabor filter for face representation , 2013, Neurocomputing.

[34]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[36]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[37]  Philip H. S. Torr,et al.  Object Proposal Generation Using Two-Stage Cascade SVMs , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Lijuan Cao,et al.  A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine , 2003, Neurocomputing.

[39]  I. Song,et al.  Working Set Selection Using Second Order Information for Training Svm, " Complexity-reduced Scheme for Feature Extraction with Linear Discriminant Analysis , 2022 .

[40]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[41]  Hong Yan,et al.  An Empirical Study on the Characteristics of Gabor Representations for Face Recognition , 2009, Int. J. Pattern Recognit. Artif. Intell..

[42]  Takashi Kataoka,et al.  Fruit detection system and an end effector for robotic harvesting of Fuji apples , 2010 .

[43]  Yanwen Chong,et al.  Float Cascade Method for Pedestrian Detection , 2012, ICIC.

[44]  Jacques Wainer,et al.  Automatic fruit and vegetable classification from images , 2010 .