论文信息 - Scalable Detection of Offensive and Non-compliant Content / Logo in Product Images

Scalable Detection of Offensive and Non-compliant Content / Logo in Product Images

In e-commerce, product content, especially product images have a significant influence on a customer’s journey from product discovery to evaluation and finally, purchase decision. Since many e-commerce retailers sell items from other third-party marketplace sellers besides their own, the content published by both internal and external content creators needs to be monitored and enriched, wherever possible. Despite guidelines and warnings, product listings that contain offensive and non-compliant images continue to enter catalogs. Offensive and non-compliant content can include a wide range of objects, logos, and banners conveying violent, sexually explicit, racist, or promotional messages. Such images can severely damage the customer experience, lead to legal issues, and erode the company brand. In this paper, we present a computer vision driven offensive and non-compliant image detection system for extremely large image datasets. This paper delves into the unique challenges of applying deep learning to real-world product image data from retail world. We demonstrate how we resolve a number of technical challenges such as lack of training data, severe class imbalance, fine-grained class definitions etc. using a number of practical yet unique technical strategies. Our system combines state-of-the-art image classification and object detection techniques with budgeted crowd-sourcing to develop a solution customized for a massive, diverse, and constantly evolving product catalog.

[1] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[2] Rainer Lienhart,et al. Bundle min-hashing for logo recognition , 2013, ICMR '13.

[3] Bjørn Olstad,et al. Classifying offensive sites based on image content , 2004, Comput. Vis. Image Underst..

[4] Olivier Buisson,et al. Logo retrieval with a contrario visual query expansion , 2009, ACM Multimedia.

[5] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.

[6] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Shaogang Gong,et al. Deep Learning Logo Detection with Data Expansion by Synthesising Context , 2016, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[10] Raylene Reese,et al. How users read on the web , 2013 .

[11] Christian Eggert,et al. On the Benefit of Synthetic Data for Company Logo Detection , 2015, ACM Multimedia.

[12] Forrest N. Iandola,et al. DeepLogo: Hitting Logo Recognition with the Deep Neural Network Hammer , 2015, ArXiv.

[13] Ali Shokoufandeh,et al. Item Popularity Prediction in E-commerce Using Image Quality Feature Vectors , 2016, ArXiv.

[14] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] István Fehérvári,et al. Scalable Logo Recognition Using Proxies , 2018, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).

[16] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17] Thomas Brox,et al. FlowNet: Learning Optical Flow with Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[18] Shreyansh Gandhi,et al. A Smart System for Selection of Optimal Product Images in E-Commerce , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[19] Huicheng Zheng,et al. Blocking objectionable images: adult images and harmful symbols , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[20] Yannis Avrithis,et al. Scalable triangulation-based logo recognition , 2011, ICMR.

[21] Yi Li,et al. R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[22] Vijay Vasudevan,et al. Learning Transferable Architectures for Scalable Image Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[25] Alexei A. Efros,et al. Feature Matching , 2017, Encyclopedia of GIS.

[26] Robinson Piramuthu,et al. Is a picture really worth a thousand words?: - on the role of images in e-commerce , 2014, WSDM.

[27] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28] Rainer Lienhart,et al. Scalable logo recognition in real-world images , 2011, ICMR.

[29] P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[30] Matthieu Cord,et al. Pooling in image representation: The visual codeword point of view , 2013, Comput. Vis. Image Underst..

[31] Raimondo Schettini,et al. Deep Learning for Logo Recognition , 2017, Neurocomputing.