论文信息 - Rank-Smoothed Pairwise Learning In Perceptual Quality Assessment

Rank-Smoothed Pairwise Learning In Perceptual Quality Assessment

Conducting pairwise comparisons is a widely used approach in curating human perceptual preference data. Typically raters are instructed to make their choices according to a specific set of rules that address certain dimensions of image quality and aesthetics. The outcome of this process is a dataset of sampled image pairs with their associated empirical preference probabilities. Training a model on these pairwise preferences is a common deep learning approach. However, optimizing by gradient descent through mini-batch learning means that the “global” ranking of the images is not explicitly taken into account. In other words, each step of the gradient descent relies only on a limited number of pairwise comparisons. In this work, we demonstrate that regularizing the pairwise empirical probabilities with aggregated rankwise probabilities leads to a more reliable training loss. We show that training a deep image quality assessment model with our rank-smoothed loss consistently improves the accuracy of predicting human preferences.

[1] Naila Murray,et al. AVA: A large-scale database for aesthetic visual analysis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Peyman Milanfar,et al. NIMA: Neural Image Assessment , 2017, IEEE Transactions on Image Processing.

[4] Gregory N. Hullender,et al. Learning to rank using gradient descent , 2005, ICML.

[5] Nikolay N. Ponomarenko,et al. Image database TID2013: Peculiarities, results and perspectives , 2015, Signal Process. Image Commun..

[6] D. Amnon Silverstein,et al. Efficient method for paired comparison , 2001, J. Electronic Imaging.

[7] Chih-Jen Lin,et al. Ranking individuals by group comparisons , 2006, ICML.

[8] O. Dykstra. Rank Analysis of Incomplete Block Designs: A Method of Paired Comparisons Employing Unequal Repetitions on Pairs , 1960 .

[9] Devavrat Shah,et al. Iterative ranking from pair-wise comparisons , 2012, NIPS.

[10] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Zhou Wang,et al. dipIQ: Blind Image Quality Assessment by Learning-to-Rank Discriminable Image Pairs , 2017, IEEE Transactions on Image Processing.

[12] Alan C. Bovik,et al. Massive Online Crowdsourced Study of Subjective and Objective Picture Quality , 2015, IEEE Transactions on Image Processing.

[13] Rafal Mantiuk,et al. Comparison of Four Subjective Methods for Image Quality Assessment , 2012, Comput. Graph. Forum.

[14] Nikolay N. Ponomarenko,et al. TID2008 – A database for evaluation of full-reference visual quality assessment metrics , 2004 .

[15] Peyman Milanfar,et al. Learned perceptual image enhancement , 2017, 2018 IEEE International Conference on Computational Photography (ICCP).

[16] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[17] Lei Zhang,et al. Deep Convolutional Neural Models for Picture-Quality Prediction: Challenges and Solutions to Data-Driven Image Quality Assessment , 2017, IEEE Signal Processing Magazine.