论文信息 - Understanding invariance via feedforward inversion of discriminatively trained classifiers

Understanding invariance via feedforward inversion of discriminatively trained classifiers

A discriminatively trained neural net classifier can fit the training data perfectly if all information about its input other than class membership has been discarded prior to the output layer. Surprisingly, past research has discovered that some extraneous visual detail remains in the logit vector. This finding is based on inversion techniques that map deep embeddings back to images. We explore this phenomenon further using a novel synthesis of methods, yielding a feedforward inversion model that produces remarkably high fidelity reconstructions, qualitatively superior to those of past efforts. When applied to an adversarially robust classifier model, the reconstructions contain sufficient local detail and global structure that they might be confused with the original image in a quick glance, and the object category can clearly be gleaned from the reconstruction. Our approach is based on BigGAN (Brock, 2019), with conditioning on logits instead of one-hot class labels. We use our reconstruction model as a tool for exploring the nature of representations, including: the influence of model architecture and training objectives (specifically robust losses), the forms of invariance that networks achieve, representational differences between correctly and incorrectly classified images, and the effects of manipulating logits and images. We believe that our method can inspire future investigations into the nature of information flow in a neural net and can provide diagnostics for improving discriminative models. We provide pre-trained models and visualizations at https://sites.google.com/view/ understanding-invariance/home. Presently at Boston University; work was begun while author was an AI Resident at Google Research Google Research University of Colorado, Boulder. Correspondence to: Piotr Teterwak <piotrt@bu.edu>. Proceedings of the 38 th International Conference on Machine Learning, PMLR 139, 2021. Copyright 2021 by the author(s).

[1] Sergey Ioffe,et al. Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Pushmeet Kohli,et al. Adversarial Robustness through Local Linearization , 2019, NeurIPS.

[3] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[4] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[5] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Ashish Kapoor,et al. Do Adversarially Robust ImageNet Models Transfer Better? , 2020, NeurIPS.

[7] Taesung Park,et al. Semantic Image Synthesis With Spatially-Adaptive Normalization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Takeru Miyato,et al. cGANs with Projection Discriminator , 2018, ICLR.

[9] Thomas Brox,et al. Generating Images with Perceptual Similarity Metrics based on Deep Networks , 2016, NIPS.

[10] Roland Vollgraf,et al. Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms , 2017, ArXiv.

[11] Aleksander Madry,et al. Image Synthesis with a Single (Robust) Classifier , 2019, NeurIPS.

[12] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[13] Aleksander Madry,et al. Towards Deep Learning Models Resistant to Adversarial Attacks , 2017, ICLR.

[14] Hongyi Zhang,et al. mixup: Beyond Empirical Risk Minimization , 2017, ICLR.

[15] Hugo Larochelle,et al. Modulating early visual processing by language , 2017, NIPS.

[16] Alexei A. Efros,et al. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17] Kibok Lee,et al. Augmenting Supervised Neural Networks with Unsupervised Objectives for Large-scale Image Classification , 2016, ICML.

[18] Alexander Kolesnikov,et al. MLP-Mixer: An all-MLP Architecture for Vision , 2021, ArXiv.

[19] Andrew C. Gallagher,et al. von Mises–Fisher Loss: An Exploration of Embedding Geometries for Supervised Learning , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[20] Trevor Darrell,et al. Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[21] Jonathon Shlens,et al. A Learned Representation For Artistic Style , 2016, ICLR.

[22] William T. Freeman,et al. Boundless: Generative Adversarial Networks for Image Extension , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[23] Georg Heigold,et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2021, ICLR.

[24] Aleksander Madry,et al. Learning Perceptually-Aligned Representations via Adversarial Robustness , 2019, ArXiv.

[25] Jonathon Shlens,et al. Explaining and Harnessing Adversarial Examples , 2014, ICLR.

[26] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[27] Thomas Brox,et al. Inverting Visual Representations with Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28] William T. Freeman,et al. Semantic Pyramid for Image Generation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29] Andrea Vedaldi,et al. Understanding deep image representations by inverting them , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30] Bjorn Ommer,et al. Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with INNs , 2020, ECCV.

[31] Jeff Donahue,et al. Large Scale GAN Training for High Fidelity Natural Image Synthesis , 2018, ICLR.

[32] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[33] Christopher K. I. Williams,et al. Inverting Supervised Representations with Autoregressive Neural Density Models , 2018, AISTATS.