Conditional Neural Style Transfer with Peer-Regularized Feature Transform

This paper introduces a neural style transfer model that conditionally generates a stylized image using only a set of examples describing the desired style. The proposed solution produces high-quality images even in the zero-shot setting and allows for greater freedom in changing the content geometry. This is made possible by a novel Peer-Regularization Layer that recomposes style in latent space by means of a custom graph convolutional layer designed to separate style and content. Contrary to the vast majority of existing solutions, our model does not require any pre-trained network for computing perceptual losses and can be trained fully end-to-end with a new set of cyclic losses that operate directly in latent space. An extensive ablation study confirms the usefulness of the proposed losses and of the Peer-Regularization Layer, with qualitative results competitive with the current state-of-the-art even in the challenging zero-shot setting. This opens the door to more abstract and artistic neural image generation scenarios and to easier deployment of the model in production.
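
For intuition, below is a minimal sketch of what a peer-regularization step of this kind might look like: each latent pixel of the content image is re-expressed as an attention-weighted combination of its k nearest latent pixels drawn from the style examples, in the spirit of PeerNets-style graph attention. The class name PeerRegularization, the hyper-parameter k, and the PyTorch framing are illustrative assumptions, not the authors' released code.

import torch
import torch.nn.functional as F


class PeerRegularization(torch.nn.Module):
    """Sketch: recompose latent features from k-nearest peer features."""

    def __init__(self, k: int = 5):
        super().__init__()
        self.k = k  # number of peer neighbours; must not exceed M*H*W

    def forward(self, content: torch.Tensor, peers: torch.Tensor) -> torch.Tensor:
        # content: (N, C, H, W) latent features of the content images
        # peers:   (M, C, H, W) latent features of the style examples
        n, c, h, w = content.shape
        q = content.permute(0, 2, 3, 1).reshape(n, h * w, c)   # query pixels
        kv = peers.permute(0, 2, 3, 1).reshape(1, -1, c)       # pooled peer pixels
        kv = kv.expand(n, -1, -1)                              # (N, M*H*W, C)
        # Euclidean distance between every query pixel and every peer pixel.
        dist = torch.cdist(q, kv)                              # (N, HW, M*HW)
        # Keep the k nearest peer pixels and attend over them.
        knn_dist, knn_idx = dist.topk(self.k, dim=-1, largest=False)
        attn = F.softmax(-knn_dist, dim=-1)                    # (N, HW, k)
        gathered = torch.gather(
            kv.unsqueeze(1).expand(-1, h * w, -1, -1),         # (N, HW, M*HW, C)
            2,
            knn_idx.unsqueeze(-1).expand(-1, -1, -1, c),
        )                                                      # (N, HW, k, C)
        out = (attn.unsqueeze(-1) * gathered).sum(dim=2)       # (N, HW, C)
        return out.reshape(n, h, w, c).permute(0, 3, 1, 2)

Because the output is built entirely from peer (style) features while the attention pattern is driven by the content features, a layer of this shape naturally supports the zero-shot setting: swapping in latent features of unseen style examples requires no retraining.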

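Similarly, a minimal sketch of a cycle-consistency loss computed directly in latent space follows, assuming an encoder and a decoder network; the function name latent_cycle_loss and the choice of an L1 penalty are assumptions for illustration, not the paper's exact formulation.

import torch


def latent_cycle_loss(encoder, decoder, content_latent, style_latent):
    # Stylize in latent space, decode to image space, then re-encode:
    # the recovered latent should match the latent we started from,
    # so no pre-trained perceptual network is needed for supervision.
    stylized_image = decoder(content_latent, style_latent)
    recovered_latent = encoder(stylized_image)
    return torch.mean(torch.abs(recovered_latent - content_latent))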