论文信息 - Separating Style and Content

Separating Style and Content

We seek to analyze and manipulate two factors, which we call style and content, underlying a set of observations. We fit training data with bilinear models which explicitly represent the two-factor structure. These models can adapt easily during testing to new styles or content, allowing us to solve three general tasks: extrapolation of a new style to unobserved content; classification of content observed in a new style; and translation of new content observed in a new style. For classification, we embed bilinear models in a probabilistic framework, Separable Mixture Models (SMMs), which generalizes earlier work on factorial mixture models [7, 3]. Significant performance improvement on a benchmark speech dataset shows the benefits of our approach.

Joshua B. Tenenbaum | William T. Freeman | J. Tenenbaum | W. Freeman

[1] J. Magnus,et al. Matrix Differential Calculus with Applications in Statistics and Econometrics (Revised Edition) , 1999 .

[2] M. Turk,et al. Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[3] J. Magnus,et al. Matrix Differential Calculus with Applications in Statistics and Econometrics , 1991 .

[4] David G. Stork,et al. Connectionist generalization for production: An example from GridFont , 1992, Neural Networks.

[5] B A Wandell,et al. Linear models of surface and illuminant spectra. , 1992, Journal of the Optical Society of America. A, Optics and image science.

[6] Geoffrey E. Hinton,et al. Autoencoders, Minimum Description Length and Helmholtz Free Energy , 1993, NIPS.

[7] Rich Caruana,et al. Learning Many Related Tasks at the Same Time with Backpropagation , 1994, NIPS.

[8] Peter W. Hallinan. A low-dimensional representation of human faces for arbitrary lighting conditions , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[9] Zoubin Ghahramani,et al. Factorial Learning and the EM Algorithm , 1994, NIPS.

[10] Stephen M. Omohundro. Family Discovery , 1995, NIPS.

[11] Douglas R. Hofstadter,et al. Fluid Concepts and Creative Analogies , 1995 .

[12] Robert Tibshirani,et al. Discriminant Adaptive Nearest Neighbor Classification , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[13] Joshua B. Tenenbaum,et al. Learning bilinear models for two-factor problems in vision , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.