Towards an Example-Based Image Compression Architecture for Video-Conferencing

This paper consists of two major parts. First, we present the outline of a simple approach to very-low bandwidth video-conferencing system relying on an example-based hierarchical image compression scheme. In particular, we discuss the use of example images as a model, the number of required examples, faces as a class of semi-rigid objects, a hierarchical model based on decomposition into different time-scales, and the decomposition of face images into patches of interest. In the second part, we present several algorithms for image processing and animation as well as experimental evaluations. Among the original contributions of this paper is an automatic algorithm for pose estimation and normalization. We also review and compare different algorithms for finding the nearest neighbors in a database for a new input as well as a generalized algorithm for blending patches of interest in order to synthesize new images. Finally, we outline the possible integration of several algorithms to illustrate a simple model-based video-conference system.

[1]  F. Girosi,et al.  Networks for approximation and learning , 1990, Proc. IEEE.

[2]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[3]  佐藤 孝紀,et al.  A Hierarchical Data Structure for Picture Processing , 1976 .

[4]  Sebastian Tölg Strukturuntersuchungen zur Informationsverarbeitung in neuronaler Architektur am Beispiel der Modellierung von Augenbewegungen für aktives Sehen , 1992 .

[5]  George Wolberg,et al.  Digital image warping , 1990 .

[6]  A. Verri,et al.  Constraints for the computation of optical flow , 1989, [1989] Proceedings. Workshop on Visual Motion.

[7]  Vicki Bruce,et al.  Processing Images of Faces , 1992 .

[8]  David Beymer,et al.  Face recognition under varying pose , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[9]  T. Poggio A theory of how the brain might work. , 1990, Cold Spring Harbor symposia on quantitative biology.

[10]  Tomaso Poggio,et al.  Example Based Image Analysis and Synthesis , 1993 .

[11]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[12]  Yasuhito Suenaga,et al.  Automatic Extraction of Target Images for Face Identification Using the Sub-Space Classification Method (Special Section on Machine Vision Applications) , 1993 .

[13]  Richard M. Stern,et al.  Fast Computation of the Difference of Low-Pass Transform , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Tomaso Poggio,et al.  A Novel Approach to Graphics , 1992 .

[15]  Hiroshi Harashima,et al.  Model-based/waveform hybrid coding for videotelephone images , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[16]  Edward H. Adelson,et al.  Probability distributions of optical flow , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  P. J. Burt,et al.  Fast Filter Transforms for Image Processing , 1981 .

[18]  Cleve Moler,et al.  Mathematical Handbook for Scientists and Engineers , 1961 .

[19]  Rama Chellappa,et al.  A feature based approach to face recognition , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  Larry S. Davis,et al.  Computing spatio-temporal representations of human faces , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[22]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[23]  R L Hight Lip-reader trainer: a computer program for the hearing-impaired. , 1983, Medical electronics.

[24]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[25]  Kiyoharu Aizawa,et al.  Model-based analysis synthesis image coding (MBASIC) system for a person's face , 1989, Signal Process. Image Commun..

[26]  Kenji Mase,et al.  Recognition of Facial Expression from Optical Flow , 1991 .

[27]  Yasuhito Suenaga,et al.  An accurate and robust face identification scheme , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[28]  Dana H. Ballard,et al.  Computer Vision , 1982 .

[29]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[30]  Garrison W. Cottrell,et al.  EMPATH: Face, Emotion, and Gender Recognition Using Holons , 1990, NIPS.

[31]  J. M. Gilbert,et al.  A real-time face recognition system using custom VLSI hardware , 1993, 1993 Computer Architectures for Machine Perception.

[32]  Alex Pentland,et al.  Face recognition using eigenfaces , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[33]  Sebastian Toelg Gaze control for an active camera system by modeling human pursuit eye movements , 1992, Other Conferences.

[34]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[35]  Amnon Shashua,et al.  The Quadric Reference Surface: Applications in Registering Views of Complex 3D Objects , 1994, ECCV.

[36]  H. Harashima,et al.  Analysis and synthesis of facial expressions in knowledge-based coding of facial image sequences , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[37]  Amnon Shashua,et al.  Algebraic Functions For Recognition , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[38]  Edward H. Adelson,et al.  Merging Images Through Pattern Decomposition , 1985, Optics & Photonics.

[39]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Claude L. Fennema,et al.  Velocity determination in scenes containing several moving objects , 1979 .

[41]  Joseph K. Kearney,et al.  Optical Flow Estimation: An Error Analysis of Gradient-Based Methods with Local Optimization , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Azriel Rosenfeld,et al.  Multiresolution image processing and analysis , 1984 .

[43]  Amnon Shashua,et al.  Trilinearity in Visual Recognition by Alignment , 1994, ECCV.

[44]  Takeo Kanade,et al.  Picture Processing System by Computer Complex and Recognition of Human Faces , 1974 .

[45]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[46]  Roberto Brunelli,et al.  Face Recognition: Features Versus Templates , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[48]  Hans-Hellmut Nagel,et al.  Displacement vectors derived from second-order intensity variations in image sequences , 1983, Comput. Vis. Graph. Image Process..

[49]  Edward H. Adelson,et al.  A multiresolution spline with application to image mosaics , 1983, TOGS.

[50]  Yasuhito Suenaga,et al.  Robust face identification scheme: KL expansion of an invariant feature space , 1992, Other Conferences.

[51]  Robert J. Baron,et al.  Mechanisms of Human Facial Recognition , 1981, Int. J. Man Mach. Stud..