Alignment by Maximization of Mutual Information

A new information-theoretic approach is presented for finding the pose of an object in an image. The technique does not require information about the surface properties of the object, besides its shape, and is robust with respect to variations of illumination. In our derivation few assumptions are made about the nature of the imaging process. As a result the algorithms are quite general and may foreseeably be used in a wide variety of imaging situations.Experiments are presented that demonstrate the approach registering magnetic resonance (MR) images, aligning a complex 3D object model to real scenes including clutter and occlusion, tracking a human head in a video sequence and aligning a view-based 2D object model to real images.The method is based on a formulation of the mutual information between the model and the image. As applied here the technique is intensity-based, rather than feature-based. It works well in domains where edge or gradient-magnitude based methods have difficulty, yet it is more robust than traditional correlation. Additionally, it has an efficient implementation that is based on stochastic approximation.

[1]  I. Miller Probability, Random Variables, and Stochastic Processes , 1966 .

[2]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[3]  Peter E. Hart,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[4]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[5]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[6]  E. Bienenstock,et al.  Theory for the development of neuron selectivity: orientation specificity and binocular interaction in visual cortex , 1982, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[7]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[8]  P. Kumar,et al.  Theory and practice of recursive identification , 1985, IEEE Transactions on Automatic Control.

[9]  John G. Proakis,et al.  Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[10]  Ramesh C. Jain,et al.  Three-dimensional object recognition , 1985, CSUR.

[11]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[12]  R. Linsker,et al.  From basic network principles to neural architecture , 1986 .

[13]  Charles R. Dyer,et al.  Model-based recognition in robot vision , 1986, CSUR.

[14]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Ralph Linsker,et al.  Self-organization in a perceptual network , 1988, Computer.

[16]  James A. Anderson,et al.  Neurocomputing: Foundations of Research , 1988 .

[17]  Gunilla Borgefors,et al.  Hierarchical Chamfer Matching: A Parametric Edge Matching Algorithm , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[19]  John S. Bridle,et al.  Training Stochastic Model Recognition Algorithms as Networks can Lead to Maximum Mutual Information Estimation of Parameters , 1989, NIPS.

[20]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[21]  Micha Sharir,et al.  The upper envelope of voronoi surfaces and its applications , 1991, SCG '91.

[22]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[23]  Geoffrey E. Hinton,et al.  Learning to Make Coherent Predictions in Domains with Discontinuities , 1991, NIPS.

[24]  Geoffrey E. Hinton,et al.  Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[25]  Terrence J. Sejnowski,et al.  Unsupervised Discrimination of Clustered Data via Optimization of Binary Information Gain , 1992, NIPS.

[26]  A. Shashua Geometry and Photometry in 3D Visual Recognition , 1992 .

[27]  Nathan Intrator,et al.  Objective function formulation of the BCM theory of visual cortical plasticity: Statistical connections, stability conditions , 1992, Neural Networks.

[28]  S. Ullman,et al.  Geometry and photometry in three-dimensional visual recognition , 1993 .

[29]  L O Hall,et al.  Review of MR image segmentation techniques using pattern recognition. , 1993, Medical physics.

[30]  R. Gupta,et al.  Polynomial modeling and reduction of RF body coil spatial inhomogeneity in MRI , 1993, IEEE Trans. Medical Imaging.

[31]  William Wells,et al.  Statistical object recognition , 1993 .

[32]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[33]  Colin Studholme,et al.  Voxel similarity measures for automated image registration , 1994, Other Conferences.

[34]  Guy Marchal,et al.  3D Multi-Modality Medical Image Registration Using Feature Space Clustering , 1995, CVRMed.

[35]  J. P. Mellor Realtime Camera Calibration for Enhanced Reality Visualization , 1995, CVRMed.

[36]  Paul A. Viola,et al.  Alignment by Maximization of Mutual Information , 1995, Proceedings of IEEE International Conference on Computer Vision.

[37]  Tomaso A. Poggio,et al.  Model-based matching of line drawings by linear combinations of prototypes , 1995, Proceedings of IEEE International Conference on Computer Vision.

[38]  Paul A. Viola,et al.  Multi-modal volume registration by maximization of mutual information , 1996, Medical Image Anal..

[39]  Tomas Lozano-Perez,et al.  An automatic registration method for frameless stereotaxy, image guided surgery, and enhanced reality visualization , 1996 .

[40]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[41]  Yvan G. Leclerc,et al.  Constructing simple stable descriptions for image partitioning , 1989, International Journal of Computer Vision.

[42]  Pavel Pudil,et al.  Introduction to Statistical Pattern Recognition , 2006 .

[43]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .