nerative Models for andwritten Digit Recognition

We describe a method of recognizing handwritten digits by fitting generative models that are built from deformable B- splines with Gaussian "ink generators" spaced along the length of the spline. The splines are adjusted using a novel elastic matching procedure based on the Expectation Maximization (EM) algorithm that maximizes the likelihood of the model generating the data. This approach has many advantages. 1) After identifying the model most likely to have generated the data, the system not only produces a classification of the digit but also a rich description of the instantiation parameters which can yield information such as the writing style. 2) During the process of explaining the image, generative models can perform recognition driven segmentation. 3) The method involves a relatively small number of parameters and hence training is relatively easy and fast. 4) Unlike many other recognition schemes, it does not rely on some form of pre-normalization of input images, but can handle arbitrary scalings, translations and a limited degree of image rotation. We have demonstrated our method of fitting models to images does not get trapped in poor local minima. The main disadvantage of the method is it requires much more computation than more standard OCR techniques. Index Terms-Deformable model, elastic net, optical character recognition, generative model, probabilistic model, mixture model

[1]  Bernard Widrow,et al.  The "rubber-mask" technique - I. Pattern measurement and analysis , 1973, Pattern Recognit..

[2]  Martin A. Fischler,et al.  The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[3]  J. Hale Machine Perception of Patterns and Pictures , 1973 .

[4]  David J. Burr,et al.  Elastic Matching of Line Drawings , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  D. J. Burr,et al.  Matching Elastic Templates , 1983 .

[6]  Malayappan Shridhar,et al.  Recognition of isolated and simply connected handwritten numerals , 1986, Pattern Recognition.

[7]  Peter F. Brown,et al.  The acoustic-modeling problem in automatic speech recognition , 1987 .

[8]  Richard Durbin,et al.  An analogue approach to the travelling salesman problem using an elastic net method , 1987, Nature.

[9]  Mehdi Hatamian,et al.  Optical character recognition by the method of moments , 1987 .

[10]  Ching Y. Suen,et al.  Structural classification and relaxation matching of totally unconstrained handwritten zip-code numbers , 1988, Pattern Recognit..

[11]  John Scott Bridle,et al.  Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition , 1989, NATO Neurocomputing.

[12]  Ruzena Bajcsy,et al.  Multiresolution elastic matching , 1989, Comput. Vis. Graph. Image Process..

[13]  Richard Szeliski,et al.  An Analysis of the Elastic Net Approach to the Traveling Salesman Problem , 1989, Neural Computation.

[14]  Alexander H. Waibel,et al.  A novel objective function for improved phoneme recognition using time delay neural networks , 1990, International 1989 Joint Conference on Neural Networks.

[15]  Lawrence D. Jackel,et al.  Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[16]  M. Varga,et al.  Dynamic elastic image stretching technique applied to thermographic images , 1990 .

[17]  James D. Keeler,et al.  Integrated Segmentation and Recognition of Hand-Printed Numerals , 1990, NIPS.

[18]  Ulf Grenander,et al.  Hands: A Pattern Theoretic Study of Biological Shapes , 1990 .

[19]  Geoffrey E. Hinton,et al.  Adaptive Elastic Models for Hand-Printed Character Recognition , 1991, NIPS.

[20]  Mehran Moshfeghi,et al.  Elastic matching of multimodality medical images , 1991, CVGIP Graph. Model. Image Process..

[21]  A. Yuille Deformable Templates for Face Recognition , 1991, Journal of Cognitive Neuroscience.

[22]  Fumitaka Kimura,et al.  Handwritten numerical recognition based on multiple algorithms , 1991, Pattern Recognit..

[23]  Ching Y. Suen,et al.  Computer recognition of unconstrained handwritten numerals , 1992, Proc. IEEE.

[24]  Y. Le Cun,et al.  Shortest path segmentation: a method for training a neural network to recognize character strings , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[25]  Yann LeCun,et al.  Efficient Pattern Recognition Using a New Transformation Distance , 1992, NIPS.

[26]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[27]  John S. Denker,et al.  Improving Rejection Performance on Handwritten Digits by Training with Rubbish , 1993, Neural Computation.

[28]  Patrick Shen-Pei Wang,et al.  An Integrated Architecture for Recognition of Totally Unconstrained Handwritten Numerals , 1993, Int. J. Pattern Recognit. Artif. Intell..

[29]  Jean-Michel Bertille An elastic matching approach applied to digit recognition , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[30]  Timothy F. Cootes,et al.  A Generic System For Classifying Variable Objects Using Flexible Template Matching , 1993, BMVC.

[31]  Geoffrey E. Hinton,et al.  Combining deformable models and neural networks for handprinted digit recognition , 1994 .

[32]  Geoffrey E. Hinton,et al.  To appear in : Advances in Neural Information Processing Systems , 2007 .

[33]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  Geoffrey E. Hinton,et al.  Recognizing Handwritten Digits Using Mixtures of Linear Models , 1994, NIPS.

[35]  Geoffrey E. Hinton,et al.  Hand-printed digit recognition using deformable models , 1994 .

[36]  Patrick J. Grother,et al.  The Second Census Optical Character Recognition Systems Conference , 1994 .

[37]  Trevor Hastie,et al.  Handwritten Digit Recognition via Deformable Prototypes , 1994 .

[38]  Sebastiano Impedovo,et al.  Fundamentals in Handwriting Recognition , 1994, NATO ASI Series.

[39]  Geoffrey E. Hinton,et al.  The Helmholtz Machine , 1995, Neural Computation.

[40]  Horst Bunke,et al.  Off-Line, Handwritten Numeral Recognition by Perturbation Method , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  M. Carter Computer graphics: Principles and practice , 1997 .

[42]  Thomas Ertl,et al.  Computer graphics—principles and practice , 1997 .

[43]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .