Using Generative Models for Handwritten Digit Recognition

We describe a method of recognizing handwritten digits by fitting generative models that are built from deformable B-splines with Gaussian "ink generators" spaced along the length of the spline. The splines are adjusted using a novel elastic matching procedure based on the expectation maximization algorithm that maximizes the likelihood of the model generating the data. This approach has many advantages: 1) the system not only produces a classification of the digit but also a rich description of the instantiation parameters which can yield information such as the writing style; 2) the generative models can perform recognition driven segmentation; 3) the method involves a relatively small number of parameters and hence training is relatively easy and fast; and 4) unlike many other recognition schemes, it does not rely on some form of pre-normalization of input images, but can handle arbitrary scalings, translations and a limited degree of image rotation. We have demonstrated that our method of fitting models to images does not get trapped in poor local minima. The main disadvantage of the method is that it requires much more computation than more standard OCR techniques.

[1]  Bernard Widrow,et al.  The "Rubber-Mask" Technique I. Pattern Measurement and Analysis , 1973 .

[2]  Martin A. Fischler,et al.  The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[3]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[4]  M. Berthod,et al.  Automatic recognition of handprinted characters—The state of the art , 1980, Proceedings of the IEEE.

[5]  D. Burr A dynamic model for image registration , 1981 .

[6]  David J. Burr,et al.  Elastic Matching of Line Drawings , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  R. Bajcsy,et al.  A computerized system for the elastic matching of deformed radiographic images to idealized atlas images. , 1983, Journal of computer assisted tomography.

[8]  D. J. Burr,et al.  Matching Elastic Templates , 1983 .

[9]  Brian Everitt,et al.  An Introduction to Latent Variable Models , 1984 .

[10]  J. Mantas,et al.  An overview of character recognition methodologies , 1986, Pattern Recognit..

[11]  Malayappan Shridhar,et al.  Recognition of isolated and simply connected handwritten numerals , 1986, Pattern Recognition.

[12]  Peter F. Brown,et al.  The acoustic-modeling problem in automatic speech recognition , 1987 .

[13]  Richard Durbin,et al.  An analogue approach to the travelling salesman problem using an elastic net method , 1987, Nature.

[14]  Mehdi Hatamian,et al.  Optical character recognition by the method of moments , 1987 .

[15]  R. Lathe Phd by thesis , 1988, Nature.

[16]  Ching Y. Suen,et al.  Structural classification and relaxation matching of totally unconstrained handwritten zip-code numbers , 1988, Pattern Recognit..

[17]  John Scott Bridle,et al.  Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition , 1989, NATO Neurocomputing.

[18]  M. V. Rossum,et al.  In Neural Computation , 2022 .

[19]  Ruzena Bajcsy,et al.  Multiresolution elastic matching , 1989, Comput. Vis. Graph. Image Process..

[20]  Richard Szeliski,et al.  An Analysis of the Elastic Net Approach to the Traveling Salesman Problem , 1989, Neural Computation.

[21]  Waibel A novel objective function for improved phoneme recognition using time delay neural networks , 1989 .

[22]  Lawrence D. Jackel,et al.  Handwritten Digit Recognition with a Back-Propagation Network , 1989, NIPS.

[23]  M. Varga,et al.  Dynamic elastic image stretching technique applied to thermographic images , 1990 .

[24]  James D. Keeler,et al.  Integrated Segmentation and Recognition of Hand-Printed Numerals , 1990, NIPS.

[25]  Ulf Grenander,et al.  Hands: A Pattern Theoretic Study of Biological Shapes , 1990 .

[26]  Geoffrey E. Hinton,et al.  Adaptive Elastic Models for Hand-Printed Character Recognition , 1991, NIPS.

[27]  Mehran Moshfeghi,et al.  Elastic matching of multimodality medical images , 1991, CVGIP Graph. Model. Image Process..

[28]  K Fukushima,et al.  Handwritten alphanumeric character recognition by the neocognitron , 1991, IEEE Trans. Neural Networks.

[29]  A. Yuille Deformable Templates for Face Recognition , 1991, Journal of Cognitive Neuroscience.

[30]  Fumitaka Kimura,et al.  Handwritten numerical recognition based on multiple algorithms , 1991, Pattern Recognit..

[31]  Yuchun Lee,et al.  Handwritten Digit Recognition Using K Nearest-Neighbor, Radial-Basis Function, and Backpropagation Neural Networks , 1991, Neural Computation.

[32]  Ching Y. Suen,et al.  Computer recognition of unconstrained handwritten numerals , 1992, Proc. IEEE.

[33]  Y. Le Cun,et al.  Shortest path segmentation: a method for training a neural network to recognize character strings , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[34]  Yann LeCun,et al.  Efficient Pattern Recognition Using a New Transformation Distance , 1992, NIPS.

[35]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[36]  Radford M. Neal A new view of the EM algorithm that justifies incremental and other variants , 1993 .

[37]  John S. Denker,et al.  Improving Rejection Performance on Handwritten Digits by Training with Rubbish , 1993, Neural Computation.

[38]  Patrick Shen-Pei Wang,et al.  An Integrated Architecture for Recognition of Totally Unconstrained Handwritten Numerals , 1993, Int. J. Pattern Recognit. Artif. Intell..

[39]  Jean-Michel Bertille An elastic matching approach applied to digit recognition , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[40]  Timothy F. Cootes,et al.  A Generic System For Classifying Variable Objects Using Flexible Template Matching , 1993, BMVC.

[41]  Geoffrey E. Hinton,et al.  Combining deformable models and neural networks for handprinted digit recognition , 1994 .

[42]  Geoffrey E. Hinton,et al.  To appear in : Advances in Neural Information Processing Systems , 2007 .

[43]  Jonathan J. Hull,et al.  A Database for Handwritten Text Recognition Research , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  Geoffrey E. Hinton,et al.  Recognizing Handwritten Digits Using Mixtures of Linear Models , 1994, NIPS.

[45]  Geoffrey E. Hinton,et al.  Hand-printed digit recognition using deformable models , 1994 .

[46]  Patrick J. Grother,et al.  The Second Census Optical Character Recognition Systems Conference , 1994 .

[47]  Trevor Hastie,et al.  Handwritten Digit Recognition via Deformable Prototypes , 1994 .

[48]  Sebastiano Impedovo,et al.  Fundamentals in Handwriting Recognition , 1994, NATO ASI Series.

[49]  Geoffrey E. Hinton,et al.  The Helmholtz Machine , 1995, Neural Computation.

[50]  David J. C. MacKay,et al.  Bayesian Methods for Backpropagation Networks , 1996 .

[51]  Horst Bunke,et al.  Off-Line, Handwritten Numeral Recognition by Perturbation Method , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[52]  M. Carter Computer graphics: Principles and practice , 1997 .

[53]  Thomas Ertl,et al.  Computer graphics—principles and practice , 1997 .

[54]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .