Performance evaluation of pattern classifiers for handwritten character recognition

Abstract. This paper describes a performance evaluation study in which some efficient classifiers are tested in handwritten digit recognition. The evaluated classifiers include a statistical classifier (modified quadratic discriminant function, MQDF), three neural classifiers, and an LVQ (learning vector quantization) classifier. They are efficient in that high accuracies can be achieved at moderate memory space and computation cost. The performance is measured in terms of classification accuracy, sensitivity to training sample size, ambiguity rejection, and outlier resistance. The outlier resistance of neural classifiers is enhanced by training with synthesized outlier data. The classifiers are tested on a large data set extracted from NIST SD19. As results, the test accuracies of the evaluated classifiers are comparable to or higher than those of the nearest neighbor (1-NN) rule and regularized discriminant analysis (RDA). It is shown that neural classifiers are more susceptible to small sample size than MQDF, although they yield higher accuracies on large sample size. As a neural classifier, the polynomial classifier (PC) gives the highest accuracy and performs best in ambiguity rejection. On the other hand, MQDF is superior in outlier rejection even though it is not trained with outlier data. The results indicate that pattern classifiers have complementary advantages and they should be appropriately combined to achieve higher performance.

[1]  Rama Chellappa,et al.  Evaluation of pattern classifiers for fingerprint and OCR applications , 1994, Pattern Recognit..

[2]  Alexander Shustorovich,et al.  A subspace projection approach to feature extraction: The two-dimensional gabor transform for character recognition , 1994, Neural Networks.

[3]  David G. Stork,et al.  Pattern Classification , 1973 .

[4]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Erkki Oja,et al.  Neural and statistical classifiers-taxonomy and two case studies , 1997, IEEE Trans. Neural Networks.

[6]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[7]  Sargur N. Srihari,et al.  Handprinted character/digit recognition using a multiple feature/resolution philos-ophy , 1994 .

[8]  Tetsushi Wakabayashi,et al.  Handwritten numeral recognition using autoassociative neural networks , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[9]  Tetsushi Wakabayashi,et al.  Evaluation and Synthesis of Feature Vectors for Handwritten Numeral Recognition (Special Issue on Character Recognition and Document Understanding) , 1996 .

[10]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[11]  Fumitaka Kimura,et al.  Handwritten numerical recognition based on multiple algorithms , 1991, Pattern Recognit..

[12]  J. Friedman Regularized Discriminant Analysis , 1989 .

[13]  Franco Scarselli,et al.  Are Multilayer Perceptrons Adequate for Pattern Recognition and Verification? , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[15]  Horst Bunke,et al.  Off-Line, Handwritten Numeral Recognition by Perturbation Method , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Bernard Dubuisson,et al.  A statistical decision rule with incomplete knowledge about classes , 1993, Pattern Recognit..

[17]  Robert P. W. Duin,et al.  A note on comparing classifiers , 1996, Pattern Recognit. Lett..

[18]  Jung-Hsien Chiang,et al.  Neural and Fuzzy Methods in Handwriting Recognition , 1997, Computer.

[19]  Michael T. Manry,et al.  Classification-based segmentation of ZIP codes , 1993, IEEE Trans. Syst. Man Cybern..

[20]  Eric Lecolinet,et al.  A Survey of Methods and Strategies in Character Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Hiroshi Sako,et al.  Aspect Ratio Adaptive Normalization for Handwritten Character Recognition , 2000, ICMI.

[22]  Jürgen Schürmann,et al.  Pattern classification - a unified view of statistical and neural approaches , 2008 .

[23]  Harris Drucker,et al.  Comparison of learning algorithms for handwritten digit recognition , 1995 .

[24]  Jung-Hsien Chiang,et al.  Handwritten word recognition with character and inter-character neural networks , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[25]  Horst Bunke,et al.  Off-line handwritten numeral string recognition by combining segmentation-based and segmentation-free methods , 1998, Pattern Recognit..

[26]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[27]  Ulrich Kressel,et al.  PATTERN CLASSIFICATION TECHNIQUES BASED ON FUNCTION APPROXIMATION , 1997 .

[28]  Robert P. W. Duin,et al.  Outlier Detection Using Classifier Instability , 1998, SSPR/SPR.

[29]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[30]  Shinichiro Omachi,et al.  Precise Selection of Candidates for Handwritten Character Recognition Using Feature Regions (Special Issue on Character Recognition and Document Understanding) , 1996 .

[31]  Mario Vento,et al.  A method for improving classification reliability of multilayer perceptrons , 1995, IEEE Trans. Neural Networks.

[32]  Yasuaki Nakano,et al.  Segmentation methods for character recognition: from segmentation to document structure analysis , 1992, Proc. IEEE.

[33]  Stephen J. Roberts,et al.  Supervised and unsupervised learning in radial basis function classifiers , 1994 .

[34]  Gale Martin,et al.  Recognizing Overlapping Hand-Printed Characters by Centered-Object Integrated Segmentation and Recognition , 1991, NIPS.

[35]  Cheng-Lin Liu,et al.  Preprocessing and statistical/structural feature extraction for handwritten numeral recognition , 1997 .

[36]  C. K. Chow,et al.  On optimum recognition error and reject tradeoff , 1970, IEEE Trans. Inf. Theory.

[37]  Anil K. Jain,et al.  Feature extraction methods for character recognition-A survey , 1996, Pattern Recognit..

[38]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[39]  Jung-Hsien Chiang,et al.  A hybrid neural network model in handwritten word recognition , 1998, Neural Networks.

[40]  Geoffrey E. Hinton,et al.  Modeling the manifolds of images of handwritten digits , 1997, IEEE Trans. Neural Networks.

[41]  Horst Bunke,et al.  Handbook of Character Recognition and Document Image Analysis , 1997 .

[42]  T. Kohonen,et al.  Statistical pattern recognition with neural networks: benchmarking studies , 1988, IEEE 1988 International Conference on Neural Networks.

[43]  Geoffrey E. Hinton,et al.  Learning representations by back-propagation errors, nature , 1986 .

[44]  Michael D. Garris,et al.  Neural network-based systems for handprint OCR applications , 1998, IEEE Trans. Image Process..

[45]  Fumitaka Kimura,et al.  Handwritten ZIP code recognition using lexicon free word recognition algorithm , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[46]  Ching Y. Suen,et al.  Sorting and Recognizing Cheques and Financial Documents , 1998, Document Analysis Systems.

[47]  Thomas G. Dietterich,et al.  Improving the Performance of Radial Basis Function Networks by Learning Center Locations , 1991, NIPS.

[48]  Simon Haykin,et al.  GradientBased Learning Applied to Document Recognition , 2001 .

[49]  John S. Denker,et al.  Improving Rejection Performance on Handwritten Digits by Training with Rubbish , 1993, Neural Computation.

[50]  Biing-Hwang Juang,et al.  Discriminative learning for minimum error classification [pattern recognition] , 1992, IEEE Trans. Signal Process..

[51]  Soo-Hyung Kim,et al.  PERFORMANCE COMPARISON OF STATISTICAL AND NEURAL NETWORK CLASSIFIERS IN HANDWRITTEN DIGITS RECOGNITION , 1999 .

[52]  Alex Pentland,et al.  Probabilistic Visual Learning for Object Representation , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[53]  Erkki Oja,et al.  Subspace methods of pattern recognition , 1983 .

[54]  Donald F. Specht,et al.  Probabilistic neural networks , 1990, Neural Networks.

[55]  Yifan Shi,et al.  A new distinguishing algorithm of connected character image based on Fourier transform , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[56]  Teuvo Kohonen,et al.  Improved versions of learning vector quantization , 1990, 1990 IJCNN International Joint Conference on Neural Networks.

[57]  Fumitaka Kimura,et al.  Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[58]  Masaki Nakagawa,et al.  Evaluation of prototype learning algorithms for nearest-neighbor classifier in application to handwritten character recognition , 2001, Pattern Recognit..