An unconstrained handwriting recognition system

Abstract. In this paper, an integrated offline recognition system for unconstrained handwriting is presented. The proposed system consists of seven main modules: skew angle estimation and correction, printed-handwritten text discrimination, line segmentation, slant removing, word segmentation, and character segmentation and recognition, stemming from the implementation of already existing algorithms as well as novel algorithms. This system has been tested on the NIST, IAM-DB, and GRUHD databases and has achieved accuracy that varies from 65.6% to 100% depending on the database and the experiment.

[1]  Anil K. Jain,et al.  A Generic System for Form Dropout , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Eric Brill,et al.  Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[3]  Nikos Fakotakis,et al.  Skew angle estimation for printed and handwritten documents using the Wigner-Ville distribution , 2002, Image Vis. Comput..

[4]  Rangachar Kasturi,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Tieniu Tan,et al.  A general algorithm for document skew angle estimation , 1997, Proceedings of International Conference on Image Processing.

[6]  Friedrich M. Wahl,et al.  Block segmentation and text extraction in mixed text/image documents , 1982, Comput. Graph. Image Process..

[7]  Anthony J. Robinson,et al.  An Off-Line Cursive Handwriting Recognition System , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Patrick J. Grother,et al.  The First Census Optical Character Recognition Systems Conference | NIST , 1992 .

[9]  Beatrice Lazzerini,et al.  A fuzzy classification based system for handwritten character recognition , 1998, 1998 Second International Conference. Knowledge-Based Intelligent Electronic Systems. Proceedings KES'98 (Cat. No.98EX111).

[10]  Nikos Fakotakis,et al.  New algorithms for skewing correction and slant removal on word-level [OCR] , 1999, ICECS'99. Proceedings of ICECS '99. 6th IEEE International Conference on Electronics, Circuits and Systems (Cat. No.99EX357).

[11]  Ehud Rivlin,et al.  Offline cursive script word recognition – a survey , 1999, International Journal on Document Analysis and Recognition.

[12]  L. D. Harmon,et al.  Automatic recognition of print and script , 1972 .

[13]  Zen Chen,et al.  Handwritten Chinese Character Recognition Using Stroke Structural Sequence Code , 1997, Journal of information science and engineering.

[14]  A. Harvey,et al.  Skew detection in handwritten scripts , 1997, TENCON '97 Brisbane - Australia. Proceedings of IEEE TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications (Cat. No.97CH36162).

[15]  Horst Bunke,et al.  Syntactic and structural pattern recognition : theory and applications , 1990 .

[16]  Ken Samuel,et al.  Dialogue Act Tagging with Transformation-Based Learning , 1998, ACL.

[17]  Sang-Yong Han,et al.  Line removal and restoration of handwritten characters on the form documents , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[18]  G. Kokkinakis,et al.  Handwritten character segmentation using transformation-based learning , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[19]  Paul D. Gader,et al.  Handwritten Word Recognition Using Segmentation-Free Hidden Markov Modeling and Segmentation-Based Dynamic Programming Techniques , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Nikos Fakotakis,et al.  A slant removal algorithm , 2000, Pattern Recognit..

[21]  Yves Lecourtier,et al.  A structural/statistical feature based vector for handwritten character recognition , 1998, Pattern Recognit. Lett..

[22]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Mitchell P. Marcus,et al.  Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.

[24]  Rama Chellappa,et al.  Page segmentation using decision integration and wavelet packets , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[25]  Sargur N. Srihari,et al.  Computer Text Recognition and Error Correction , 1985 .

[26]  V. K. Govindan,et al.  Character recognition - A review , 1990, Pattern Recognit..

[27]  Samy Bengio,et al.  Offline cursive word recognition using continuous density hidden Markov models trained with PCA or ICA features , 2002, Object recognition supported by user interaction for service robots.

[28]  L. Cohen Generalized Phase-Space Distribution Functions , 1966 .

[29]  A. Peter Johnson,et al.  A Fast Algorithm for Bottom-Up Document Layout Analysis , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Michel Gilloux Hidden Markov Models in Handwriting Recognition , 1994 .

[31]  H.-M. Suen,et al.  Text string extraction from images of colour-printed documents , 1996 .

[32]  Horst Bunke,et al.  A full English sentence database for off-line handwriting recognition , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[33]  Malayappan Shridhar,et al.  Handwritten address interpretation using word recognition with and without lexicon , 1995, 1995 IEEE International Conference on Systems, Man and Cybernetics. Intelligent Systems for the 21st Century.