Using a Statistical Language Model to Improve the Performance of an HMM-Based Cursive Handwriting Recognition System

This paper presents a system for reading totally unconstrained handwritten text. The kernel of the system is a hidden Markov model (HMM) for handwriting recognition, enhanced by a statistical language model, so that linguistic knowledge beyond the lexicon level is incorporated into the recognition process. Another novel feature of the system is that the HMM is applied in such a way that the difficult problem of segmenting a line of text into individual words is avoided. A number of experiments with various language models and large vocabularies have been conducted. The language models used in the system were also compared analytically on the basis of their perplexity.
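To make the perplexity comparison concrete, the following is a minimal sketch of how a word-level language model could be scored on held-out text. The abstract does not say which model types or smoothing schemes were used, so the bigram model with add-alpha smoothing, the function names `train_bigram` and `perplexity`, and the toy sentences are all assumptions made purely for illustration.

```python
import math
from collections import Counter


def train_bigram(sentences, alpha=1.0):
    """Fit a bigram model with add-alpha smoothing (an illustrative choice).

    `sentences` is a list of token lists. Returns a function prob(word, prev)
    that estimates P(word | prev).
    """
    context_counts = Counter()  # how often each token occurs as a context
    bigram_counts = Counter()   # how often each (prev, word) pair occurs
    vocab = set()
    for words in sentences:
        tokens = ["<s>"] + list(words) + ["</s>"]
        vocab.update(tokens)
        for prev, word in zip(tokens, tokens[1:]):
            context_counts[prev] += 1
            bigram_counts[(prev, word)] += 1
    vocab_size = len(vocab)

    def prob(word, prev):
        numerator = bigram_counts[(prev, word)] + alpha
        denominator = context_counts[prev] + alpha * vocab_size
        return numerator / denominator

    return prob


def perplexity(prob, sentences):
    """Perplexity = 2 ** (-(1/N) * sum of log2 P(w_i | w_{i-1}))."""
    log_sum = 0.0
    n_tokens = 0
    for words in sentences:
        tokens = ["<s>"] + list(words) + ["</s>"]
        for prev, word in zip(tokens, tokens[1:]):
            log_sum += math.log2(prob(word, prev))
            n_tokens += 1
    return 2 ** (-log_sum / n_tokens)


# Toy data, hypothetical and far smaller than any real corpus.
train = [["the", "cat", "sat"], ["the", "dog", "sat"]]
model = train_bigram(train)
print(perplexity(model, train))
print(perplexity(model, [["sat", "the", "cat"]]))
```

A lower perplexity means the model finds the text less surprising, which is why it is a convenient recognizer-independent way to rank candidate language models before plugging them into the HMM decoder.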
