A Database for Handwritten Text Recognition Research

An image database for handwritten text recognition research is described. Digital images of approximately 5000 city names, 5000 state names, 10000 ZIP Codes, and 50000 alphanumeric characters are included. Each image was scanned from mail in a working post office at 300 pixels/in in 8-bit gray scale on a high-quality flat bed digitizer. The data were unconstrained for the writer, style, and method of preparation. These characteristics help overcome the limitations of earlier databases that contained only isolated characters or were prepared in a laboratory setting under prescribed circumstances. Also, the database is divided into explicit training and testing sets to facilitate the sharing of results among researchers as well as performance comparisons. >

[1]  George Nagy Candide's Practical Principles of Experimental Pattern Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Sargur N. Srihari,et al.  Understanding Handwritten Text in a Structured Environment: Determining ZIP Codes from Addresses , 1991, Int. J. Pattern Recognit. Artif. Intell..

[3]  J.-C. Simon,et al.  Off-line cursive word recognition , 1992, Proc. IEEE.

[4]  George Nagy,et al.  At the frontiers of OCR , 1992, Proc. IEEE.

[5]  Sargur N. Srihari,et al.  Combination of Decisions by Multiple Classifiers , 1992 .