论文信息 - Feature Selection and Language Syntax in Text Recognition

Feature Selection and Language Syntax in Text Recognition

249 There are many features that can be used to recognize images of text. The choice of a feature set is usually made intuitively to optimize performance in single character recognition. This approach to feature set selection does not utilize some of the evidence about human processing during reading that suggests feature extraction occurs in parallel with the development of an understanding of the text. Feature extraction in hum~ reading is a two-step process that can be framed as hypothesis generation anq 'testing. The understanding process includes syntactic as well as semantic components. This paper presents a set of algorithms for text recognition that model the essence of human reading with two feature extraction stages and an understanding phase that uses information about the syntactic context between words. An objective is to discover how different feature sets affect the perfonnance of syntax. Statistical experiments show that a simple representation for syntax reduces the number of words in a large lexicon that can match an input word by about 20 percent. Also, the error rate is reduced as the power of the feature detectors is increased.

Jonathan J. Hull | J. C. Simon | J. Hull | J. Simon

[1] Lyn Frazier,et al. The interaction of syntax and semantics during sentence processing: eye movements in the analysis of semantically biased sentences , 1983 .

[2] RAOUF F. H. FARAG,et al. Word-Level Recognition of Cursive Script , 1979, IEEE Transactions on Computers.

[3] Jonathan J. Hull,et al. A computational theory of visual word recognition , 1988 .

[4] Ken Thompson,et al. Reading Chess , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Jonathan J. Hull. Hypothesis Testing in a Computational Theory of Visual Word Recognition , 1987, AAAI.

[6] H. Kucera,et al. Computational analysis of present-day American English , 1967 .

[7] Scott A. Small,et al. The effect of mood on word recognition , 1985 .

[8] D. F. Fisher,et al. Reading and visual search , 1975, Memory & cognition.

[9] Terry Winograd,et al. Language as a Cognitive Process , 1983, CL.

[10] Jonathan J. Hull. Hypothesis Generation in a Computational Model for Visual Word Recognition , 1986, IEEE Expert.