Ambiguity and Constraint in Mathematical Expression Recognition

The problem of recognizing mathematical expressions differs significantly from the recognition of standard prose. While in prose significant constraints can be put on the interpretation of a character by the characters immediately preceding and following it, few such simple constraints are present in a mathematical expression. In order to make the problem tractable, effective methods of recognizing mathematical expressions will need to put intelligent constraints on the possible interpretations. The authors present preliminary results on a system for the recognition of both handwritten and typeset mathematical expressions. While previous systems perform character recognition out of context, the current system maintains ambiguity of the characters until context can be used to disambiguate the interpretation. In addition, the system limits the number of potentially valid interpretations by decomposing the expressions into a sequence of compatible convex regions. The system uses A-star to search for the best possible interpretation of an expression. We provide a new lower bound estimate on the cost to goal that improves performance significantly.

[1]  William Martin A Fast-Parsing Scheme for Hand-Printed Mathematical Expressions , 1967 .

[2]  R. J. Nelson,et al.  Introduction to Automata , 1968 .

[3]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[4]  P. A. Chou,et al.  Recognition of Equations Using a Two-Dimensional Stochastic Context-Free Grammar , 1989, Other Conferences.

[5]  Kazuhiko Yamamoto,et al.  Structured Document Image Analysis , 1992, Springer Berlin Heidelberg.

[6]  Masayuki Okamoto,et al.  An Experimental Implementation of a Document Recognition System for Papers Containing Mathematical Expressions , 1992 .

[7]  Daniel P. Huttenlocher,et al.  Comparing Images Using the Hausdorff Distance , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Eugene Charniak,et al.  Statistical language learning , 1997 .

[9]  Hsi-Jian Lee,et al.  Understanding mathematical expressions using procedure-oriented transformation , 1994, Pattern Recognit..

[10]  Manfred K. Lang,et al.  A soft-decision approach for structural analysis of handwritten mathematical expressions , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[11]  Richard F. Lyon,et al.  Effective Training of a Neural Network Character Classifier for Word Recognition , 1996, NIPS.

[12]  Richard J. Fateman,et al.  Progress in recognizing typeset mathematics , 1996, Electronic Imaging.

[13]  Jesse F Hull Recognition of mathematics using a two-dimensional trainable context-free grammar , 1996 .

[14]  Stephane Lavirotte Optical formula recognition , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[15]  Hsi-Jian Lee,et al.  Design of a mathematical expression understanding system , 1997, Pattern Recognit. Lett..