Extraction of reference lines from documents with grey-level background using sub-images of wavelets

Based on wavelets, a new theoretical method has been developed to process form documents. In this method, two-dimensional multiresolution analysis (MSA), wavelet decomposition algorithm, and compactly supported orthonormal wavelets are used to transform a document image into sub-images. According to these sub-images, the reference lines of forms can be extracted, and knowledge about the geometric structure of the document can be acquired. Experiments prove that this new method can be applied to process documents with promising results.