A general segmentation scheme for DjVu document compression

We describe the “DjVu” (Déjà Vu) technology: an efficient document image compression methodology, a file format, and a delivery platform that together, enable instant access to high quality documents from essentially any platform, over any connection. Originally developed for scanned color documents, it was recently expanded to electronic documents, so DjVu has now truly become a universal document interchange format. With DjVu, a color magazine page scanned at 300dpi typically occupies between 40KB and 80KB, i.e. approximately 5 to 10 times smaller than JPEG for a similar level of readability (the typical compression ratio is 500:1). Converting electronic documents to DjVu also offers substantial advantages, as described in the paper. The technology relies on a classification of each pixel as either foreground (text, drawing) or background (pictures, paper texture and color), thereby producing a segmentation into layers that are compressed separately. The novel contribution of this paper is a unified approach for segmentation of scanned or electronic documents, using a rigorous approach based on the Minimum Description Length (MDL) principle. The foreground layer is compressed using a pattern matching technique taking advantage of the similarities between character shapes. A progressive, wavelet-based compression technique, combined with a masking algorithm, is then used to compress the background image at lower resolution, while minimizing the number of bits spent on the pixels that are otherwise covered by foreground pixels. Encoders, decoders, and real-time, memory efficient plug-ins for various web browsers are available for all the major platforms. H. Talbot, R. Beare (Eds): Proceedings of ISMM2002 Redistribution rights reserved CSIRO Publishing. ISBN 0 643 06804 X 17

[1]  Yann LeCun,et al.  Efficient conversion of digital documents to multilayer raster formats , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[2]  Robert H. Thibadeau,et al.  Antique Books , 1997, D Lib Mag..

[3]  David S. Taubman,et al.  High performance scalable image compression with EBCOT , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[4]  William A. Pearlman,et al.  A new, fast, and efficient image codec based on set partitioning in hierarchical trees , 1996, IEEE Trans. Circuits Syst. Video Technol..

[5]  Yoshua Bengio,et al.  The Z-coder adaptive binary coder , 1998, Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225).

[6]  Paul G. Howard,et al.  Text Image Compression Using Soft Pattern Matching , 1997, Comput. J..

[7]  W. Sweldens The Lifting Scheme: A Custom - Design Construction of Biorthogonal Wavelets "Industrial Mathematics , 1996 .

[8]  George Nagy,et al.  A Means for Achieving a High Degree of Compaction on Scan-Digitized Printed Text , 1974, IEEE Transactions on Computers.

[9]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .

[10]  Yoshua Bengio,et al.  High quality document image compression with "DjVu" , 1998, J. Electronic Imaging.

[11]  J. Rissanen Stochastic Complexity and Modeling , 1986 .

[12]  Pascal Vincent,et al.  Color documents on the Web with DjVu , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[13]  Stuart Inglis Lossless Document Image Compression , 1999 .

[14]  Wayne Niblack,et al.  Unsupervised image segmentation using the minimum description length principle , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[15]  Patrick Haffner,et al.  DjVu document browsing with on-demand loading and rendering of image components , 2000, IS&T/SPIE Electronic Imaging.

[16]  Steven Pigeon,et al.  Lossy compression of partially masked still images , 1998, Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225).

[17]  Ian H. Witten,et al.  Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .

[18]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.