Statistical multi-resolution schemes for historical document binarization
Abstract
In previous work, we proposed applying the Expectation-Maximization (EM) algorithm to the binarization of historical documents by defining a multi-resolution framework. In this work, we extend the multi-resolution framework to the Otsu algorithm for effective binarization of historical documents. We compare the effectiveness of the EM-based binarization technique with that of the Otsu thresholding algorithm on historical documents. We demonstrate how the EM algorithm can be extended to perform an effective segmentation of historical documents by taking into account multiple features beyond the intensity of the document image. Experimental results, analysis, and comparisons to known techniques are presented using the document image collection from the DIBCO 2009 contest. © 2011 SPIE-IS&T.
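To make the two techniques named in the abstract concrete, the following is a minimal sketch of global Otsu thresholding and of EM fitting for a two-component Gaussian mixture on pixel intensities. It is not the authors' implementation: the multi-resolution framework and the multi-feature extension are not reproduced, and the function names, the synthetic test page, and all parameter values are assumptions chosen for illustration.

```python
import numpy as np


def otsu_threshold(pixels, bins=256):
    """Return the gray level that maximizes between-class variance (Otsu)."""
    hist, _ = np.histogram(pixels, bins=bins, range=(0, 256))
    p = hist / hist.sum()
    levels = np.arange(bins)
    w0 = np.cumsum(p)              # weight of the dark class for each cut
    mu = np.cumsum(p * levels)     # cumulative first moment
    w1 = 1.0 - w0
    valid = (w0 > 0) & (w1 > 0)    # skip cuts that leave one class empty
    between = np.zeros(bins)
    between[valid] = (mu[-1] * w0[valid] - mu[valid]) ** 2 / (w0[valid] * w1[valid])
    return int(np.argmax(between))


def em_two_gaussians(pixels, iters=50):
    """Fit a two-component 1-D Gaussian mixture to intensities with EM."""
    x = np.asarray(pixels, dtype=float).ravel()
    mu = np.array([x.min(), x.max()])      # assumed initialization: extremes
    var = np.full(2, x.var() + 1e-6)
    pi = np.array([0.5, 0.5])
    for _ in range(iters):
        # E-step: posterior responsibility of each component for each pixel
        diff = x[:, None] - mu
        dens = pi * np.exp(-diff ** 2 / (2 * var)) / np.sqrt(2 * np.pi * var)
        resp = dens / (dens.sum(axis=1, keepdims=True) + 1e-300)
        # M-step: re-estimate weights, means, and variances
        nk = resp.sum(axis=0) + 1e-12
        mu = (resp * x[:, None]).sum(axis=0) / nk
        var = (resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk + 1e-6
        pi = nk / x.size
    return mu, var, pi, resp


def synthetic_page(n_ink=2000, n_bg=8000, seed=0):
    """Hypothetical test data: dark ink and bright background intensities."""
    rng = np.random.default_rng(seed)
    ink = rng.normal(50, 10, n_ink)
    background = rng.normal(200, 15, n_bg)
    return np.clip(np.concatenate([ink, background]), 0, 255).round().astype(np.uint8)


if __name__ == "__main__":
    img = synthetic_page()
    t = otsu_threshold(img)
    mu, var, pi, resp = em_two_gaussians(img)
    otsu_ink = img <= t                    # pixels at or below the threshold are ink
    em_ink = resp.argmax(axis=1) == 0      # component 0 starts at the darkest level
    print("Otsu threshold:", t)
    print("EM means:", mu, "weights:", pi)
    print("Label agreement:", (otsu_ink == em_ink).mean())
```

On a real document image, the same functions could be applied to the flattened grayscale array. Degraded historical pages often have uneven backgrounds, for which a single global threshold is a simplification.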
Document Type
Conference Proceeding
DOI
https://doi.org/10.1117/12.876582
Keywords
binarization, document image analysis, historical documents, image thresholding
Publication Date
5-12-2011
Recommended Citation
Obafemi-Ajayi, Tayo, and Gady Agam. "Statistical multi-resolution schemes for historical document binarization." In Document Recognition and Retrieval XVIII, vol. 7874, p. 78740S. International Society for Optics and Photonics, 2011.
Journal Title
Proceedings of SPIE - The International Society for Optical Engineering