Statistical multi-resolution schemes for historical document binarization

Abstract

In previous work, we proposed applying the Expectation-Maximization (EM) algorithm to the binarization of historical documents by defining a multi-resolution framework. In this work, we extend the multi-resolution framework to the Otsu algorithm for effective binarization of historical documents. We compare the effectiveness of the EM-based binarization technique with that of the Otsu thresholding algorithm on historical documents. We demonstrate how EM can be extended to perform an effective segmentation of historical documents by taking into account multiple features beyond the intensity of the document image. Experimental results, analysis, and comparisons to known techniques are presented using the document image collection from the DIBCO 2009 contest. © 2011 SPIE-IS&T.

Document Type

Conference Proceeding

DOI

https://doi.org/10.1117/12.876582

Keywords

binarization, document image analysis, historical documents, image thresholding

Publication Date

5-12-2011

Journal Title

Proceedings of SPIE - The International Society for Optical Engineering
