Learning shape features for document enhancement
Abstract
In previous work we showed that shape descriptor features can be used in Look Up Table (LUT) classifiers to learn patterns of degradation and correction in historical document images. The algorithm encodes the pixel neighborhood information effectively using a variant of shape descriptor. However, the generation of the shape descriptor features was approached in a heuristic manner. In this work, we propose a system of learning the shape features from the training data set by using neural networks: Multilayer Perceptrons (MLP) for feature extraction. Given that the MLP maybe restricted by a limited dataset, we apply a feature selection algorithm to generalize, and thus improve, the feature set obtained from the MLP. We validate the effectiveness and efficiency of the proposed approach via experimental results. © 2009 Copyright SPIE - The International Society for Optical Engineering.
Document Type
Conference Proceeding
DOI
https://doi.org/10.1117/12.838746
Keywords
Artificial neural networks, Document image analysis, Historical documents, Image enhancement, Machine learning
Publication Date
3-29-2010
Recommended Citation
Obafemi-Ajayi, Tayo, Gady Agam, and Ophir Frieder. "Learning shape features for document enhancement." In Document Recognition and Retrieval XVII, vol. 7534, p. 75340F. International Society for Optics and Photonics, 2010.
Journal Title
Proceedings of SPIE - The International Society for Optical Engineering