4.5 Article

An optimization for binarization methods by removing binary artifacts

Journal

PATTERN RECOGNITION LETTERS
Volume 34, Issue 11, Pages 1299-1306

Publisher

ELSEVIER
DOI: 10.1016/j.patrec.2013.04.007

Keywords

Historical documents; Threshold; Denoising; Binarization; Minimum error rate; Bayes theory

Funding

  1. National Council on Science and Technology (CONACYT) of Mexico [C00/587/11]

Ask authors/readers for more resources

In this article, we introduce a novel technique to remove binary artifacts. Given a gray-intensity image and its corresponding binary image, our method detects and remove connected components that are more likely to be background pixels. With this aim, our method constructs an auxiliary image by the minimum-error-rate threshold and, then, computes the ratio of intersection between the connected components of the original binary image and the connected components of the auxiliary image. Connected components with high ratio are considered true connected components while the rest are removed from the output. We tested our method in binarization methods for historical documents (handwritten and printed). Our results are favorable and indicate that our method can improve the outputs from diverse binarization methods. In particular, a high improvement was observed for printed documents. Our method is easy to implement, has a moderate computational cost, and has two parameters whose model interpretation allows an easy empirical selection. (C) 2013 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available