Article

An optimization for binarization methods by removing binary artifacts

Journal

PATTERN RECOGNITION LETTERS
Volume 34, Issue 11, Pages 1299-1306

Publisher

ELSEVIER
DOI: 10.1016/j.patrec.2013.04.007

Keywords

Historical documents; Threshold; Denoising; Binarization; Minimum error rate; Bayes theory

Funding

  1. National Council on Science and Technology (CONACYT) of Mexico [C00/587/11]

In this article, we introduce a novel technique to remove binary artifacts. Given a gray-intensity image and its corresponding binary image, our method detects and removes connected components that are more likely to be background pixels. To this end, our method constructs an auxiliary image using the minimum-error-rate threshold and then computes the ratio of intersection between the connected components of the original binary image and the connected components of the auxiliary image. Connected components with a high ratio are considered true connected components, while the rest are removed from the output. We tested our method on binarization methods for historical documents (handwritten and printed). Our results are favorable and indicate that our method can improve the outputs of diverse binarization methods. In particular, a large improvement was observed for printed documents. Our method is easy to implement, has a moderate computational cost, and has two parameters whose model interpretation allows an easy empirical selection. (C) 2013 Elsevier B.V. All rights reserved.
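
The filtering step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions: the ratio is taken to be the fraction of a component's pixels that are also foreground in the auxiliary image, the names `connected_components`, `remove_artifacts`, `min_ratio`, and `aux_threshold` are hypothetical, and the auxiliary image is built from a fixed, hand-picked threshold rather than from the minimum-error-rate rule used in the paper.

```python
from collections import deque


def connected_components(binary):
    """Return the 8-connected foreground components as lists of (row, col)."""
    rows, cols = len(binary), len(binary[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if not binary[r][c] or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited foreground pixel.
            seen[r][c] = True
            queue, comp = deque([(r, c)]), []
            while queue:
                y, x = queue.popleft()
                comp.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and binary[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            components.append(comp)
    return components


def remove_artifacts(binary, auxiliary, min_ratio=0.5):
    """Keep only components whose overlap with `auxiliary` reaches `min_ratio`.

    Assumption: ratio = |component AND auxiliary foreground| / |component|.
    """
    cleaned = [[0] * len(row) for row in binary]
    for comp in connected_components(binary):
        overlap = sum(1 for y, x in comp if auxiliary[y][x])
        if overlap / len(comp) >= min_ratio:
            for y, x in comp:
                cleaned[y][x] = 1
    return cleaned


# Toy input: a dark vertical stroke (column 1) and one isolated noise pixel.
gray = [
    [200, 40, 200, 200, 200, 200],
    [200, 30, 200, 200, 180, 200],
    [200, 35, 200, 200, 200, 200],
]
binary = [
    [0, 1, 0, 0, 0, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 1, 0, 0, 0, 0],
]

# Assumption: a fixed threshold stands in for the minimum-error-rate threshold.
aux_threshold = 100
auxiliary = [[1 if v < aux_threshold else 0 for v in row] for row in gray]

cleaned = remove_artifacts(binary, auxiliary, min_ratio=0.5)
```

In this toy input the stroke overlaps the auxiliary foreground completely and is kept, while the isolated pixel at row 1, column 4 has no overlap and is dropped. For real images, a labeling routine from an image-processing library would likely be preferable to the pure-Python flood fill used here.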
