3.8 Proceedings Paper

A Statistical approach to line segmentation in handwritten documents

期刊

出版社

SPIE-INT SOC OPTICAL ENGINEERING
DOI: 10.1117/12.704538

关键词

-

向作者/读者索取更多资源

A new technique to segment a handwritten document into distinct lines of text is presented. Line segmentation is the first and the most critical pre-processing step for a document recognition/analysis task. The proposed algorithm starts, by obtaining an initial set of candidate lines from the piece-wise projection profile of the document. The lines traverse around any obstructing handwritten connected component by associating it to the line above or below. A decision of associating such a component is made by (i) modeling the lines as bivariate Gaussian densities and evaluating the probability of the component under each Gaussian or (ii) the probability obtained from a distance metric. The proposed method is robust to handle skewed documents and those with lines running into each other. Experimental results show that on 720 documents (which includes English,Arabic and children's handwriting) containing a total of 11, 581 lines, 97.31 % of the lines were segmented correctly. On an experiment over 200 handwritten images with 78, 902 connected components, 98.81 % of them were associated to the correct lines.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据