4.6 Article

Meitei Mayek handwritten dataset: compilation, segmentation, and character recognition

期刊

VISUAL COMPUTER
卷 37, 期 2, 页码 291-305

出版社

SPRINGER
DOI: 10.1007/s00371-020-01799-4

关键词

Meitei Mayek; Indian language; Character recognition; Segmentation

向作者/读者索取更多资源

The peculiar Indian script Meitei Mayek has experienced a resurgence in recent years and little attention in handwriting research. However, by developing different datasets and recognition systems, high recognition accuracy has been achieved for handwritten Meitei Mayek text.
A peculiar Indian Script Meitei Mayek has experienced a resurgence in the last few years and gets very little attention in handwriting research due to recently insurgence and limited sources. The objective of this paper is two folds; firstly, develop two different datasets: Mayek27 having 4900 isolated Meitei Mayek alphabets and MM (Meitei Mayek) dataset of 189 full-length handwritten text page. Secondly, develop a recognition system on the Mayek27 dataset using convolutional neural network and segmentation algorithms (text-lines, words, and characters) on the full-length Meitei Mayek handwritten text. A recognition rate of 99.02% is achieved using three layers of convolutional layers with a filter size of 3x3 with 16, 32, and 96 kernels. In MM text dataset, the text-line and word segmentation are performed concurrently on 809 lines by tracking space between lines in a novel approach based on horizontal projection histogram and monitoring vertical projection histogram along the run-length of segmentation. Various constraints like skew, curve, close, and touching text-lines are incorporated, and the segmentation algorithm results are 91.84% and 88.96% for text-line and word, respectively. Furthermore, characters are segmented by headline removal, and connected component analysis achieves an accuracy of 91.12%.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据