☆ 4.6 Article

Meitei Mayek handwritten dataset: compilation, segmentation, and character recognition

VISUAL COMPUTER (2021)

期刊

VISUAL COMPUTER

卷 37, 期 2, 页码 291-305

出版社

SPRINGER

DOI: 10.1007/s00371-020-01799-4

关键词

Meitei Mayek; Indian language; Character recognition; Segmentation

类别

Computer Science, Software Engineering

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The peculiar Indian script Meitei Mayek has experienced a resurgence in recent years and little attention in handwriting research. However, by developing different datasets and recognition systems, high recognition accuracy has been achieved for handwritten Meitei Mayek text.

A peculiar Indian Script Meitei Mayek has experienced a resurgence in the last few years and gets very little attention in handwriting research due to recently insurgence and limited sources. The objective of this paper is two folds; firstly, develop two different datasets: Mayek27 having 4900 isolated Meitei Mayek alphabets and MM (Meitei Mayek) dataset of 189 full-length handwritten text page. Secondly, develop a recognition system on the Mayek27 dataset using convolutional neural network and segmentation algorithms (text-lines, words, and characters) on the full-length Meitei Mayek handwritten text. A recognition rate of 99.02% is achieved using three layers of convolutional layers with a filter size of 3x3 with 16, 32, and 96 kernels. In MM text dataset, the text-line and word segmentation are performed concurrently on 809 lines by tracking space between lines in a novel approach based on horizontal projection histogram and monitoring vertical projection histogram along the run-length of segmentation. Various constraints like skew, curve, close, and touching text-lines are incorporated, and the segmentation algorithm results are 91.84% and 88.96% for text-line and word, respectively. Furthermore, characters are segmented by headline removal, and connected component analysis achieves an accuracy of 91.12%.

Meitei Mayek handwritten dataset: compilation, segmentation, and character recognition

期刊

VISUAL COMPUTER

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Meitei Mayek handwritten dataset: compilation, segmentation, and character recognition

期刊

VISUAL COMPUTER

出版社

SPRINGER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文