☆ 3.8 Proceedings Paper

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022) (2022)

期刊

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022)

卷 -, 期 -, 页码 2563-2571

出版社

IEEE COMPUTER SOC

DOI: 10.1109/WACV51458.2022.00262

关键词

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic Imaging Science & Photographic Technology

资金

Swedish Research Council [2018-06074]
Spanish project [RTI2018-095645B-C21]
CERCA Program/Generalitat de Catalunya
AGAUR [2019PROD00090]
UAB [B18P0073]
[PID2020-116298GB-I00]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper addresses the challenge of low-resource Handwritten Text Recognition (HTR) by proposing a data generation technique based on Bayesian Program Learning (BPL). Unlike traditional methods, which require a large amount of annotated images, our method can generate human-like handwriting using only one sample of each symbol in the alphabet. Synthetic lines are then created to train state-of-the-art HTR architectures in a segmentation-free fashion. Quantitative and qualitative analyses confirm the effectiveness of the proposed method.

Low resource Handwritten Text Recognition (HTR) is a hard problem due to the scarce annotated data and the very limited linguistic information (dictionaries and language models). For example, in the case of historical ciphered manuscripts, which are usually written with invented alphabets to hide the message contents. Thus, in this paper we address this problem through a data generation technique based on Bayesian Program Learning (BPL). Contrary to traditional generation approaches, which require a huge amount of annotated images, our method is able to generate human-like handwriting using only one sample of each symbol in the alphabet. After generating symbols, we create synthetic lines to train state-of-the-art HTR architectures in a segmentation free fashion. Quantitative and qualitative analyses were carried out and confirm the effectiveness of the proposed method.

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

期刊

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022)

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

期刊

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022)

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文