4.7 Article

A comprehensive comparison of end-to-end approaches for handwritten digit string recognition

期刊

EXPERT SYSTEMS WITH APPLICATIONS
卷 165, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2020.114196

关键词

Handwritten digit string recognition; Handwritten digit segmentation; Convolutional neural networks; Deep learning

资金

  1. National Council for Scientific and Technological Development (CNPq) [303252/2018-9, 306684/2018-2]
  2. CAPES [001]
  3. Fondecyt Chile [11150945]
  4. STIC-Amsud [19-STIC-04]
  5. Araucaria Foundation
  6. NVIDIA Corporation

向作者/读者索取更多资源

Traditional approaches for handwritten digit string recognition have relied on digit segmentation, while recent segmentation-free strategies offer a new perspective but still show limitations when dealing with a large number of touching digits. This study introduces an approach that treats a string of digits as a sequence of objects, evaluating various end-to-end methods to solve the HDSR problem.
Over the last decades, most approaches proposed for handwritten digit string recognition (HDSR) have resorted to digit segmentation, which is dominated by heuristics, thereby imposing substantial constraints on the final performance. Few of them have been based on segmentation-free strategies where each pixel column has a potential cut location. Recently, segmentation-free strategies has added another perspective to the problem, leading to promising results. However, these strategies still show some limitations when dealing with a large number of touching digits. To bridge the resulting gap, in this paper, we hypothesize that a string of digits can be approached as a sequence of objects. We thus evaluate different end-to-end approaches to solve the HDSR problem, particularly in two verticals: those based on object-detection (e.g., Yolo and RetinaNet) and those based on sequence-to-sequence representation (CRNN). The main contribution of this work lies in its provision of a comprehensive comparison with a critical analysis of the above mentioned strategies on five benchmarks commonly used to assess HDSR, including the challenging Touching Pair dataset, NIST SD19, and two real-world datasets (CAR and CVL) proposed for the ICFHR 2014 competition on HDSR. Our results show that the Yolo model compares favorably against segmentation-free models with the advantage of having a shorter pipeline that minimizes the presence of heuristics-based models. It achieved a 97%, 96%, and 84% recognition rate on the NIST-SD19, CAR, and CVL datasets, respectively.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据