4.7 Article

LCSegNet: An Efficient Semantic Segmentation Network for Large-Scale Complex Chinese Character Recognition

Journal

IEEE TRANSACTIONS ON MULTIMEDIA
Volume 23, Issue -, Pages 3427-3440

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2020.3025696

Keywords

Character recognition; Feature extraction; Image segmentation; Semantics; Task analysis; Encoding; Text recognition; Character recognition; complex scene; handwriting recognition; label coding; large-scale categories; semantic segmentation

Funding

  1. Natural Science Foundation of China [61872113, 61876052, U1813215]
  2. Strategic Emerging Industry Development Special Funds of Shenzhen [XMHT20190108009, JCYJ20190806112, JCYJ20180306172232154]

Ask authors/readers for more resources

In this paper, an efficient semantic segmentation model LCSegNet based on label coding (LC) is proposed for recognizing large-scale Chinese characters, achieving state-of-the-art performances in both complex scene and handwritten character recognition tasks. The method utilizes a new label coding method called Wubi-CRF and a conditional random field (CRF) module to learn constraint rules, significantly improving the accuracy of Chinese character recognition in scene text recognition tasks.
Complex scene character recognition is a challenging yet important task in machine learning, especially for languages with large character sets, such as Chinese, which is composed of hieroglyphics with large-scale categories and similar glyphs. Recently, state-of-the-art methods based on semantic segmentation have achieved great success in scene parsing and have been applied in scene text recognition. However, because of limitations in terms of memory and computation, they are only applied in the small category recognition tasks, such as tasks involving English alphabets and digits. In this paper, we propose an efficient semantic segmentation model based on label coding (LC), called LCSegNet, to recognize large-scale Chinese characters. First, to reduce the number of labels, we design a new label coding method based on the Wubi Chinese characters code, called Wubi-CRF. In this method, glyphs and structure information of Chinese characters are encoded into 140-bit labels. Second, we employ an efficient semantic segmentation model for pixel-wise prediction and utilize a conditional random field (CRF) module to learn the constraint rules of Wubi-like coding. Finally, experiments are conducted on three benchmarks: a large Chinese text dataset in the wild (CTW), ICDAR2019-ReCTS, and HIT-OR3C dataset. Results show that the proposed method achieves state-of-the-art performances in both complex scene and handwritten character recognition tasks.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available