Journal
COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, ICCHP 2016, PT I
Volume 9758, Issue -, Pages 35-42
Publisher
SPRINGER INTERNATIONAL PUBLISHING AG
DOI: 10.1007/978-3-319-41264-1_5
Keywords
STEM; OCR; E-born PDF; Accessibility
Funding
- Grants-in-Aid for Scientific Research [25245084] Funding Source: KAKEN
A new method is developed for recognizing STEM content in e-born PDF, that is, PDF produced directly from an electronic source such as a Microsoft Word document or a LaTeX system. Character information extracted directly from the document (the character code, the font type, and the coordinates on the page) is combined with the analysis technologies of math OCR. Compared with ordinary image-based OCR approaches, this remarkably improves the recognition rate for STEM content in e-born PDF. The method is implemented in our math OCR system, InftyReader.
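To make the idea concrete, the following minimal sketch shows how character codes and coordinates taken directly from a PDF could drive a simple structural analysis without any image recognition. It is not InftyReader's algorithm: the `Glyph` record, the `classify_script` and `to_latex` functions, and the size and shift thresholds are all hypothetical, chosen only for illustration, and the input is assumed to be a left-to-right run of characters with baselines measured upward as in PDF user space.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Glyph:
    """Hypothetical record for one character extracted from an e-born PDF."""
    code: str    # character code, here already decoded to text
    font: str    # font name reported by the PDF
    x: float     # left edge of the character
    y: float     # baseline position; larger values are higher on the page
    size: float  # font size in points


def classify_script(base: Glyph, nxt: Glyph,
                    shrink: float = 0.85, shift: float = 0.2) -> str:
    """Label nxt relative to base as 'super', 'sub', or 'inline'.

    Assumed heuristic: a script is set in a smaller size than its base
    and its baseline is shifted by more than shift * base.size.
    """
    if nxt.size > base.size * shrink:
        return "inline"
    dy = nxt.y - base.y
    if dy > shift * base.size:
        return "super"
    if dy < -shift * base.size:
        return "sub"
    return "inline"


def to_latex(glyphs: List[Glyph]) -> str:
    """Turn a left-to-right glyph run into a LaTeX-like string."""
    if not glyphs:
        return ""
    out = [glyphs[0].code]
    base = glyphs[0]
    for g in glyphs[1:]:
        kind = classify_script(base, g)
        if kind == "super":
            out.append("^{" + g.code + "}")
        elif kind == "sub":
            out.append("_{" + g.code + "}")
        else:
            out.append(g.code)
            base = g  # an inline glyph becomes the new base
    return "".join(out)


# Illustrative input: the characters of "x squared plus y sub i".
run = [
    Glyph("x", "CMMI10", 10.0, 100.0, 10.0),
    Glyph("2", "CMR7", 15.5, 104.0, 7.0),
    Glyph("+", "CMR10", 22.0, 100.0, 10.0),
    Glyph("y", "CMMI10", 32.0, 100.0, 10.0),
    Glyph("i", "CMMI7", 37.0, 97.5, 7.0),
]
result = to_latex(run)
```

Because the character codes come from the file itself, a sketch like this never has to guess a character's identity from pixels; only the layout is inferred. A real system would also need to handle fractions, radicals, nested scripts, and fonts with custom encodings, none of which are covered here.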