3.8 Proceedings Paper

Recognition of E-Born PDF Including Mathematical Formulas

Journal

Publisher

SPRINGER INTERNATIONAL PUBLISHING AG
DOI: 10.1007/978-3-319-41264-1_5

Keywords

STEM; OCR; E-born PDF; Accessibility

Funding

  1. Grants-in-Aid for Scientific Research [25245084] Funding Source: KAKEN

Ask authors/readers for more resources

A new method to recognize STEM contents in e-born PDF, which is produced originally from an electronic file such as a Microsoft-Word document, LaTeX system, etc., is developed. Character information (the character code, the font type and the coordinates on a page) extracted directly from a document is combined with analysis technologies in Math OCR. It improves recognition rate for STEM contents in e-born PDF remarkably, compared with ordinary image-based OCR approaches. This new method is actually implemented in our math OCR system (InftyReader).

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

3.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available