Related references
Note: Only part of the references are listed.
Proceedings Paper
Computer Science, Artificial Intelligence
DocVQA: A Dataset for VQA on Document Images
Minesh Mathew et al.
Summary: DocVQA is a new dataset for Visual Question Answering on document images, with 50,000 questions defined on 12,000+ images. Analysis shows that existing models perform reasonably well on certain question types, but there is still a large performance gap compared to human performance. Models need to improve on questions where understanding the structure of the document is crucial.
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021 (2021)
Proceedings Paper
Computer Science, Artificial Intelligence
FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents
Guillaume Jaume et al.
2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 2ND INTERNATIONAL WORKSHOP ON OPEN SERVICES AND TOOLS FOR DOCUMENT ANALYSIS (OST), VOL 2 (2019)