4.2 Article

Asking questions on handwritten document collections

相关参考文献

注意:仅列出部分参考文献,下载原文获取全部文献信息。
Proceedings Paper Computer Science, Artificial Intelligence

DocVQA: A Dataset for VQA on Document Images

Minesh Mathew et al.

Summary: DocVQA is a new dataset for Visual Question Answering on document images, with 50,000 questions defined on 12,000+ images. Analysis shows that existing models perform reasonably well on certain question types, but there is still a large performance gap compared to human performance. Models need to improve on questions where understanding the structure of the document is crucial.

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021 (2021)

Proceedings Paper Computer Science, Artificial Intelligence

Scene Text Visual Question Answering

Ali Furkan Biten et al.

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) (2019)

Proceedings Paper Computer Science, Artificial Intelligence

FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents

Guillaume Jaume et al.

2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 2ND INTERNATIONAL WORKSHOP ON OPEN SERVICES AND TOOLS FOR DOCUMENT ANALYSIS (OST), VOL 2 (2019)

Proceedings Paper Computer Science, Artificial Intelligence

Word Spotting and Recognition using Deep Embedding

Praveen Krishnan et al.

2018 13TH IAPR INTERNATIONAL WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS (DAS) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

DVQA: Understanding Data Visualizations via Question Answering

Kushal Kafle et al.

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) (2018)

Proceedings Paper Computer Science, Artificial Intelligence

Are You Smarter Than A Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension

Aniruddha Kembhavi et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

Justin Johnson et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering

Yash Goyal et al.

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) (2017)

Proceedings Paper Computer Science, Artificial Intelligence

LSDE: Levenshtein Space Deep Embedding for Query-by-string Word Spotting

Lluis Gomez et al.

2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 (2017)

Proceedings Paper Computer Science, Interdisciplinary Applications

Reading Wikipedia to Answer Open-Domain Questions

Danqi Chen et al.

PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1 (2017)

Proceedings Paper Computer Science, Artificial Intelligence

VQA: Visual Question Answering

Stanislaw Antol et al.

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) (2015)

Article Computer Science, Artificial Intelligence

Image Classification with the Fisher Vector: Theory and Practice

Jorge Sanchez et al.

INTERNATIONAL JOURNAL OF COMPUTER VISION (2013)

Article Computer Science, Artificial Intelligence

Aggregating Local Image Descriptors into Compact Codes

Herve Jegou et al.

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (2012)

Article Computer Science, Artificial Intelligence

Unconstrained handwritten document retrieval

Huaigu Cao et al.

INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION (2011)

Article Computer Science, Artificial Intelligence

A probabilistic method for keyword retrieval in handwritten document images

Huaigu Cao et al.

PATTERN RECOGNITION (2009)

Article Computer Science, Artificial Intelligence

Retrieving poorly degraded OCR documents

Y. Fataicha et al.

INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION (2006)