☆ 3.8 Proceedings Paper

Document Collection Visual Question Answering

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II (2021)

期刊

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II

卷 12822, 期 -, 页码 778-792

出版社

SPRINGER INTERNATIONAL PUBLISHING AG

DOI: 10.1007/978-3-030-86331-9_50

关键词

Document collection; Visual Question Answering

类别

Computer Science, Information Systems Computer Science, Software Engineering Computer Science, Theory & Methods

资金

UAB PIF scholarship [B18P0070, 2017-SGR-1783]
University Department of the Catalan Government

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Current methods in Document Understanding focus on processing individual documents, while documents are typically organized in collections which provide valuable context for interpretation. To address this issue, DocCVQA introduces a new dataset and task where questions are posed over a whole collection of document images, aiming to provide answers to questions and retrieve the documents containing relevant information. Along with the dataset, a new evaluation metric and baselines are proposed to gain further insights into this new dataset and task.

Current tasks and methods in Document Understanding aims to process documents as single elements. However, documents are usually organized in collections (historical records, purchase invoices), that provide context useful for their interpretation. To address this problem, we introduce Document Collection Visual Question Answering (DocCVQA) a new dataset and related task, where questions are posed over a whole collection of document images and the goal is not only to provide the answer to the given question, but also to retrieve the set of documents that contain the information needed to infer the answer. Along with the dataset we propose a new evaluation metric and baselines which provide further insights to the new dataset and task.

Document Collection Visual Question Answering

期刊

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II

出版社

SPRINGER INTERNATIONAL PUBLISHING AG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Document Collection Visual Question Answering

期刊

DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II

出版社

SPRINGER INTERNATIONAL PUBLISHING AG

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文