Journal
2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022)
Volume -, Issue -, Pages 3426-3438
Publisher
IEEE COMPUTER SOC
DOI: 10.1109/ICDE53745.2022.00322
Keywords
Reading Path Generation; Academic Search Engine; Automatic Dataset Creation
Funding
- Canada CIFAR AI Chair Program
- NSERC Discovery Grant [RGPIN2021-03115]
- UK Engineering and Physical Sciences Research Council [EP/P011829/1]
In recent years, computer science has seen rapid growth in the number of research papers, making it hard to find the ones worth reading. This paper introduces a new task called Reading Path Generation (RPG), which automatically generates a path of papers to read for a given query. The proposed graph-optimization-based approach, which considers the relationships between papers, outperforms the baselines in extensive evaluations.
Recent years have witnessed dramatic growth in the volume of research literature, with many new papers published every day, especially in computer science. Gleaning the papers worth reading from this massive literature, whether to conduct a quick survey or to keep up with the latest advances on a specific research topic, has become a challenging task. Existing academic search engines return relevant papers by calculating the relevance between each paper and the query individually. However, such systems usually overlook the prerequisite chains of a research topic and cannot form a meaningful reading path. In this paper, we introduce a new task named Reading Path Generation (RPG), which aims to automatically produce a path of papers to read for a given query. To serve as a research benchmark, we further propose SurveyBank, a dataset consisting of a large number of survey papers in the field of computer science together with their citation relationships. Furthermore, we propose a graph-optimization-based approach for reading path generation that takes the relationships between papers into account. Extensive evaluations demonstrate that our approach outperforms the baselines. A real-time Reading Path Generation (RePaGer) system has also been implemented with our designed model. Our source code and the SurveyBank dataset can be found here (1).
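To make the contrast between independent relevance ranking and a reading path concrete, the following is a minimal illustrative sketch. It is not the paper's graph-optimization method, which the abstract does not specify. It assumes a hypothetical `relevance` score per paper and a hypothetical `cites` mapping, treats a cited paper as a prerequisite of the citing paper, and orders the selected papers with a plain topological sort (Kahn's algorithm), reading the most relevant available paper first.

```python
import heapq


def reading_path(relevance, cites, top_k=5):
    """Order the top_k most relevant papers so that cited papers come first.

    relevance: dict mapping paper id -> relevance score (hypothetical input).
    cites: dict mapping paper id -> list of paper ids it cites (hypothetical input).
    """
    # Step 1: pick papers by individual relevance, as a plain search engine would.
    selected = set(sorted(relevance, key=relevance.get, reverse=True)[:top_k])

    # Step 2: build prerequisite edges (cited -> citing) among the selected papers.
    indegree = {p: 0 for p in selected}
    readers = {p: [] for p in selected}
    for paper in selected:
        for cited in cites.get(paper, ()):
            if cited in selected and cited != paper:
                readers[cited].append(paper)
                indegree[paper] += 1

    # Step 3: Kahn's algorithm; among papers whose prerequisites are all read,
    # take the most relevant one next (heap keyed on negative relevance).
    ready = [(-relevance[p], p) for p in selected if indegree[p] == 0]
    heapq.heapify(ready)
    path = []
    while ready:
        _, paper = heapq.heappop(ready)
        path.append(paper)
        for nxt in readers[paper]:
            indegree[nxt] -= 1
            if indegree[nxt] == 0:
                heapq.heappush(ready, (-relevance[nxt], nxt))
    # Papers caught in a citation cycle never become ready and are omitted.
    return path


# Toy data, invented purely for illustration.
relevance = {"survey": 0.9, "method_a": 0.8, "foundations": 0.5, "unrelated": 0.1}
cites = {"survey": ["method_a", "foundations"], "method_a": ["foundations"]}
print(reading_path(relevance, cites, top_k=3))
```

On this toy input, ranking by relevance alone would list "survey" first, while the sketch places "foundations" and "method_a" before it because "survey" cites both.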