4.6 Article

Reviewer Recommendations Using Document Vector Embeddings and a Publisher Database: Implementation and Evaluation

期刊

IEEE ACCESS
卷 10, 期 -, 页码 21798-21811

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2022.3151640

关键词

Training; Databases; Computational modeling; Task analysis; Electronic mail; Text mining; Semantics; Reviewer matching; text mining; document vector embedding; evaluation methodology; explainable learning

资金

  1. U.S. National Science Foundation (NSF) [CCF-1934962]

向作者/读者索取更多资源

This study develops an automated data-driven framework for providing reviewer recommendations for submitted manuscripts. It uses a neural network model and a database to improve the recommendation performance. The study also proposes an evaluation methodology and provides a dataset for further research in this field.
We develop and evaluate an automated data-driven framework for providing reviewer recommendations for submitted manuscripts. Given inputs comprising a set of manuscripts for review and a listing of a pool of prospective reviewers, our system uses a publisher database to extract papers authored by the reviewers from which a Paragraph Vector (doc2vec) neural network model is learned and used to obtain vector space embeddings of documents. Similarities between embeddings of an individual reviewer's papers and a manuscript are then used to compute manuscript-reviewer match scores and to generate a ranked list of recommended reviewers for each manuscript. Our mainline proposed system uses full text versions of the reviewers' papers, which we demonstrate performs significantly better than models developed based on abstracts alone, which has been the predominant paradigm in prior work. Direct retrieval of reviewer's manuscripts from a publisher database reduces reviewer burden, ensures up-to-date data, and eliminates the potential for misuse through data manipulation. We also propose a useful evaluation methodology that addresses hyperparameter selection and enables indirect comparisons with alternative approaches and on prior datasets. Finally, the work also contributes a large scale retrospective reviewer matching dataset and evaluation that we hope will be useful for further research in this field. Our system is quite effective; for the mainline approach, expert judges rated 38% of the recommendations as Very Relevant; 33% as Relevant; 24% as Slightly Relevant; and only 5% as Irrelevant.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据