4.5 Article

An NLP-based citation reason analysis using CCRO

Journal

SCIENTOMETRICS
Volume 126, Issue 6, Pages 4769-4791

Publisher

SPRINGER
DOI: 10.1007/s11192-021-03955-6

Keywords

NLP-based citation analysis; Qualitative research evaluation; Text classification; Ontology

Ask authors/readers for more resources

In recent scientific advancements, Artificial Intelligence and Natural Language Processing play a key role in classifying documents and extracting information. This research focuses on understanding the reasons behind citations using an ontology-based approach, with an emphasis on sentiment analysis and collaborative meanings. By annotating citation texts and automatically extracting reasons, the study calculates accuracy in both publicly available and manually curated corpora.
In recent scientific advances, Artificial Intelligence and Natural Language Processing are the major contributors to classifying documents and extracting information. Classifying citations in different classes have gathered a lot of attention due to the large volume of citations available in different digital libraries. Typical citation classification uses sentiment analysis, where various techniques are applied to citations texts to mainly classify them in Positive, Negative and Neutral sentiments. However, there can be innumerable reasons why an author selects another research for citation. Citations' Context and Reasons Ontology-CCRO uses a clear scientific method to articulate eight basic reasons for citing by using an iterative process of sentiment analysis, collaborative meanings, and experts' opinions. Using CCRO, this research paper adopts an ontology-based approach to extract citation's reasons and instantiate ontology classes and properties on two different corpora of citation sentences. One corpus of citation sentences is a publicly available dataset, while the other is our own manually curated. The process uses a two-step approach. The first part is an interface to manually annotate each citation text in the selected corpora on CCRO properties. A team of carefully selected annotators has annotated each citation to achieve a high inter-annotator agreement. The second part focuses on the automatic extraction of these reasons. Using Natural Language Processing, Mapping Graph, and Reporting Verb in a citation sentence, citation's reason is extracted and mapped onto a CCRO property. After comparing both manual and automatic mapping, accuracy is calculated. Based on experiments and results, accuracy is calculated for both publicly available and own corpora of citation sentences.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available