4.4 Article

A novel focal-loss and class-weight-aware convolutional neural network for the classification of in-text citations

Journal

JOURNAL OF INFORMATION SCIENCE
Volume 49, Issue 1, Pages 79-92

Publisher

SAGE PUBLICATIONS LTD
DOI: 10.1177/0165551521991022

Keywords

Citation classification; class imbalance; convolutional neural network; deep learning; focal loss

Ask authors/readers for more resources

This article presents a deep learning-based citation context classification architecture using a large annotated dataset. The proposed model outperforms existing feature-based citation classification models, achieving better performance in both binary and multi-class citation classification tasks. The use of focal-loss and class-weight functions helps overcome the issue of class imbalance in the dataset.
We argue that citations, as they have different reasons and functions, should not all be treated in the same way. Using the large, annotated dataset of about 10K citation contexts annotated by human experts, extracted from the Association for Computational Linguistics repository, we present a deep learning-based citation context classification architecture. Unlike all existing state-of-the-art feature-based citation classification models, our proposed convolutional neural network (CNN) with fastText-based pre-trained embedding vectors uses only the citation context as its input to outperform them in both binary- (important and non-important) and multi-class (Use, Extends, CompareOrContrast, Motivation, Background, Other) citation classification tasks. Furthermore, we propose using focal-loss and class-weight functions in the CNN model to overcome the inherited class imbalance issues in citation classification datasets. We show that using the focal-loss function with CNN adds a factor of ( 1 - p t ) gamma to the cross-entropy function. Our model improves on the baseline results by achieving an encouraging 90.6 F1 score with 90.7% accuracy and a 72.3 F1 score with a 72.1% accuracy score, respectively, for binary- and multi-class citation classification tasks.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available