4.7 Article

A three-way clustering approach for novelty detection

Journal

INFORMATION SCIENCES
Volume 569, Issue -, Pages 650-668

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2021.05.021

Keywords

-

Funding

  1. NUCES faculty research support grant Pakistan
  2. NSERC discovery grant Canada
  3. NUCES faculty publication support Pakistan

Ask authors/readers for more resources

Novelty detection aims to identify novel instances in test data that differ from normal instances in training data. The key challenge is to effectively classify normal instances and reject classification of novel instances. Three-way decisions are a useful strategy to address this challenge.
Novelty detection aims to identify novel instances in the test data that differ in some respect from the normal instances in the training data. Novel instances may be defined and interpreted in different ways. We consider a specific interpretation where novel instances are instances from unknown classes which are not seen during the training phase. This is also sometimes referred to as open world classification or open set recognition. A key challenge in this scenario is to design approaches that effectively classify normal instances and reject the classification of novel instances. Three-way decisions may be realized as a useful strategy to deal with this challenge. It provides provision for deferring the decisions of classifying objects whenever the available evidence is not enough. The deferred cases may be realized as novel or unknown since their classification results are not known and not available. Three-way clustering is an important three-way decision model which can be used for the classification of objects by considering classes as clusters in the data. In this paper, we introduce a three-way clustering based algorithm called reduction and elevation based three-way clustering for open world classification or RE3OWC. A three-way cluster consists of a pair of core and support sets. The RE3OWC uses the operations of reduction and elevation to define the core and support of a three-way cluster. The two sets lead to the three regions of inside, partial and outside corresponding to a cluster. The three regions provide the realization of three-way decisions and are used to identify instances from the unknown classes. Experimental results on datasets of 20 Newsgroups and Amazon reviews suggest improvements in commonly and widely used F1 measure by up to 2.3% and 6.5%, respectively, in comparisons to some of the best known available approaches of DOC, cbsSVM, openMax and others, for identifying instances from unknown classes. (C) 2021 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available