☆ 4.5 Article

USING INFORMATION ON CLASS INTERRELATIONS TO IMPROVE CLASSIFICATION OF MULTICLASS IMBALANCED DATA: A NEW RESAMPLING ALGORITHM

INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE (2019)

Journal

INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE

Volume 29, Issue 4, Pages 769-781

Publisher

UNIV ZIELONA GORA PRESS

DOI: 10.2478/amcs-2019-0057

Keywords

imbalanced data; multi-class learning; re-sampling; data difficulty factors; similarity degrees

Funding

Institute of Computing Science of the Poznan University of Technology

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

The relations between multiple unbalanced classes can be handled with a specialized approach which evaluates types of examples' difficulty based on an analysis of the class distribution in the examples' neighborhood. additionally exploiting information about the similarity of neighboring classes. In this paper, we demonstrate that such an approach can be implemented as a data preprocessing technique and that it can improve the performance of various classifiers on multiclass Unbalanced datasets. It has led us to the introduction of a new resampling algorithm, called Similarity Oversampling and Undersampling Preprocessing (SOUP), which resamples examples according to their difficulty. Its experimental evaluation on real and artificial datasets has shown that it is competitive with the most popular decomposition ensembles and better than specialized preprocessing techniques for multi-imbalanced problems.

USING INFORMATION ON CLASS INTERRELATIONS TO IMPROVE CLASSIFICATION OF MULTICLASS IMBALANCED DATA: A NEW RESAMPLING ALGORITHM

Journal

INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE

Publisher

UNIV ZIELONA GORA PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

USING INFORMATION ON CLASS INTERRELATIONS TO IMPROVE CLASSIFICATION OF MULTICLASS IMBALANCED DATA: A NEW RESAMPLING ALGORITHM

Journal

INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE

Publisher

UNIV ZIELONA GORA PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper