Article

Self-paced ensemble and big data identification: a classification of substantial imbalance computational analysis

Journal

JOURNAL OF SUPERCOMPUTING
Volume -, Issue -, Pages -

Publisher

SPRINGER
DOI: 10.1007/s11227-023-05828-6

Keywords

Self-paced ensemble; Big data; Classification; Computational; Simulation; Substantial imbalance

This research paper focuses on the challenges of learning classifiers from large-scale, highly imbalanced datasets. The proposed self-paced ensemble framework addresses the challenges of class overlap and skewed distributions, while maintaining computational efficiency.
This research paper focuses on the challenges of learning classifiers from the large-scale, highly imbalanced datasets that are prevalent in many real-world applications. Traditional learning algorithms often suffer from poor performance and low computational efficiency when dealing with imbalanced data. Factors such as class imbalance, noise, and class overlap make it difficult to learn effective classifiers. In this study, we propose a novel self-paced ensemble framework for classifying imbalanced data. The framework employs under-sampling to self-harmonize data hardness and build a robust ensemble. Extensive experimental testing demonstrates promising results in handling overlapping classes and skewed distributions while maintaining computational efficiency. The self-paced ensemble method addresses the challenges of high imbalance ratios, class overlap, and the presence of noise in large-scale imbalanced classification problems. By incorporating knowledge of these challenges into our learning framework, we establish the concept of a classification hardness distribution. We present the self-paced ensemble as a new learning paradigm for massively imbalanced classification, capable of improving the performance of existing learning algorithms on imbalanced data and of providing better results for future applications.
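
The abstract does not spell out the sampling rule, so the following is only a minimal sketch of what "under-sampling to self-harmonize data hardness" could look like. It assumes one common formulation: majority-class samples are grouped into bins by a per-sample hardness score (for example, the current ensemble's absolute prediction error), and each bin is sampled with a weight of 1 / (mean hardness + alpha), where alpha is a self-paced factor that grows over training rounds. The function name, its parameters, and the binning scheme are hypothetical and are not taken from the paper.

```python
import random

def self_paced_undersample(hardness, n_select, k_bins=5, alpha=0.0, seed=0):
    """Return indices of majority-class samples chosen by hardness-binned
    under-sampling. All names here are illustrative assumptions."""
    rng = random.Random(seed)
    lo, hi = min(hardness), max(hardness)
    if hi == lo:
        # Every sample is equally hard: plain random under-sampling.
        k = min(n_select, len(hardness))
        return sorted(rng.sample(range(len(hardness)), k))

    # Assign each sample to one of k_bins equal-width hardness bins.
    width = (hi - lo) / k_bins
    bins = [[] for _ in range(k_bins)]
    for i, h in enumerate(hardness):
        b = min(int((h - lo) / width), k_bins - 1)
        bins[b].append(i)

    # Unnormalised bin weight: 1 / (mean hardness + alpha).
    # Small alpha favours easy bins; large alpha flattens the weights,
    # so sparse, hard bins receive relatively more attention.
    weights = []
    for members in bins:
        if members:
            mean_h = sum(hardness[i] for i in members) / len(members)
            weights.append(1.0 / (mean_h + alpha + 1e-12))
        else:
            weights.append(0.0)
    total = sum(weights)

    # Draw each bin's share without replacement, capped by the bin size,
    # so the result holds approximately n_select indices.
    chosen = []
    for members, w in zip(bins, weights):
        quota = min(len(members), round(n_select * w / total))
        chosen.extend(rng.sample(members, quota))
    return sorted(chosen)
```

In an ensemble loop under these assumptions, each round would recompute hardness from the current ensemble, increase alpha, call a function like this to pick a majority-class subset, and train a new base classifier on that subset together with all minority-class samples.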
