A weighted ML-KNN based on discernibility of attributes to heterogeneous sample pairs

INFORMATION PROCESSING & MANAGEMENT (2022)

Journal

INFORMATION PROCESSING & MANAGEMENT

Volume 59, Issue 5

Publisher

ELSEVIER SCI LTD

DOI: 10.1016/j.ipm.2022.103053

Keywords

Multi-label classification; Rough set; Attribute weight; Boundary region; Heterogeneous sample pair

Funding

National Natural Science Foundation of China [62072294, 61806116, 61972238]
Key R&D Program of Shanxi Province, China [201903D421041]
Natural Science Foundation of Shanxi, China [201801D221175, 201901D211176]
Training Program for Young Scientific Researchers of Higher Education Institutions in Shanxi, China, Industry-University-Research Collaboration Program Between Shanxi University
Cultivate Scientific Research Excellence Programs of Higher Education Institutions in Shanxi [2019SK036]
Postgraduate Education Reform Research Project of Shanxi Province, China [2021YJJG041]
1331 Engineering Project of Shanxi Province, China

Automated Summary

This paper proposes a hybrid framework that combines rough sets with ML-KNN for multi-label learning. It aims to improve classification performance by characterizing easily misclassified samples and evaluating how well each attribute discerns them. Experimental results verify its effectiveness relative to several state-of-the-art multi-label classification methods.

Abstract

ML-KNN is a well-known multi-label classification method, but its performance may be affected by uncertain knowledge in the samples. Rough set theory is an effective tool for analyzing data uncertainty and can identify the samples that are likely to cause misclassification during learning. This paper proposes a hybrid framework that fuses rough sets with ML-KNN for multi-label learning. Its main idea is to characterize easily misclassified samples with rough sets and to measure the discernibility of attributes with respect to such samples. First, a rough set model named NRFD_RS, based on neighborhood relations and fuzzy decisions, is proposed for multi-label data to find the heterogeneous sample pairs generated from the boundary region of each label. Then, the weight of an attribute is defined by evaluating its discernibility with respect to those heterogeneous sample pairs. Finally, a weighted HEOM distance is constructed and used in ML-KNN. Comprehensive experimental results on fourteen public multi-label data sets, including ten regular-scale and four larger-scale data sets, verify the effectiveness of the proposed framework relative to several state-of-the-art multi-label classification methods.
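The abstract does not give the formulas, so the following is only a minimal sketch of the two ideas it names: a weight per attribute derived from heterogeneous sample pairs, and a weighted HEOM (Heterogeneous Euclidean-Overlap Metric) distance. The function names, the `threshold` parameter, and the counting-based weight are illustrative assumptions, not the paper's NRFD_RS definitions. Only the per-attribute HEOM terms (overlap for nominal attributes, range-normalized difference for numeric ones) follow the standard metric.

```python
import math

def heom_weighted(x, y, weights, is_nominal, ranges):
    """Weighted HEOM distance between samples x and y.

    Assumption: the weight multiplies each squared per-attribute distance;
    the paper's exact weighting scheme may differ.
    """
    total = 0.0
    for a, (xa, ya) in enumerate(zip(x, y)):
        if xa is None or ya is None:
            d = 1.0  # standard HEOM: a missing value gives the maximal distance
        elif is_nominal[a]:
            d = 0.0 if xa == ya else 1.0  # overlap metric for nominal attributes
        else:
            # range-normalized absolute difference for numeric attributes
            d = abs(xa - ya) / ranges[a] if ranges[a] > 0 else 0.0
        total += weights[a] * d * d
    return math.sqrt(total)

def discernibility_weights(pairs, is_nominal, ranges, threshold=0.1):
    """Hypothetical attribute weights: the share of heterogeneous sample
    pairs that each attribute tells apart, normalized to sum to 1.

    `pairs` is a list of (x, y) sample pairs assumed to come from the
    boundary regions; how they are found (NRFD_RS) is not modeled here.
    """
    n_attrs = len(is_nominal)
    counts = [0] * n_attrs
    for x, y in pairs:
        for a in range(n_attrs):
            if is_nominal[a]:
                separated = x[a] != y[a]
            else:
                separated = ranges[a] > 0 and abs(x[a] - y[a]) / ranges[a] > threshold
            if separated:
                counts[a] += 1
    total = sum(counts)
    if total == 0:
        return [1.0 / n_attrs] * n_attrs  # fall back to uniform weights
    return [c / total for c in counts]

# Two attributes: one numeric (range 1.0) and one nominal.
is_nominal = [False, True]
ranges = [1.0, 1.0]
pairs = [((1.0, "a"), (1.0, "b")), ((0.0, "a"), (1.0, "a"))]
weights = discernibility_weights(pairs, is_nominal, ranges)
distance = heom_weighted((0.0, "a"), (1.0, "b"), weights, is_nominal, ranges)
```

In a KNN setting, a distance of this kind would replace the unweighted metric in the neighbor search, so attributes that separate more boundary-region pairs have more influence on which neighbors are selected.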
