3.8 Proceedings Paper

MULTI-LABEL CLASSIFICATION BASED ON SUBCELLULAR REGION-GUIDED FEATURE DESCRIPTION FOR PROTEIN LOCALISATION

出版社

IEEE
DOI: 10.1109/ISBI48211.2021.9434145

关键词

Protein subcellular localisation; multilabel classification; sorted random projections

向作者/读者索取更多资源

This paper presents a multi-label classification pipeline and a novel feature descriptor for protein subcellular localization. By utilizing a Location-Sorted Random Projections feature descriptor and Multilabel Synthetic Minority Over-sampling Technique, the computational model achieves state-of-the-art performance on a highly unbalanced dataset with long-tail distribution and multi-label images. Additionally, the method shows excellent performance for minority classes.
In this paper, we present a multi-label classification pipeline and a novel feature descriptor for the protein subcellular localisation. The challenge here is the development of a computational model that can classify multi-site proteins on a highly unbalanced dataset with a long-tail distribution and multi-label images. To address this challenge, we design a Location-Sorted Random Projections feature descriptor to represent image intensity and gradient of the protein of interest in reference to the correlated cellular region. Multilabel Synthetic Minority Over-sampling Technique is optimised to generate synthetic features with labels to handle class imbalance. Our method achieves the state-of-the-art performance on a large-scale public dataset and demonstrates excellent performance for the minority classes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据