期刊
2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI)
卷 -, 期 -, 页码 1929-1933出版社
IEEE
DOI: 10.1109/ISBI48211.2021.9434145
关键词
Protein subcellular localisation; multilabel classification; sorted random projections
This paper presents a multi-label classification pipeline and a novel feature descriptor for protein subcellular localization. By utilizing a Location-Sorted Random Projections feature descriptor and Multilabel Synthetic Minority Over-sampling Technique, the computational model achieves state-of-the-art performance on a highly unbalanced dataset with long-tail distribution and multi-label images. Additionally, the method shows excellent performance for minority classes.
In this paper, we present a multi-label classification pipeline and a novel feature descriptor for the protein subcellular localisation. The challenge here is the development of a computational model that can classify multi-site proteins on a highly unbalanced dataset with a long-tail distribution and multi-label images. To address this challenge, we design a Location-Sorted Random Projections feature descriptor to represent image intensity and gradient of the protein of interest in reference to the correlated cellular region. Multilabel Synthetic Minority Over-sampling Technique is optimised to generate synthetic features with labels to handle class imbalance. Our method achieves the state-of-the-art performance on a large-scale public dataset and demonstrates excellent performance for the minority classes.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据