Journal
2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI)
Volume -, Issue -, Pages 1929-1933Publisher
IEEE
DOI: 10.1109/ISBI48211.2021.9434145
Keywords
Protein subcellular localisation; multilabel classification; sorted random projections
Ask authors/readers for more resources
This paper presents a multi-label classification pipeline and a novel feature descriptor for protein subcellular localization. By utilizing a Location-Sorted Random Projections feature descriptor and Multilabel Synthetic Minority Over-sampling Technique, the computational model achieves state-of-the-art performance on a highly unbalanced dataset with long-tail distribution and multi-label images. Additionally, the method shows excellent performance for minority classes.
In this paper, we present a multi-label classification pipeline and a novel feature descriptor for the protein subcellular localisation. The challenge here is the development of a computational model that can classify multi-site proteins on a highly unbalanced dataset with a long-tail distribution and multi-label images. To address this challenge, we design a Location-Sorted Random Projections feature descriptor to represent image intensity and gradient of the protein of interest in reference to the correlated cellular region. Multilabel Synthetic Minority Over-sampling Technique is optimised to generate synthetic features with labels to handle class imbalance. Our method achieves the state-of-the-art performance on a large-scale public dataset and demonstrates excellent performance for the minority classes.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available