4.6 Article

Elastic Net Regularized Softmax Regression Methods for Multi-subtype Classification in Cancer

Journal

CURRENT BIOINFORMATICS
Volume 15, Issue 3, Pages 212-224

Publisher

BENTHAM SCIENCE PUBL LTD
DOI: 10.2174/1574893613666181112141724

Keywords

regularization; softmax regression; elastic net; multiple classification; gene selection; cancer

Funding

  1. Fundamental Research Funds for the Central Universities [2015XKQY21]
  2. National Natural Science Foundation of China [61501466]
  3. Natural Science Foundation of Jiangsu Province [BK20150204]
  4. China Postdoctoral Science Foundation [2015M581884]

Ask authors/readers for more resources

Background: Various regularization methods have been proposed to improve the prediction accuracy in cancer diagnosis. Elastic net regularized logistic regression has been widely adopted for cancer classification and gene selection in genetics and molecular biology but is commonly applied to binary classification and regression. However, usually, the cancer subtypes can be more, and most likely cannot be decided precisely. Objective: Besides the multi-class issue, the feature selection problem is also a critical problem for cancer subtype classification. Methods: An Elastic Net Regularized Softmax Regression (ENRSR) for multi-classification is put forward to tackle the multiple classification issue. As an extension of elastic net regularized logistic regression, ENRSR enforces structure sparsity and 'grouping effect' for gene selection based on gene expression data, which may exhibit high correlation. The sparsity structure and 'grouping effect' help to select more propriate discriminable features for multi-classification. Result: It is demonstrated that ENRSR gains more accurate and robust performance compared to the other 6 competing algorithms (K-means, Hierarchical Clustering, Expectation Maximization, Nonnegative Matrix Factorization, Support Vector Machine and Random Forest) in predicting cancer subtypes both on simulation data and real cancer gene expression data in terms of F measure. Conclusion: Our proposed ENRSR method is a reliable regularized softmax regression for multi-subtype classification.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available