4.7 Article

Bat algorithm for variable selection in multivariate classification modeling using linear discriminant analysis

期刊

MICROCHEMICAL JOURNAL
卷 187, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.microc.2022.108382

关键词

BA-LDA algorithm; Variable selection; Multivariate classification; Linear discriminant analysis

向作者/读者索取更多资源

This paper introduces a bat-inspired algorithm for variable selection in linear discriminant analysis (LDA). The algorithm simulates the echolocation behavior of bats when searching for prey and uses a cost function associated with the average risk of misclassification in LDA. The results show that the algorithm performs similarly to genetic algorithm (GA-LDA) and successive projection algorithm (SPA-LDA) in classifying coffee and vegetable oil samples, and outperforms them in classifying ovarian cancer samples.
Variable selection is an efficient and powerful tool for reducing the dimensionality of multivariate data and multicollinearity, enabling the successful classification of samples by Linear Discriminant Analysis (LDA). This paper describes a bat-inspired algorithm as an alternative to performing variable selection in multivariate classification by LDA. Named BA-LDA, this algorithm simulates the echolocation behavior of bats when moving in search of prey. It was implemented with a cost function associated with the average risk of misclassification in LDA. The performance of BA-LDA was evaluated on mass spectrometry (MS), near-infrared (NIR), and ultraviolet-visible (UV-vis) spectrometric data sets of serum from unaffected and affected women with ovarian cancer, coffee, and vegetable oil samples, respectively. Its performance was compared with the genetic algorithm (GA-LDA) and successive projection algorithm (SPA-LDA). As the main results, BA-LDA presented a classification performance similar to the GALDA and SPA-LDA, classifying all coffee and vegetable oil samples. For the ovarian cancer dataset, BA-LDA (93% accuracy) presented a better classification performance than GA-LDA (88% accuracy) and SPA-LDA (79% accuracy). Regarding the stochastic nature and reproducibility, the BA-LDA algorithm tends to select variables in regions associated with chemical information and with better separation between classes.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据