3.8 Proceedings Paper

A Study of Feature Selection of Magnetogram Complexity Features in an Imbalanced Solar Flare Prediction Data-set

向作者/读者索取更多资源

Solar flares are the conversion of stored magnetic energy into particle acceleration and radiation, with potential significant detrimental effects on earth including damage to technological infrastructure. Recent work has considered methods to predict flare activity from quantitative measures of the solar magnetic field. Feature selection methods provide insight into measures which have the largest discriminatory potential and provide a means to streamline real-time processing of solar data. Since solar flares are rare events, data-sets for such predictive analysis are inherently imbalanced, causing a bias in the classification. Monte Carlo experiments with randomly sub-sampled, balanced data-sets mitigate classifier biases for imbalanced data-sets, but it is unclear how to implement and interpret feature selection in such a framework. We propose a method to determine a feature subset within a sub-sampled classification based on a histogram analysis of selected features. We show that the feature subsets resulting from this analysis yield better classification accuracies across a large imbalanced data-set unseen in the feature selection and classifier training.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据