☆ 4.4 Article

Exceeding chance level by chance: The caveat of theoretical chance levels in brain signal classification and statistical assessment of decoding accuracy

JOURNAL OF NEUROSCIENCE METHODS (2015)

期刊

JOURNAL OF NEUROSCIENCE METHODS

卷 250, 期 -, 页码 126-136

出版社

ELSEVIER

DOI: 10.1016/j.jneumeth.2015.01.010

关键词

k-Fold cross-validation; Small sample size; Classification; Multi-class decoding; Brain-computer-interfaces (BCIs); Machine learning; Binomial cumulative distribution; Classification significance; Decoding accuracy; MEG; ECoG; Intracranial EEG

类别

Biochemical Research Methods Neurosciences

资金

Ecole Doctorale Inter-Disciplinaire Sciences-Sante (EDISS), Lyon, France
LABEX CORTEX of Universite de Lyon [ANR-11-LABX-0042, ANR-11-IDEX-0007]
Canada Research Chairs program

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Machine learning techniques are increasingly used in neuroscience to classify brain signals. Decoding performance is reflected by how much the classification results depart from the rate achieved by purely random classification. In a 2-class or 4-class classification problem, the chance levels are thus 50% or 25% respectively. However, such thresholds hold for an infinite number of data samples but not for small data sets. While this limitation is widely recognized in the machine learning field, it is unfortunately sometimes still overlooked or ignored in the emerging field of brain signal classification. Incidentally, this field is often faced with the difficulty of low sample size. In this study we demonstrate how applying signal classification to Gaussian random signals can yield decoding accuracies of up to 70% or higher in two-class decoding with small sample sets. Most importantly, we provide a thorough quantification of the severity and the parameters affecting this limitation using simulations in which we manipulate sample size, class number, cross-validation parameters (k-fold, leave-one-out and repetition number) and classifier type (Linear-Discriminant Analysis, Naive Bayesian and Support Vector Machine). In addition to raising a red flag of caution, we illustrate the use of analytical and empirical solutions (binomial formula and permutation tests) that tackle the problem by providing statistical significance levels (p-values) for the decoding accuracy, taking sample size into account. Finally, we illustrate the relevance of our simulations and statistical tests on real brain data by assessing noise-level classifications in Magnetoencephalography (MEG) and intracranial EEG (iEEG) baseline recordings. (C) 2015 Elsevier B.V. All rights reserved.

Exceeding chance level by chance: The caveat of theoretical chance levels in brain signal classification and statistical assessment of decoding accuracy

期刊

JOURNAL OF NEUROSCIENCE METHODS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Exceeding chance level by chance: The caveat of theoretical chance levels in brain signal classification and statistical assessment of decoding accuracy

期刊

JOURNAL OF NEUROSCIENCE METHODS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文