☆ 4.4 Article

A visual encoding model based on deep neural networks and transfer learning for brain activity measured by functional magnetic resonance imaging

JOURNAL OF NEUROSCIENCE METHODS (2019)

期刊

JOURNAL OF NEUROSCIENCE METHODS

卷 325, 期 -, 页码 -

出版社

ELSEVIER

DOI: 10.1016/j.jneumeth.2019.108318

关键词

Encoding model; Deep neural network; Transfer learning; Human visual cortex; Functional magnetic resonance imaging

类别

Biochemical Research Methods Neurosciences

资金

National Key Research and Development Plan of China [2017YFB1002502]
National Natural Science Foundation of China [61701089]
Natural Science Foundation of Henan Province of China [162300410333]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Background: Building visual encoding models to accurately predict visual responses is a central challenge for current vision-based brain-machine interface techniques. To achieve high prediction accuracy on neural signals, visual encoding models should include precise visual features and appropriate prediction algorithms. Most existing visual encoding models employ hand-craft visual features (e.g., Gabor wavelets or semantic labels) or data-driven features (e.g., features extracted from deep neural networks (DNN)). They also assume a linear mapping between feature representations to brain activity. However, it remains unknown whether such linear mapping is sufficient for maximizing prediction accuracy. New Method: We construct a new visual encoding framework to predict cortical responses in a benchmark functional magnetic resonance imaging (fMRI) dataset. In this framework, we employ the transfer learning technique to incorporate a pre-trained DNN (i.e., AlexNet) and train a nonlinear mapping from visual features to brain activity. This nonlinear mapping replaces the conventional linear mapping and is supposed to improve prediction accuracy on measured activity in the human visual cortex. Results: The proposed framework can significantly predict responses of over 20% voxels in early visual areas (i.e., V1-lateral occipital region, LO) and achieve unprecedented prediction accuracy. Comparison with Existing Methods: Comparing to two conventional visual encoding models, we find that the proposed encoding model shows consistent higher prediction accuracy in all early visual areas, especially in relatively anterior visual areas (i.e., V4 and LO). Conclusions: Our work proposes a new framework to utilize pre-trained visual features and train non-linear mappings from visual features to brain activity.

A visual encoding model based on deep neural networks and transfer learning for brain activity measured by functional magnetic resonance imaging

期刊

JOURNAL OF NEUROSCIENCE METHODS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A visual encoding model based on deep neural networks and transfer learning for brain activity measured by functional magnetic resonance imaging

期刊

JOURNAL OF NEUROSCIENCE METHODS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文