4.7 Article

Neural representations of the perception of handwritten digits and visual objects from a convolutional neural network compared to humans

Journal

HUMAN BRAIN MAPPING
Volume 44, Issue 5, Pages 2018-2038

Publisher

WILEY
DOI: 10.1002/hbm.26189

Keywords

convolutional neural network; functional magnetic resonance imaging; handwritten digits; representational similarity analysis; visual objects; visual perception

We studied neural representations for visual perception of handwritten digits and visual objects using fMRI and a CNN. The CNN model's neural representation showed a hierarchical topography mapping similar to the human visual system. Lower convolutional layers of the CNN had greater similarity with early visual areas, while higher convolutional layers were encoded in higher-order visual areas. The neural representations for human visual perception were more widely distributed across the whole brain compared to the CNN model.
We investigated neural representations for visual perception of 10 handwritten digits and six visual objects from a convolutional neural network (CNN) and humans using functional magnetic resonance imaging (fMRI). Once our CNN model was fine-tuned using a pre-trained VGG16 model to recognize the visual stimuli from the digit and object categories, representational similarity analysis (RSA) was conducted using neural activations from fMRI and feature representations from the CNN model across all 16 classes. The encoded neural representation of the CNN model exhibited the hierarchical topography mapping of the human visual system. The feature representations in the lower convolutional (Conv) layers showed greater similarity with the neural representations in the early visual areas and parietal cortices, including the posterior cingulate cortex. The feature representations in the higher Conv layers were encoded in the higher-order visual areas, including the ventral/medial/dorsal stream and middle temporal complex. The neural representations in the classification layers were observed mainly in the ventral stream visual cortex (including the inferior temporal cortex), superior parietal cortex, and prefrontal cortex. There was a surprising similarity between the neural representations from the CNN model and the neural representations for human visual perception in the context of the perception of digits versus objects, particularly in the primary visual and associated areas. This study also illustrates the uniqueness of human visual perception. Unlike the CNN model, the neural representation of digits and objects for humans is more widely distributed across the whole brain, including the frontal and temporal areas.
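The representational similarity analysis described above can be illustrated with a minimal sketch. It is not the authors' pipeline: the abstract does not state the dissimilarity measure or the comparison statistic, so this sketch assumes correlation distance for the dissimilarity matrices and a Spearman-style rank correlation to compare them. All data are synthetic, and every name (`rdm`, `rsa_similarity`, `make_patterns`, the feature and voxel counts) is hypothetical. Only the 16 classes (10 digits, six objects) come from the text.

```python
import numpy as np


def rdm(patterns):
    """Representational dissimilarity matrix (assumed metric: 1 - Pearson r).

    patterns has shape (n_conditions, n_features): one row per stimulus class.
    """
    return 1.0 - np.corrcoef(patterns)


def upper_triangle(matrix):
    """Return the off-diagonal upper-triangle entries as a flat vector."""
    rows, cols = np.triu_indices(matrix.shape[0], k=1)
    return matrix[rows, cols]


def ordinal_ranks(values):
    """Ordinal ranks (0..n-1). Ties are not averaged, which is adequate
    for continuous-valued dissimilarities where exact ties are unlikely."""
    order = np.argsort(values)
    ranks = np.empty(len(values), dtype=float)
    ranks[order] = np.arange(len(values))
    return ranks


def rsa_similarity(patterns_a, patterns_b):
    """Rank correlation between the RDMs of two sets of condition patterns."""
    ranks_a = ordinal_ranks(upper_triangle(rdm(patterns_a)))
    ranks_b = ordinal_ranks(upper_triangle(rdm(patterns_b)))
    return float(np.corrcoef(ranks_a, ranks_b)[0, 1])


def make_patterns(labels, n_features, rng, noise=0.5):
    """Synthetic patterns: one random prototype per category plus noise."""
    prototypes = rng.normal(size=(labels.max() + 1, n_features))
    return prototypes[labels] + noise * rng.normal(size=(len(labels), n_features))


rng = np.random.default_rng(0)

# 16 classes as in the text: 10 digits (category 0) and 6 objects (category 1).
labels = np.array([0] * 10 + [1] * 6)

# Hypothetical CNN layer features and fMRI region-of-interest voxel patterns
# that share the digit/object category structure but nothing else.
layer_features = make_patterns(labels, n_features=512, rng=rng)
roi_voxels = make_patterns(labels, n_features=200, rng=rng)

# A control region with no category structure at all.
unrelated_voxels = rng.normal(size=(16, 200))

score_related = rsa_similarity(layer_features, roi_voxels)
score_unrelated = rsa_similarity(layer_features, unrelated_voxels)
print(f"layer vs. structured ROI: {score_related:.2f}")
print(f"layer vs. unrelated ROI:  {score_unrelated:.2f}")
```

Because the comparison is made between dissimilarity matrices rather than raw activations, the layer and the brain region may have different numbers of features and voxels. Only the number of stimulus classes has to match.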
