COVID-19 CT image recognition algorithm based on transformer and CNN

Journal

DISPLAYS
Volume 72, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.displa.2022.102150

Keywords

Bi-directional feature fusion; Transformer; CNN; COVID-19

Funding

  1. Shanxi Provincial Key Research and Development Project [201903D121121]

A parallel bi-branch model combining a Transformer module and a Convolutional Neural Network module is proposed for extracting features from COVID-19 CT images. It reaches 96.7% classification accuracy on the COVIDx-CT dataset, compared with 95.2% for ResNet-152 and 75.8% for DeiT-B. The model offers a new method for diagnosing COVID-19 and supports the development of real-time diagnosis, which can help save lives.
Novel coronavirus pneumonia (COVID-19) broke out in 2019 and has had a great impact on the world economy and on people's lives. Deep learning, now a mainstream image processing method, has been used to extract medical features from chest CT images and has been applied as a new detection method in clinical practice. However, lesions in COVID-19 CT images are widely distributed and also show many local features, so it is difficult to diagnose the disease directly with existing deep learning models. Based on these characteristics of COVID-19 CT images, a parallel bi-branch model (Trans-CNN Net) built from a Transformer module and a Convolutional Neural Network module is proposed, combining the local feature extraction capability of the Convolutional Neural Network with the global feature extraction advantage of the Transformer. Following the principle of cross-fusion, a bi-directional feature fusion structure is designed in which the features extracted by the two branches are fused in both directions, and the parallel branches are then merged by a feature fusion module, yielding a model that can extract features at different scales. On the COVIDx-CT dataset, the model achieves a classification accuracy of 96.7%, higher than that of a typical CNN (ResNet-152, 95.2%) and of a typical Transformer network (DeiT-B, 75.8%). The model also provides a new method for the diagnosis of COVID-19. By combining deep learning with medical imaging, it promotes the development of real-time diagnosis of lung diseases caused by COVID-19 infection, which supports reliable and rapid diagnosis and can thus help save lives.
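The parallel two-branch idea with bi-directional fusion can be illustrated with a toy sketch in plain Python. This is not the authors' Trans-CNN Net. The function names (`local_branch`, `global_branch`, `fusion_block`, `fuse_branches`), the smoothing kernel, the mixing weight `alpha`, the additive exchange between branches, and the mean-pooled concatenation at the end are all assumptions made for illustration, since the abstract does not specify these details.

```python
import math


def local_branch(tokens, kernel):
    """CNN-like branch (assumed form): a 1D convolution with zero padding,
    so each output depends only on a token's immediate neighbours."""
    half = len(kernel) // 2
    n, d = len(tokens), len(tokens[0])
    out = []
    for i in range(n):
        row = [0.0] * d
        for k, w in enumerate(kernel):
            j = i + k - half
            if 0 <= j < n:  # positions outside the sequence count as zero
                for c in range(d):
                    row[c] += w * tokens[j][c]
        out.append(row)
    return out


def global_branch(tokens):
    """Transformer-like branch (assumed form): unparameterised self-attention,
    so each output is a softmax-weighted mix of all tokens."""
    d = len(tokens[0])
    out = []
    for q in tokens:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        top = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        out.append([sum(w * v[c] for w, v in zip(weights, tokens)) / total
                    for c in range(d)])
    return out


def mix(a, b, alpha):
    """Return a + alpha * b, element by element."""
    return [[x + alpha * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def fusion_block(local_in, global_in, kernel=(0.25, 0.5, 0.25), alpha=0.5):
    """One bi-directional fusion step: each branch keeps its own features
    and receives a weighted copy of the other branch's features."""
    local_feat = local_branch(local_in, kernel)
    global_feat = global_branch(global_in)
    local_out = mix(local_feat, global_feat, alpha)   # global -> local
    global_out = mix(global_feat, local_feat, alpha)  # local -> global
    return local_out, global_out


def fuse_branches(local_feat, global_feat):
    """Final fusion (assumed form): mean-pool each branch over tokens,
    then concatenate the two pooled vectors."""
    def mean_pool(feat):
        n = len(feat)
        return [sum(row[c] for row in feat) / n for c in range(len(feat[0]))]
    return mean_pool(local_feat) + mean_pool(global_feat)


if __name__ == "__main__":
    # Four toy "patch tokens" with two channels each (made-up values).
    tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
    local_out, global_out = fusion_block(tokens, tokens)
    print(fuse_branches(local_out, global_out))
```

In this sketch the two branches stay separate after each fusion step, so several `fusion_block` calls could be chained before `fuse_branches` produces a single vector for a classifier. A real model would use learned convolution and attention weights on 2D feature maps rather than the fixed operations shown here.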
