☆ 4.6 Article

A new architecture combining convolutional and transformer-based networks for automatic 3D multi-organ segmentation on CT images

MEDICAL PHYSICS (2023)

期刊

MEDICAL PHYSICS

卷 -, 期 -, 页码 -

出版社

WILEY

DOI: 10.1002/mp.16750

关键词

3D segmentation; CT; deep convolutional network; multi-organ; transformer network

类别

Radiology, Nuclear Medicine & Medical Imaging

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The purpose of this research was to develop and optimize a new architecture for automatically segmenting the prostate gland and normal organs in medical images. The researchers combined a shifted-window transformer with a convolutional U-Net to create the SwinAttUNet network, which achieved high accuracy in segmenting multi-organ anatomy.

PurposeDeep learning-based networks have become increasingly popular in the field of medical image segmentation. The purpose of this research was to develop and optimize a new architecture for automatic segmentation of the prostate gland and normal organs in the pelvic, thoracic, and upper gastro-intestinal (GI) regions.MethodsWe developed an architecture which combines a shifted-window (Swin) transformer with a convolutional U-Net. The network includes a parallel encoder, a cross-fusion block, and a CNN-based decoder to extract local and global information and merge related features on the same scale. A skip connection is applied between the cross-fusion block and decoder to integrate low-level semantic features. Attention gates (AGs) are integrated within the CNN to suppress features in image background regions. Our network is termed SwinAttUNet. We optimized the architecture for automatic image segmentation. Training datasets consisted of planning-CT datasets from 300 prostate cancer patients from an institutional database and 100 CT datasets from a publicly available dataset (CT-ORG). Images were linearly interpolated and resampled to a spatial resolution of (1.0 x 1.0x 1.5) mm3. A volume patch (192 x 192 x 96) was used for training and inference, and the dataset was split into training (75%), validation (10%), and test (15%) cohorts. Data augmentation transforms were applied consisting of random flip, rotation, and intensity scaling. The loss function comprised Dice and cross-entropy equally weighted and summed. We evaluated Dice coefficients (DSC), 95th percentile Hausdorff Distances (HD95), and Average Surface Distances (ASD) between results of our network and ground truth data.ResultsSwinAttUNet, DSC values were 86.54 & PLUSMN; 1.21, 94.15 & PLUSMN; 1.17, and 87.15 & PLUSMN; 1.68% and HD95 values were 5.06 & PLUSMN; 1.42, 3.16 & PLUSMN; 0.93, and 5.54 & PLUSMN; 1.63 mm for the prostate, bladder, and rectum, respectively. Respective ASD values were 1.45 & PLUSMN; 0.57, 0.82 & PLUSMN; 0.12, and 1.42 & PLUSMN; 0.38 mm. For the lung, liver, kidneys and pelvic bones, respective DSC values were: 97.90 & PLUSMN; 0.80, 96.16 & PLUSMN; 0.76, 93.74 & PLUSMN; 2.25, and 89.31 & PLUSMN; 3.87%. Respective HD95 values were: 5.13 & PLUSMN; 4.11, 2.73 & PLUSMN; 1.19, 2.29 & PLUSMN; 1.47, and 5.31 & PLUSMN; 1.25 mm. Respective ASD values were: 1.88 & PLUSMN; 1.45, 1.78 & PLUSMN; 1.21, 0.71 & PLUSMN; 0.43, and 1.21 & PLUSMN; 1.11 mm. Our network outperformed several existing deep learning approaches using only attention-based convolutional or Transformer-based feature strategies, as detailed in the results section.ConclusionsWe have demonstrated that our new architecture combining Transformer- and convolution-based features is able to better learn the local and global context for automatic segmentation of multi-organ, CT-based anatomy.

A new architecture combining convolutional and transformer-based networks for automatic 3D multi-organ segmentation on CT images

期刊

MEDICAL PHYSICS

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

A new architecture combining convolutional and transformer-based networks for automatic 3D multi-organ segmentation on CT images

期刊

MEDICAL PHYSICS

出版社

WILEY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文