4.6 Article

CTA-UNet: CNN-transformer architecture UNet for dental CBCT images segmentation

Journal

PHYSICS IN MEDICINE AND BIOLOGY
Volume 68, Issue 17, Pages -

Publisher

IOP Publishing Ltd
DOI: 10.1088/1361-6560/acf026

Keywords

deep learning; image segmentation; CNN-transformer architecture; self-supervised learning; masked image modeling

Ask authors/readers for more resources

Due to the limitations in current deep learning models for segmenting dental CBCT images, this study proposed a CNN-Transformer Architecture UNet network that combines CNN and Transformer to effectively extract local features and capture long-range dependencies. Multiple spatial attention modules were included to enhance spatial information extraction. A novel Masked image modeling method was introduced to pre-train the CNN and Transformer modules simultaneously, mitigating limitations caused by insufficient labeled training data. Experimental results demonstrated superior performance in dental CBCT image segmentation, with real-world applicability in orthodontics and dental implants.
In view of the limitations of current deep learning models in segmenting dental cone-beam computed tomography (CBCT) images, specifically dealing with complex root morphological features, fuzzy boundaries between tooth roots and alveolar bone, and the need for costly annotation of dental CBCT images. We collected dental CBCT data from 200 patients and annotated 45 of them for network training, and proposed a CNN-Transformer Architecture UNet network, which combines the advantages of CNN and Transformer. The CNN component effectively extracts local features, while the Transformer captures long-range dependencies. Multiple spatial attention modules were included to enhance the network's ability to extract and represent spatial information. Additionally, we introduced a novel Masked image modeling method to pre-train the CNN and Transformer modules simultaneously, mitigating limitations due to a smaller amount of labeled training data. Experimental results demonstrate that the proposed method achieved superior performance (DSC of 87.12%, IoU of 78.90%, HD95 of 0.525 mm, ASSD of 0.199 mm), and provides a more efficient and effective approach to automatically and accurately segment dental CBCT images, has real-world applicability in orthodontics and dental implants.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available