4.6 Article

Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medical Segmentation

期刊

DIAGNOSTICS
卷 13, 期 11, 页码 -

出版社

MDPI
DOI: 10.3390/diagnostics13111947

关键词

Segment Anything Model (SAM); medical image segmentation; zero-shot segmentation; large AI models; foundation models; deep Learning

向作者/读者索取更多资源

Medical image analysis is crucial in clinical diagnosis. This paper evaluates the Segment Anything Model (SAM) on various medical image segmentation benchmarks and finds that while SAM performs well on general domain images, its zero-shot segmentation ability is limited for medical images. Inconsistent performance is observed across different medical domains, with complete failure in segmentation of certain structured targets. However, fine-tuning SAM with a small amount of data leads to significant improvement, showing the potential of achieving accurate medical image segmentation for precision diagnostics.
Medical image analysis plays an important role in clinical diagnosis. In this paper, we examine the recent Segment Anything Model (SAM) on medical images, and report both quantitative and qualitative zero-shot segmentation results on nine medical image segmentation benchmarks, covering various imaging modalities, such as optical coherence tomography (OCT), magnetic resonance imaging (MRI), and computed tomography (CT), as well as different applications including dermatology, ophthalmology, and radiology. Those benchmarks are representative and commonly used in model development. Our experimental results indicate that while SAM presents remarkable segmentation performance on images from the general domain, its zero-shot segmentation ability remains restricted for out-of-distribution images, e.g., medical images. In addition, SAM exhibits inconsistent zero-shot segmentation performance across different unseen medical domains. For certain structured targets, e.g., blood vessels, the zero-shot segmentation of SAM completely failed. In contrast, a simple fine-tuning of it with a small amount of data could lead to remarkable improvement of the segmentation quality, showing the great potential and feasibility of using fine-tuned SAM to achieve accurate medical image segmentation for a precision diagnostics. Our study indicates the versatility of generalist vision foundation models on medical imaging, and their great potential to achieve desired performance through fine-turning and eventually address the challenges associated with accessing large and diverse medical datasets in support of clinical diagnostics.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据