Multilevel comparison of deep learning models for function quantification in cardiovascular magnetic resonance: On the redundancy of architectural variations

FRONTIERS IN CARDIOVASCULAR MEDICINE (2023)

Journal

FRONTIERS IN CARDIOVASCULAR MEDICINE

Volume 10

Publisher

FRONTIERS MEDIA SA

DOI: 10.3389/fcvm.2023.1118499

Keywords

cardiovascular magnetic resonance; MRI; artificial intelligence; deep learning; cardiac image segmentation; cardiac function quantification; quality control

Category

Cardiac & Cardiovascular Systems

Summary

The aim of this study was to analyze and compare the performance of three popular CNN models for cardiac function quantification. The results showed strong correlation between the models and expert segmentations in terms of quantitative clinical parameters. However, all models encountered difficulties and failures in segmenting the basal and apical slices.

Abstract

Background: Cardiac function quantification in cardiovascular magnetic resonance requires precise contouring of the heart chambers. This time-consuming task is increasingly being addressed by a plethora of ever more complex deep learning methods. However, only a small fraction of these have made their way from academia into clinical practice. In the quality assessment and control of medical artificial intelligence, the opaque reasoning and associated distinctive errors of neural networks meet an extraordinarily low tolerance for failure.

Aim: The aim of this study is a multilevel analysis and comparison of the performance of three popular convolutional neural network (CNN) models for cardiac function quantification.

Methods: U-Net, FCN, and MultiResUNet were trained for the segmentation of the left and right ventricles on short-axis cine images of 119 patients from clinical routine. The training pipeline and hyperparameters were kept constant to isolate the influence of network architecture. CNN performance was evaluated against expert segmentations for 29 test cases on contour level and in terms of quantitative clinical parameters. Multilevel analysis included breakdown of results by slice position, as well as visualization of segmentation deviations and linkage of volume differences to segmentation metrics via correlation plots for qualitative analysis.

Results: All models showed strong correlation to the expert with respect to quantitative clinical parameters (r_z' = 0.978, 0.977, 0.978 for U-Net, FCN, MultiResUNet respectively). The MultiResUNet significantly underestimated ventricular volumes and left ventricular myocardial mass. Segmentation difficulties and failures clustered in basal and apical slices for all CNNs, with the largest volume differences in the basal slices (mean absolute error per slice: 4.2 ± 4.5 ml for basal, 0.9 ± 1.3 ml for midventricular, 0.9 ± 0.9 ml for apical slices). Results for the right ventricle had higher variance and more outliers compared to the left ventricle. Intraclass correlation for clinical parameters was excellent (≥ 0.91) among the CNNs.

Conclusion: Modifications to CNN architecture were not critical to the quality of error for our dataset. Despite good overall agreement with the expert, errors accumulated in basal and apical slices for all models.
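To illustrate what a per-slice breakdown of this kind might look like in practice, the following is a minimal sketch and not the authors' pipeline. It assumes binary segmentation masks stacked as (slices, rows, columns) and uses the Dice coefficient as an example segmentation metric, which the abstract does not name. The function names, the pixel spacing, and the slice thickness are hypothetical placeholders.

```python
import numpy as np

def slice_volume_ml(mask, pixel_spacing_mm, slice_thickness_mm):
    """Volume of one binary slice mask in millilitres (1 ml = 1000 mm^3)."""
    voxel_mm3 = pixel_spacing_mm[0] * pixel_spacing_mm[1] * slice_thickness_mm
    return float(mask.astype(bool).sum()) * voxel_mm3 / 1000.0

def dice(a, b):
    """Dice overlap of two binary masks; two empty masks count as perfect agreement."""
    a = a.astype(bool)
    b = b.astype(bool)
    denom = a.sum() + b.sum()
    if denom == 0:
        return 1.0
    return float(2.0 * np.logical_and(a, b).sum() / denom)

def per_slice_report(pred, ref, pixel_spacing_mm=(1.5, 1.5), slice_thickness_mm=8.0):
    """Return (slice index, volume difference in ml, Dice) for each slice.

    The default spacing and thickness are illustrative assumptions only.
    """
    rows = []
    for z in range(ref.shape[0]):
        v_pred = slice_volume_ml(pred[z], pixel_spacing_mm, slice_thickness_mm)
        v_ref = slice_volume_ml(ref[z], pixel_spacing_mm, slice_thickness_mm)
        rows.append((z, v_pred - v_ref, dice(pred[z], ref[z])))
    return rows

def mean_absolute_error_ml(rows):
    """Mean absolute per-slice volume difference in ml."""
    return float(np.mean([abs(dv) for _, dv, _ in rows]))
```

Grouping the returned rows by slice position (basal, midventricular, apical) and plotting volume difference against Dice would give one possible version of the correlation plots described in the Methods.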
