What question did this study set out to answer?

This study aims to compare the performance of three AI models for segmenting short-axis cine images in cardiac magnetic resonance imaging.

June 4, 2026

Open Access

A comparison of vendor artificial intelligence solutions for automated post-processing of short-axis cine images in cardiovascular magnetic resonance imaging

Q: What is the clinical evidence from this study?

Study design: Observational. Population: Dilated cardiomyopathy, left ventricular hypertrophy, healthy volunteers, and other cardiac diseases (n=346). Intervention: Three AI models (two commercial, one research) for short-axis cine segmentation vs. expert-derived measurements. Primary outcome: Clinical parameter agreement between AI-derived and expert-derived ventricular volumes and left ventricular mass (r > 0.8).

Key Result

Three AI models for automated CMR segmentation showed strong agreement with expert measurements (r > 0.8), but produced clinically relevant differences across cardiac regions and disease groups.

Key Points

This study aims to compare the performance of three AI models for segmenting short-axis cine images in cardiac magnetic resonance imaging.
Assessed three AI models (two commercial, one research) on 346 cases with various cardiac conditions.
Evaluated agreement of AI-derived and expert-derived ventricular volumes and left ventricular mass using correlations and Dice coefficients.
Characterized slice detection with false positive and negative rates across different models.
AI-derived clinical parameters demonstrated strong agreement with expert measurements (r > 0.8).
Midventricular segmentation was reliable (Dice > 80%), while apical segmentation was poor (Dice < 65%), though with minor area impact (< 1 cm²).
Basal slice detection varied substantially between models, producing large area differences and differing volume estimates; papillary muscle exclusion further shifted volume and left ventricular mass estimates for one model, particularly in left ventricular hypertrophy cases.
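The Dice thresholds quoted above measure the overlap between an AI contour and an expert contour on the same slice. A minimal sketch of that overlap measure follows, assuming binary masks given as nested lists of 0/1. The function name, the toy masks, and the convention of returning 1.0 when both masks are empty are illustrative assumptions, not details taken from the study.

```python
def dice_coefficient(mask_a, mask_b):
    """Dice = 2 * |A and B| / (|A| + |B|) for two binary masks of equal shape."""
    flat_a = [bool(v) for row in mask_a for v in row]
    flat_b = [bool(v) for row in mask_b for v in row]
    if len(flat_a) != len(flat_b):
        raise ValueError("masks must have the same number of pixels")
    intersection = sum(a and b for a, b in zip(flat_a, flat_b))
    total = sum(flat_a) + sum(flat_b)
    if total == 0:
        # Assumption: two empty masks count as perfect agreement.
        return 1.0
    return 2.0 * intersection / total


# Hypothetical toy masks: expert marks 4 pixels, AI marks 3 of the same pixels.
expert_mask = [[0, 1, 1, 0],
               [0, 1, 1, 0]]
ai_mask = [[0, 1, 1, 0],
           [0, 1, 0, 0]]

dice = dice_coefficient(expert_mask, ai_mask)
print(f"Dice = {dice:.3f}")
```

Because the denominator is the total contour area, a one-pixel disagreement costs far more on a small apical contour than on a large midventricular one, which is consistent with low apical Dice values having only a minor area impact.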

Study Design

Type

Observational (n=346)

Structured PICO

Population

346 cases including dilated cardiomyopathy, left ventricular hypertrophy, healthy volunteers, and other cardiac diseases.

Exposure

Three AI models (two commercial, one research) for automated post-processing of short-axis cine images in cardiovascular magnetic resonance imaging

Comparator

Expert-derived measurements

Outcome

Clinical parameter agreement between AI-derived and expert-derived ventricular volumes and left ventricular mass (LVM) evaluated using correlations and mean differences (surrogate outcome)

While AI solutions for CMR segmentation show high overall agreement with experts, they are not interchangeable and can produce clinically relevant differences depending on the cardiac region and disease.

Main Result

Effect estimate: r > 0.8
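A high correlation alone does not show that two methods produce the same values, which is why the study reports mean differences alongside r. The sketch below illustrates this with made-up volumes in which the AI reads every case 10 mL higher than the expert. All numbers and function names are hypothetical and are not data from the study.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def mean_difference(ai_values, expert_values):
    """Average of (AI - expert); a non-zero value indicates systematic bias."""
    diffs = [a - e for a, e in zip(ai_values, expert_values)]
    return sum(diffs) / len(diffs)


# Hypothetical end-diastolic volumes in mL (illustrative only).
expert_volumes = [100.0, 150.0, 200.0, 250.0]
ai_volumes = [110.0, 160.0, 210.0, 260.0]

r = pearson_r(ai_volumes, expert_volumes)
bias = mean_difference(ai_volumes, expert_volumes)
print(f"r = {r:.3f}, mean difference = {bias:.1f} mL")
```

In this toy case the correlation is perfect while every measurement is offset by the same amount, so a threshold such as r > 0.8 can hold even when models are not interchangeable.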

Abstract

Automated segmentation of cardiac magnetic resonance (CMR) imaging is integrated into clinical workflows, yet comparative performance across vendor AI solutions remains insufficiently characterized. This study assessed three models (two commercial, one research) for short-axis cine segmentation in a diverse cohort of 346 cases, including dilated cardiomyopathy (DCM), left ventricular hypertrophy (LVH), healthy volunteers, and other cardiac diseases. Clinical parameter agreement between AI-derived and expert-derived ventricular volumes and left ventricular mass (LVM) was evaluated using correlations and mean differences, segmentation agreement with Dice coefficient, and slice detection was characterized with false positive and negative rates (FPR/FNR). Papillary muscle (PM) inclusion was examined with subgroup analyses. AI-derived clinical parameters agreed strongly with expert measurements (r > 0.8). Nevertheless, inter-model biases included differing ventricular volume estimates. Midventricular segmentation was reliable (Dice > 80%), whereas apical slices were poor (Dice < 65%) with minor area impact (< 1 cm²). Basal slice detection varied substantially, with AI1 and AI2 over- and AI3 under-detecting slices (e.g. RV FPR: AI1 24%, AI2 14%, AI3 FNR: 32%), producing large area differences. Due to PM exclusion AI2 overestimated volumes and underestimated LVM – particularly LVH-cases. While AI-expert agreement is high, AI solutions are not interchangeable and produce clinically relevant differences to experts across cardiac regions and disease groups.
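The slice detection rates in the abstract can be read as a per-slice comparison of whether the AI and the expert each drew a contour. The sketch below shows one common way to compute FPR and FNR, assuming the expert annotation is the reference, FPR = FP / (FP + TN), and FNR = FN / (FN + TP). The abstract does not state the exact denominators the authors used, so these definitions, the function name, and the example stack are assumptions.

```python
def slice_detection_rates(expert_has_contour, ai_has_contour):
    """Return (FPR, FNR) of AI slice detection against expert labels.

    Both inputs are equal-length sequences of booleans, one per slice.
    """
    tp = fp = tn = fn = 0
    for expert, ai in zip(expert_has_contour, ai_has_contour):
        if ai and expert:
            tp += 1
        elif ai and not expert:
            fp += 1  # AI segmented a slice the expert left empty
        elif not ai and expert:
            fn += 1  # AI skipped a slice the expert segmented
        else:
            tn += 1
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    fnr = fn / (fn + tp) if (fn + tp) else 0.0
    return fpr, fnr


# Hypothetical 8-slice stack, ordered base to apex (illustrative only).
expert_slices = [False, False, True, True, True, True, False, False]
ai_slices = [False, True, True, True, True, False, False, False]

fpr, fnr = slice_detection_rates(expert_slices, ai_slices)
print(f"FPR = {fpr:.2f}, FNR = {fnr:.2f}")
```

An extra basal slice adds a full large-area contour to the volume sum, whereas a missed apical slice removes only a small one, which may explain why basal over- or under-detection produces the larger area differences reported.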

Authors

Thomas Hadler

Max Delbrück Center

Clemens Ammann

University of Bern

Hadil Saad

Max Delbrück Center

Journals

Scientific Reports

Institutions

Humboldt-Universität zu Berlin

Max Delbrück Center

Siemens (Germany)
