What question did this study set out to answer?

The aim is to validate an AI-based organ segmentation workflow's accuracy and its influence on fluorescence quantification in preclinical imaging.

May 31, 2026Open Access

Validation of an automated AI-based micro-CT organ segmentation workflow against expert annotations and its impact on fluorescence quantification

Key Points

The aim is to validate an AI-based organ segmentation workflow's accuracy and its influence on fluorescence quantification in preclinical imaging.
Evaluated micro-CT scans of heart, lungs, liver, and kidneys from 27 mice.
Compared segmentations against expert annotations using Sørensen-Dice similarity coefficient.
Assessed impact on fluorescence measurements with iteratively reconstructed CT datasets.
AI segmentations showed organ-dependent agreement with expert assessments, especially lower for heart and lungs.
Segmentation time reduced from approximately 30 minutes to about 5 minutes per mouse, maintaining reproducibility.
Iterative CT reconstruction improved fluorescence measurement agreement with experts.

Abstract

Abstract Objective Automated, artificial intelligence (AI)-based, organ segmentation has the potential to streamline preclinical imaging workflows, but its suitability must be evaluated not only by geometric accuracy, but also by impact on downstream quantitative analyses. We validated a commercially available AI-based organ segmentation workflow for whole-body micro-computed tomography (micro-CT) data regarding segmentation accuracy, reproducibility, processing time, and effects on fluorescence tomography (FLT) quantification. Materials and methods Heart, lungs, liver, and kidneys were segmented in micro-CT scans from 27 mice by three independent experts and by a commercially available automated deep learning tool applied to standard and iteratively reconstructed CT datasets. Segmentation performance was evaluated using the Sørensen-Dice similarity coefficient (DSC) and organ volumes, while downstream effects were assessed using FLT overlay quantification across five fluorescent probes. Results AI-based segmentations showed organ-dependent agreement with experts, with lower DSC scores for heart and lungs reflecting systematic boundary differences. Repeated AI segmentations yielded identical results, demonstrating reproducibility while reducing segmentation time from approximately 30 min per mouse to about 5 min, with AI inference completed within seconds. While segmentation geometry and organ volumes differed from expert annotations in an organ-dependent manner, these differences showed no linear relationship with downstream FLT quantification. Iterative CT reconstruction improved the agreement of fluorescence measurements with expert-derived values. Conclusion Geometric segmentation metrics alone are insufficient to predict downstream fluorescence quantification, underscoring the need for task-based evaluation strategies in preclinical multimodal imaging. AI-based segmentation offers a reproducible and time-efficient alternative to manual annotation, enabling practical, scalable, and generalizable image analysis. Relevance statement Reproducible, AI-based organ segmentation can substantially improve the efficiency and scalability of quantitative preclinical imaging workflows, supporting more robust translation of multimodal imaging biomarkers toward clinical research and future patient-focused applications. Key Points In hybrid micro-CT/FLT imaging, geometric segmentation metrics did not reliably predict downstream fluorescence quantification outcomes. AI segmentation assessment should shift from benchmark accuracy metrics toward task-based validation reflecting real downstream imaging applications across different workflows. A commercially available AI-based organ segmentation tool demonstrates fully reproducible segmentation and substantially reduced analysis time compared with expert annotations. AI applied to iteratively reconstructed CT data improves the agreement of fluorescence quantification with expert annotations compared with standard reconstruction.

Mark Helpful

Bookmark

Relay

View Full Paper