Key points are not available for this paper at this time.
Abstract Introduction Large Language Models (LLMs) have proven effective at analyzing unstructured and structured data from electronic health records (EHR). Their reliance on static EHR data means they miss the dynamic, high-resolution physiological signals that are the first indicators of patient decline, creating a fundamental data lag as clinical documentation of an event may occur hours after its onset. This represents a critical gap in the ability to develop truly effective triage tools. Furthermore, while LLMs can synthesize multi-source data for acuity assessment, they exhibit known limitations, including hallucinations that constrain autonomous use in safety-critical settings. In contrast, a Vision-Language Model (VLM) with strong image-grounded clinical reasoning can interpret vital sign trend images, surpassing the limitations of current EHR-based models. This approach provides immediate triage evaluations that lead to earlier detection of patient deterioration. Methods This mixed-methods study deployed a VLM (Gemini 2.5) integrated within the eICU dashboard. We used near-real-time two-hour trends in heart rate, blood pressure, respiratory rate, and oxygen saturation from consecutive patient cases during alerts in hospital floor units monitored by eICU that admit acute medical and surgical patients. To increase the diversity of the sample, we sought routine cases from patients without alerts and emergent cases from patients admitted to the medical, surgical, trauma, and mixed ICUs. A structured context prompt was used to generate a standardized summary including clinical interpretation, data quality and artifact assessment, triage recommendation, and the model’s chain-of-thought reasoning. Results We included patients from Mayo Clinic Rochester, MN, using an 800-patient cohort for prompt development and a 100-patient cohort for validation against clinically adjudicated results. In the validation cohort the VLM achieved 80% accuracy (80/100). An analysis of the 20 incorrect predictions revealed a clear pattern of conservative over-triage: 80% (16/20) of errors were clinically acceptable adjacent-class discrepancies (routine↔urgent or urgent↔emergent). Only 4/20 discrepancies represented a substantive misclassification. Most errors therefore reflected conservative prioritization rather than incorrect assessment. The model demonstrated a perfect safety profile, underestimating patient acuity in 0% of cases, and thus avoiding the most clinically dangerous error. These results confirm the feasibility of VLM-based approach and provide a strong foundation for the proposed work. Conclusion This VLM performs cross-modal reasoning by fusing deep visual perception with broad clinical context. The result is a fully contextualized and explainable triage prioritization, which provides direct decision support and integrates into the clinical workflow, allowing for faster and more confident acute care assessments. This abstract is funded by: none
Building similarity graph...
Analyzing shared references across papers
Loading...
I Strechen
P Krishnan
O Kilickaya
American Journal of Respiratory and Critical Care Medicine
Mayo Clinic in Arizona
Building similarity graph...
Analyzing shared references across papers
Loading...
Strechen et al. (Fri,) studied this question.
www.synapsesocial.com/papers/6a0d5013f03e14405aa9b99d — DOI: https://doi.org/10.1093/ajrccm/aamag162.3329
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: