1576 Background: Non-receipt of guideline-concordant treatment (GCT) for pancreatic cancer is associated with worse outcomes and occurs disproportionately in safety-net settings. Timely identification of patients at risk using traditional approaches remains challenging. We developed machine learning (ML) risk prediction models using electronic health record (EHR) data available at presentation to predict non-receipt of GCT and support targeted navigation. Methods: We conducted a retrospective analysis of patients with pancreatic cancer treated at two safety-net hospitals in Alabama from 2018-2022. GCT was defined based on NCCN guidelines. Candidate predictors included demographic, clinical, laboratory, access-to-care and healthcare utilization variables available at presentation. Primary outcome was non-receipt of GCT. Data were split into stratified 80/20 training and test sets given class imbalance (2/3 rd received GCT). Feature engineering was applied to reduce overfitting from high-cardinality variables. Classification and Regression Tree (CART) analysis was used to explore feature interactions and rank predictors. Multiple ML models were trained using PyCaret with default hyperparameters. Model performance was evaluated on the test set, with recall prioritized to reflect a screening use case in which false positives were acceptable. Results: Of 902 patients, 67.5% (n=609) received GCT. CART identified 30 candidate features. Highly ranked features included Area Deprivation Index, age, albumin, time from symptom onset to start of workup, employment status, presence of primary care physician, BMI and ECOG performance status. Performance metrics of the top 5 models are summarized in Table 1. Across models, test-set accuracy ranged from 72.9% to 86.2%, and recall for non-GCT ranged from 35.6% to 61.0%. Light Gradient Boosting model demonstrated the strongest overall screening performance, achieving the highest recall (61.0%), precision (94.7%), F1 score (0.74), and AUC (0.94). Conclusions: ML models using EHR data readily available at presentation can identify patients at high risk for non-receipt of GCT, driven in part by modifiable access-to-care and social vulnerability factors. Integration of these models into EHR workflows could enable automated risk flagging at diagnosis and trigger early referral to navigation and supportive services, with the potential to improve treatment delivery and outcomes. Test-set performance of machine learning models. Model Accuracy AUC Recall Precision F1 score Decision Tree Classifier 0.79 0.77 0.51 0.77 0.61 Logistic Regression 0.73 0.79 0.36 0.66 0.46 Light Gradient Boosting Machine (LGBM) Classifier 0.86 0.94 0.61 0.95 0.74 Extreme Gradient Boosting (XGB) Classifier 0.81 0.87 0.54 0.80 0.65 Ada Boost Classifier 0.77 0.85 0.46 0.73 0.56
Fonseca et al. (Wed,) studied this question.