6022 Background: Immunohistochemistry (IHC) marker testing and identification of pathological features are essential tasks in surgical pathology following resection of head and neck cancers. However, IHC testing is time-consuming, and identifying pathologic features typically requires examination of multiple slides. We hypothesized that imaging features from a single H&E-stained whole slide image (WSI) could predict this information using machine learning. Methods: We utilized data from the publicly available HANCOCK dataset for machine learning model development and testing. All patients had undergone surgical resection for locoregional head and neck cancer. Image features (embeddings extracted by TITAN, a pretrained vision-language pathology foundation model) from each primary tumor's H&E-stained WSI served as model inputs. Four machine learning algorithms (XGBoost, support vector machine, multilayer perceptron, and random forest) were trained to predict seven IHC markers (CD3, CD8, CD56, CD68, CD163, MHC-I, PD-L1) and three surgical pathological features (tumor grading, lymphovascular invasion, and perineural invasion). IHC marker positivity was determined using DeepLIIF-derived percent-positive staining, with non-PD-L1 markers binarized by median split and PD-L1 positivity defined using a 10% threshold. Surgical pathological features were binary except for tumor grade, which included four classes. All models underwent nested stratified five-fold cross-validation with hyperparameter tuning. Performance was assessed using AUC and F1 score for binary tasks (IHC markers, lymphovascular invasion, perineural invasion), and balanced accuracy and macro-F1 score and for tumor grading. Results: Between 685 and 699 cases were available for model development depending on the prediction task. Support vector machine (SVM) was the best-performing model for five tasks, followed by XGBoost (XGB) for three tasks and random forest (RF) for two tasks. Performance metrics for the best-performing models are detailed in the table. Conclusions: Our findings demonstrate that IHC markers and surgical pathological features can be predicted with reasonable accuracy from image features extracted from a single H&E-stained WSI. Future work will focus on external validation of these models in independent cohorts and exploration of advanced foundational models to further improve prediction performance. Task Best Model Metrics CD3 XGB AUC: 0.78 ± 0.04 F1: 0.72 ± 0.03 CD8 SVM AUC: 0.73 ± 0.03 F1: 0.67 ± 0.04 CD56 XGB AUC: 0.66 ± 0.03 F1: 0.67 ± 0.02 CD68 SVM AUC: 0.75 ± 0.03 F1: 0.70 ± 0.04 CD163 SVM AUC: 0.71 ± 0.05 F1: 0.67 ± 0.04 MHC1 SVM AUC: 0.68 ± 0.05 F1: 0.66 ± 0.03 PDL1 SVM AUC: 0.73 ± 0.06 Macro F1: 0.54 ± 0.04 Grading RF Balanced Acc: 0.71 ± 0.04 Macro F1: 0.70 ± 0.03 Lymphovascular Invasion RF AUC: 0.82 ± 0.02 Macro F1: 0.70 ± 0.02 Perineural Invasion XGB AUC: 0.78 ± 0.03 Macro F1: 0.67 ± 0.03
Sadek et al. (Wed,) studied this question.