What does this research mean for the field?

An ultrasound-based machine learning model can accurately identify microvascular invasion in patients with hepatocellular carcinoma preoperatively, achieving an AUC of 0.812 in external validation. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

The research aims to develop and validate a machine learning model for predicting microvascular invasion (MVI) in hepatocellular carcinoma (HCC).

March 7, 2026Open Access

Identification and validation of an ultrasound-based interpretable machine learning model for the preoperative evaluation of microvascular invasion in patients with hepatocellular carcinoma

Key Points

The research aims to develop and validate a machine learning model for predicting microvascular invasion (MVI) in hepatocellular carcinoma (HCC).
Retrospective multicenter study conducted in China.
Patients with HCC were enrolled and divided into training and validation sets.
LASSO regression was used for feature selection.
Comparison of four machine learning algorithms was performed for MVI prediction.
Model performance assessed using various metrics including AUC and sensitivity.
A total of 496 patients were enrolled, with 229 MVI-positive and 267 MVI-negative cases.
The Gradient Boosting Machine model achieved an AUC of 0.829 in the internal validation set.
In the external validation set, the model demonstrated an AUC of 0.812.
Key predictors included AFP, tumor size, and washout start time.

Abstract

The aim of our study was to develop and validate a machine learning model for the preoperative identification of microvascular invasion (MVI) in patients with hepatocellular carcinoma (HCC). This retrospective multicenter study was conducted in China. Patients with HCC from June 2017 to December 2024 were enrolled. Database was divided into training and internal validation sets randomly. Least absolute shrinkage and selection operator (LASSO) regression was employed for feature selection. Four machine learning algorithms were compared for MVI prediction. Model performance was evaluated using the area under the receiver operating characteristic (AUC), accuracy, sensitivity, specificity, precision, Youden's index, and F1 score. Finally, the machine learning model with the best performance was selected as our final model while using it for an independent external validation set. The SHapley Additive exPlanations (SHAP) diagram was utilized to elucidate the variable importance within the model, culminating in the amalgamation of the above metrics to discern the most succinct features. The study finally enrolled 496 patients, comprising 229 MVI-positive and 267 MVI-negative cases. A total of 42 patients with HCC were collected in the independent external validation center, of which 18 were MVI-positive. LASSO regression showed that AFP, tumor size, peripheral enhancement, mosaic architecture and washout start time were the significant predictors. Among the four models, the Gradient Boosting Machine (GBM) model showed the best performance in the internal validation set, with an AUC of 0.829. In the independent external validation set, the GBM model demonstrated an AUC of 0.812. The machine learning model shows promising efficacy in preoperative MVI identification for HCC patients. This method has potential clinical applications and may help identify MVI preoperatively, potentially improving clinical outcomes.

Bookmark

View Full Paper

Cite This Study

Zhang et al. (Thu,) studied this question.

synapsesocial.com/papers/69abc1a65af8044f7a4ea6e4 https://doi.org/https://doi.org/10.1186/s12885-026-15828-3

Bookmark

View Full Paper