What question did this study set out to answer?

To automate the prediction of CVE severity from textual descriptions using machine learning techniques.

March 15, 2026Open Access

Automated CVE severity prediction using deep learning and explainable AI

Key Points

To automate the prediction of CVE severity from textual descriptions using machine learning techniques.
Utilized a generative language model to augment underrepresented classes.
Fine-tuned a DeBERTa-based deep learning model for severity classification.
Employed LIME for model interpretability and to identify influential terms.
Achieved high accuracy in predicting CVE severity levels from text.
Demonstrated strong predictive performance while providing insights into linguistic patterns.

Abstract

Cybersecurity vulnerabilities represent a critical threat to information systems, often leading to data breaches and operational disruptions. Accurate assessment of vulnerability severity is therefore essential for effective risk prioritization. The Common Vulnerabilities and Exposures (CVE) system maintains a catalog of such vulnerabilities, each accompanied by a brief textual description and a severity score, typically assigned using the Common Vulnerability Scoring System (CVSS). However, manually assigning severity scores is time-consuming and resource-intensive. This challenge highlights the need for automated approaches capable of predicting severity directly from textual data. In this study, we explore the automatic prediction of CVE severity levels from textual descriptions using machine learning. To address class imbalance, we leverage GPT-Neo, a generative language model, to synthetically augment underrepresented categories. We then fine-tune a DeBERTa-based deep learning model for classification, achieving high accuracy in predicting severity levels from text alone. To enhance interpretability, we employ Local Interpretable Model-Agnostic Explanations (LIME) to identify key terms and phrases that most strongly influence model decisions. This approach demonstrates strong predictive performance and provides insight into the linguistic patterns associated with vulnerability severity.

Automated CVE severity prediction using deep learning and explainable AI

Key Points

Abstract

Cite This Study