What question did this study set out to answer?

The aim is to utilize a large language model to improve the identification and quantification of driver hazardous actions in vehicle crashes.

May 20, 2026Open Access

From narratives to probabilistic quantification: Predicting and interpreting drivers’ hazardous actions in crashes using large language model

Key Points

The aim is to utilize a large language model to improve the identification and quantification of driver hazardous actions in vehicle crashes.
Fine-tuned the Llama 3.2 1B model on five years of two-vehicle crash data from Michigan Traffic Crash Facts.
Benchmarking performance against traditional machine learning models like Random Forest, XGBoost, and CatBoost.
Developed a probabilistic scenario analysis for assessing the impact of distraction and age on hazardous actions.
The fine-tuned LLM achieved an overall accuracy of 80%, outperforming all baseline models.
Driver distraction significantly increased the likelihood of 'General Unsafe Driving'; distraction for both drivers maximized hazardous actions probability.
Introducing a teen driver elevated the probability of 'Speed and Stopping Violations.'

Abstract

Vehicle crashes involve complex interactions between road users, split-second decisions, and challenging environmental conditions. Among these, two-vehicle crashes are the most prevalent. Accounting for approximately 70% of roadway crashes, they present a significant challenge to traffic safety. Identifying Driver Hazardous Action (DHA) is essential for understanding crash causation, yet the reliability of DHA data in large-scale databases is limited by inconsistent and labor-intensive manual coding practices. Here, we present an innovative framework that leverages a fine-tuned large language model (LLM) to automatically infer DHAs from textual crash narratives, thereby improving the validity and providing quantifiable probabilistic insights into DHA classifications. Using five years (2019–2023) of two-vehicle crash data from Michigan Traffic Crash Facts (MTCF), we fine-tuned the Llama 3.2 1B model on detailed crash narratives and benchmarked its performance against conventional machine learning classifiers, including Random Forest, XGBoost, CatBoost, and a neural network. The fine-tuned LLM achieved an overall accuracy of 80%, surpassing all baseline models and demonstrating pronounced improvements in scenarios with imbalanced data. To increase provide quantifiable probabilistic insights, we developed a novel probabilistic scenario analysis approach, analyzing model output shifts across original test sets and three targeted counterfactual scenarios: variations in driver distraction and age. Our analysis revealed that introducing distraction for one driver substantially increased the likelihood of “General Unsafe Driving”; distraction for both drivers maximized the probability of “Both Drivers Took Hazardous Actions”; and assigning a teen driver markedly elevated the probability of “Speed and Stopping Violations.” Together, our framework and analytical methods provide a robust and quantifiable solution for large-scale automated DHA detection, offering new opportunities for traffic safety analysis and intervention.

Read Full Paperexternally

Demander à l'IA

Bookmark

View Full Paper