What question did this study set out to answer?

This review aims to assess recent NLP applications for analyzing traffic crash reports to improve safety analysis and outcomes.

May 1, 2026Open Access

The State of Practice in Application of Natural Language Processing in Transportation Safety Analysis

Key Points

This review aims to assess recent NLP applications for analyzing traffic crash reports to improve safety analysis and outcomes.
Systematic literature selection process with predefined inclusion criteria.
Review categorization into crash severity prediction, risk factor identification, and road safety analysis.
Evaluation of various NLP techniques including probabilistic topic models and transformer-based models.
Different NLP techniques offer complementary advantages for analytical goals.
Hybrid methods combining text and structured data enhance prediction accuracy.
Challenges identified include class imbalance and lack of standardization in evaluation methods.

Abstract

This paper provides a systematic review of recent applications of NLP methods for analyzing traffic crash reports, with a focus on estimating crash severity, crash duration, and crash causation. The review covers prior research using probabilistic topic modeling methods such as LDA, STM, and hierarchical Dirichlet processes in addition to research using transformer-based language models, which include encoder-based models like BERT and PubMedBERT as well as decoder-based models like GPT, GPT2, ChatGPT, GPT-3, and LLaMA. The review starts with a systematic literature selection process with predefined inclusion criteria. We categorize the reviewed studies into the following application areas: crash severity prediction, risk factor identification in crashes, and road safety analysis. The results show several complementary advantages of using different NLP techniques to achieve different analytical goals. Topic models allow for interpretable and exploratory pattern discovery, while encoder models are well-suited for structured prediction problems. Decoder models have the additional flexibility to perform zero-shot and few-shot reasoning, which makes them useful for reasoning about under-sampled or under-reported data. Across the literature, hybrid methods that combine text and structured data outperform individual methods in terms of prediction accuracy and broad applicability. Challenges across the literature include class imbalance, lack of standardization in preprocessing and evaluation methods, and the tradeoff between prediction accuracy and interpretability of prediction models. These findings highlight the importance of aligning model selection with data availability and operational constraints, pointing toward future research directions in hybrid modeling frameworks, standardized evaluation protocols, and real-world deployment of NLP-driven traffic safety systems.

Read Full Paperexternally

AI에게 질문

Bookmark

View Full Paper