Abstract Test case generation is a time-consuming and labor-intensive task vital to ensuring software reliability. Automating this process is critical for increasing efficiency and reducing potential human errors in test case generation. This study systematically examined the applications and motivations of Large Language Models (LLMs) in test case generation. The Systematic Literature Review (SLR) method was chosen to identify gaps in the existing literature and comprehensively evaluate the impact of LLMs in this field. 38 peer-reviewed articles published between 2020 and 2025 in databases such as Science Direct, IEEE Xplore, ACM Digital Library, and SpringerLink, addressing the use of LLMs in test case generation, were systematically analyzed. The review evaluated the datasets used, LLM training and test generation techniques, targeted programming languages, preprocessing and postprocessing methods, and integration strategies with existing software workflows. The findings highlight the ability of LLMs to increase the speed and coverage of test case generation in test automation, highlight challenges such as dataset quality and integration complexities, and suggest potential solutions to address these issues. This review provides an important resource for researchers using LLMs in automated test generation, providing insights into their capabilities and encouraging further research in this area.
Building similarity graph...
Analyzing shared references across papers
Loading...
Murat Tasarsu
Ahmet Vedat Tokmak
Cagatay Catal
Cluster Computing
Building similarity graph...
Analyzing shared references across papers
Loading...
Tasarsu et al. (Mon,) studied this question.
www.synapsesocial.com/papers/69ccb68116edfba7beb883a2 — DOI: https://doi.org/10.1007/s10586-026-06021-z
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: