What question did this study set out to answer?

The aim is to develop a tool called Bugspyter that detects and repairs bugs in Jupyter Notebooks using large language models.

May 8, 2026 | Open Access

Bugspyter: detecting code bugs in Jupyter Notebooks using LLMs

Key Points

Developed an agent-based model to test LLM performance in detecting bug types and root causes.
Enhanced the model with static analysis results to improve accuracy.
Analyzed bug identification in executed notebooks versus those with static analysis results.
Bugspyter identified buggy notebooks and achieved high accuracy on implementation bug types.
Correctly identified coding errors as a root cause of bugs but performed poorly on other root causes.
Performance improved on executed notebooks, but adding static analysis results produced no change.

Abstract

Computational notebooks are increasingly used in data science, computer science, classrooms, the software industry, and various other fields. However, users often encounter errors, bugs, and vulnerabilities related to modularized code, unexecuted cells, and outdated library versions. This paper presents Bugspyter, a tool designed to detect and repair code bugs in Jupyter Notebooks using LLMs. We develop an agent-based model to test how well the LLM identifies the bug types and root causes of bugs in notebooks, and we enhance the model with static analysis results. Our results show that Bugspyter can identify buggy notebooks and achieves high accuracy in identifying implementation bug types. It can also identify coding errors as the root cause of bugs in a notebook, but it performs poorly on other root causes. Furthermore, LLM performance improves when identifying bugs in executed notebooks, but the inclusion of static analysis results produces no change in performance. This study contributes insights into enhancing the reliability of computational notebooks and helps reduce the manual evaluation needed to fix these issues.
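
The abstract does not describe how Bugspyter is implemented. As a rough illustration only, the sketch below shows one way a notebook could be prepared for LLM-based bug detection: it reads the standard `.ipynb` JSON fields (`cell_type`, `source`, `execution_count`, `outputs`), flags unexecuted and out-of-order cells, collects recorded error outputs, and assembles a text prompt. The function names `summarize_notebook`, `build_prompt`, and `load_notebook`, the prompt wording, and the `static_findings` parameter are all hypothetical and are not taken from the study.

```python
import json
from typing import Optional


def load_notebook(path: str) -> dict:
    """Read a .ipynb file, which is plain JSON on disk."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def summarize_notebook(nb: dict) -> dict:
    """Collect signals that often accompany notebook bugs (illustrative only)."""
    code_cells = [c for c in nb.get("cells", []) if c.get("cell_type") == "code"]
    sources, unexecuted, counts, errors = [], [], [], []
    for idx, cell in enumerate(code_cells):
        src = cell.get("source", "")
        # The notebook format allows source as a string or a list of lines.
        src = "".join(src) if isinstance(src, list) else src
        sources.append(src)
        count = cell.get("execution_count")
        if count is None:
            if src.strip():  # ignore empty cells that were never run
                unexecuted.append(idx)
        else:
            counts.append(count)
        for out in cell.get("outputs", []):
            if out.get("output_type") == "error":
                errors.append((idx, out.get("ename", ""), out.get("evalue", "")))
    return {
        "sources": sources,
        "unexecuted": unexecuted,
        # Execution counts that are not ascending mean cells ran out of order.
        "out_of_order": counts != sorted(counts),
        "errors": errors,
    }


def build_prompt(summary: dict, static_findings: Optional[list] = None) -> str:
    """Assemble a hypothetical prompt; the wording is an assumption."""
    lines = ["Identify the bug type and root cause in this notebook.", ""]
    for idx, src in enumerate(summary["sources"]):
        lines.append(f"# Cell {idx}")
        lines.append(src.rstrip())
    lines.append("")
    lines.append(f"Unexecuted cells: {summary['unexecuted']}")
    lines.append(f"Executed out of order: {summary['out_of_order']}")
    for idx, ename, evalue in summary["errors"]:
        lines.append(f"Error in cell {idx}: {ename}: {evalue}")
    if static_findings:
        # Optional static analysis results, e.g. linter messages.
        lines.append("Static analysis findings:")
        lines.extend(f"- {finding}" for finding in static_findings)
    return "\n".join(lines)
```

Calling `build_prompt(summarize_notebook(load_notebook("analysis.ipynb")))` would yield text that could be passed to any LLM client; passing `static_findings` mirrors the idea of supplying static analysis results alongside the notebook, which the study reports did not change performance.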

Cite This Study

Oluwadabira Omotoso studied this question.

synapsesocial.com/papers/69fd7fa1bfa21ec5bbf081b8 https://doi.org/10.14288/1.0452417
