This repository contains the companion artifact for the paper "A Factorial Comparison of LLM-Based Multi-Agent Architectures for Automated Python Refactoring", submitted to the International Symposium on Empirical Software Engineering and Measurement (ESEM 2026). It includes the implementation of four LLM-based multi-agent system (MAS) architectures that automate Python code refactoring by varying two design dimensions — communication structure (hierarchical vs. centralized) and coordination strategy (static vs. dynamic) — all orchestrated with LangGraph. The artifact provides the full source code, configuration files, a curated dataset of 2,000 real-world ML project Python files sourced from GitHub, and scripts to reproduce the experimental results for RQ1–RQ3. It enables reproducibility, transparency, and future research on collaborative LLM-driven software maintenance approaches.
Anonymous (Tue,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: