To achieve carbon neutrality goals, large amounts of renewable energy sources (RESs) are being integrated into power systems. In particular, high penetration of distributed photovoltaic (PV) makes distribution networks highly stochastic, calling for dynamic distribution network reconfiguration (DNR). Existing DNR approaches can be broadly categorized into model-driven optimization-based methods and learning-based methods, with deep reinforcement learning (DRL) being a representative paradigm for fast online decision-making. Existing DNR models typically belong to mixed-integer linear programming, which requires solution methods such as deep reinforcement learning (DRL). However, existing methods commonly struggle to account for human factors, i.e., the time-varying preferences of distribution network operators in DRL decisions. To this end, this paper proposes a natural language-driven, human-in-the-loop DNR framework, which combines a DRL base policy for hour-level dynamic reconfiguration with a large language model (LLM)-based instruction supervision layer. Based on this human-in-the-loop framework, commands from operators in natural language are translated into online adjustments of safety-screened DRL switching actions. Therefore, the framework demonstrates the fast, model-free decision capability of DRL while providing an explicit and interpretable interface for incorporating temporary and context-dependent operator requirements without retraining. Case studies on IEEE 16-bus and 33-bus distribution networks show that the proposed framework reduces network losses, improves voltage profiles, and limits switching operations. It also achieves markedly higher compliance with operator instructions than a conventional model-based method and a pure DRL baseline. These results highlight a viable path to embedding natural language guidance into the data-driven operation of active distribution networks.
Zhang et al. (Wed,) studied this question.