What question did this study set out to answer?

The aim is to develop a framework for inverter-based Volt-VAR control that addresses coordination and partial observability challenges in active distribution networks.

March 3, 2026 | Open Access

Attention-Enhanced Multi-Agent Deep Reinforcement Learning for Inverter-Based Volt-VAR Control in Active Distribution Networks

Key points

Developed an attention-enhanced multi-agent deep reinforcement learning architecture.
Formulated the voltage regulation problem as a decentralized partially observable Markov decision process.
Implemented a centralized training and decentralized execution paradigm for agent interaction.
Conducted simulations on the IEEE 33-bus system with six PV inverters.
Reduced average voltage deviation from 0.0117 p.u. (droop control) and 0.0112 p.u. (MADDPG) to 0.0074 p.u.
Maintained millisecond-level execution time comparable to existing MADRL baselines.
Demonstrated robust performance in scalability tests with up to 12 agents under higher PV penetration.
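To make the reported deviation figures easier to interpret, the following is a minimal sketch of how an average voltage deviation and the relative improvement between two controllers could be computed. It assumes the metric is the mean absolute deviation from a 1.0 p.u. reference over all buses and time steps; the summary does not state the exact definition used in the study, and the function names `average_voltage_deviation` and `relative_reduction` are hypothetical.

```python
def average_voltage_deviation(voltages, v_ref=1.0):
    """Mean absolute deviation from v_ref (in p.u.).

    `voltages` is a list of time steps, each a list of bus voltages in p.u.
    ASSUMPTION: the metric averages |V - v_ref| over every bus and time step;
    the study may define it differently.
    """
    total = 0.0
    count = 0
    for step in voltages:
        for v in step:
            total += abs(v - v_ref)
            count += 1
    if count == 0:
        raise ValueError("no voltage samples")
    return total / count


def relative_reduction(baseline, proposed):
    """Fractional reduction of `proposed` relative to `baseline`."""
    return (baseline - proposed) / baseline


# Figures quoted in the key points (p.u.).
droop, maddpg, proposed = 0.0117, 0.0112, 0.0074
vs_droop = relative_reduction(droop, proposed)
vs_maddpg = relative_reduction(maddpg, proposed)
```

With the quoted figures, `vs_droop` is roughly 0.37 and `vs_maddpg` roughly 0.34, that is, about a 37% reduction relative to droop control and about 34% relative to MADDPG.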

Abstract

The increasing penetration of inverter-interfaced photovoltaic (PV) generation in active distribution networks (ADNs) intensifies fast voltage violations and makes real-time Volt-VAR control (VVC) challenging, especially when each inverter has only partial and noisy measurements and communication is limited. Existing local droop-type strategies lack coordination, while fully centralized optimization/learning is often impractical for online deployment. To address these gaps, an attention-enhanced multi-agent deep reinforcement learning (MADRL) framework is developed for inverter-based VVC under the centralized training and decentralized execution (CTDE) paradigm. First, the voltage regulation problem is formulated as a decentralized partially observable Markov decision process (Dec-POMDP) to explicitly account for system stochasticity and temporal variability under partial observability. To solve this complex game, an attention-enhanced MADRL architecture is employed, where an agent-level attention mechanism is integrated into the centralized critic. Unlike traditional methods that treat all neighbor information equally, the proposed mechanism enables each inverter agent to dynamically prioritize and selectively focus on the most influential states from other agents, effectively capturing complex intercorrelations while enhancing training stability and learning efficiency. Operating under the CTDE paradigm, the framework realizes coordinated reactive power support using only local measurements, ensuring high scalability and practical implementability in communication-constrained environments. Simulations on the IEEE 33-bus system with six PV inverters show that the proposed method reduces the average voltage deviation on the test set from 0.0117 p.u. (droop control) and 0.0112 p.u. (MADDPG) to 0.0074 p.u., while maintaining millisecond-level execution time comparable to other MADRL baselines. Scalability tests with up to 12 agents further demonstrate robust performance of the proposed method under higher PV penetration.
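The agent-level attention described in the abstract can be illustrated with a generic scaled dot-product attention step, in which one agent weights the encoded states of the other agents before they are passed to its critic. This is only a sketch under stated assumptions: the abstract does not specify the attention variant, the encoders, or the dimensions, so the `attend` function, the `queries`/`keys`/`values` inputs, and the toy numbers below are all hypothetical and stand in for learned embeddings.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]


def attend(agent, queries, keys, values):
    """Attention of `agent` over all OTHER agents.

    ASSUMPTION: standard scaled dot-product attention; the study's critic
    may use a different scoring function or multiple heads.
    Returns (weights_by_agent, context_vector).
    """
    d = len(queries[agent])
    others = [j for j in range(len(keys)) if j != agent]
    scores = [
        sum(q * k for q, k in zip(queries[agent], keys[j])) / math.sqrt(d)
        for j in others
    ]
    weights = softmax(scores)
    dim = len(values[0])
    # Context is the attention-weighted sum of the other agents' values.
    context = [
        sum(w * values[j][c] for w, j in zip(weights, others))
        for c in range(dim)
    ]
    return dict(zip(others, weights)), context


# Toy embeddings for three inverter agents (illustrative values only).
queries = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
keys = [[0.0, 0.0], [2.0, 0.0], [0.0, 2.0]]
values = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]

weights, context = attend(0, queries, keys, values)
```

In this toy case agent 0 places more weight on agent 1 than on agent 2 because its query aligns with agent 1's key. In a CTDE setup, a step of this kind would be used only inside the centralized critic during training, while each actor would still act on its local measurements at execution time.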

Cite This Study

Chen et al. (Sun,) studied this question.

synapsesocial.com/papers/69a67eb2f353c071a6f0a1f4 https://doi.org/10.3390/math14050839
