Virtual power plants face dual challenges in coordinating internal distributed resources and engaging in strategic external trading under market uncertainties. This study proposes a two-stage game-theoretic decision framework to address these challenges. In the first stage, distributionally robust optimization aggregates photovoltaic systems, energy storage, and flexible loads to construct an internal cost function that incorporates operational risks. In the second stage, this cost function is embedded within a Stackelberg game to model the VPP’s strategic interaction with the external market, where a deep reinforcement learning algorithm adaptively approximates the equilibrium bidding strategy. Simulation results based on real market data from a Chinese provincial spot market and an actual VPP demonstration project demonstrate the effectiveness of the proposed approach. Over a 28-day test period spanning diverse seasonal conditions, the method increases average daily net profit by 18.7% compared to single-stage optimization and by 12.3% compared to deterministic game models, while improving the 95% Value-at-Risk by 22.5%. The findings confirm that integrating distributionally robust optimization for risk-aware internal aggregation with deep reinforcement learning for adaptive external gaming effectively supports VPPs in achieving risk-aware profit maximization within complex market environments.
Building similarity graph...
Analyzing shared references across papers
Loading...
Qi Yu
Lei Nie
Jinxuan Li
China Mobile (China)
China Southern Power Grid (China)
Guangzhou Automobile Group (China)
Building similarity graph...
Analyzing shared references across papers
Loading...
Yu et al. (Wed,) studied this question.
www.synapsesocial.com/papers/6a192f07fab5b468c4418498 — DOI: https://doi.org/10.1007/s42452-026-08842-4