This paper proposes a safety-constrained Transformer–PPO framework for low-carbon economic dispatch with settable incentive-based demand response (DR) in wind–PV integrated electro–thermal–hydrogen industrial-park energy systems. Hydrogen is modeled as exogenous hydrogen-domain demand and is satisfied through electrolyzer production and hydrogen inventory dynamics. A causal Transformer captures long-horizon multi-energy coupling and intertemporal constraints and is trained with PPO under uncertainty. A dual-layer safety mechanism combines dual-variable (Lagrange multiplier) updates for statistical constraints with an execution-layer quadratic-programming action projection to enforce hard physical constraints, including operating limits, ramping, battery SOC, hydrogen inventory bounds, and energy balance. Baseline–verification–settlement rules and budget-ledger states are embedded to ensure verifiable response quantities and settlement outcomes that are traceable and independently recompilable. Case studies on a real industrial-park scenario in Inner Mongolia show reduced peak-hour maximum grid purchase demand and constraint violations, together with lower total cost, carbon cost, and curtailment penalties versus MILP, PPO-MLP, and Transformer–PPO without safety mechanisms.
Zhengjian et al. (Mon,) studied this question.