What question did this study set out to answer?

The aim is to develop a collaborative energy management strategy that balances economic efficiency, durability, and performance in fuel cell systems under dynamic driving conditions.

April 7, 2026Open Access

Research on the Collaborative Optimization Method of Power Prediction and DRL Control

Key Points

The aim is to develop a collaborative energy management strategy that balances economic efficiency, durability, and performance in fuel cell systems under dynamic driving conditions.
Developed the LSTM-LSSVM-CC hybrid prediction model for better power demand forecasting.
Introduced Dynamic Spatio-Temporal Fusion to enhance the DRL state space for decision-making.
Created a Dynamic Hierarchical Adaptive Multi-Objective Optimization Framework for adaptive control in varying scenarios.
Established a closed-loop control architecture for consistent feedback and prediction accuracy.
Achieved an 11.3% reduction in hydrogen consumption compared to traditional methods.
Reduced SOC fluctuation range by 30.9%.
Improved power tracking error by 55.3%.
Under disturbance scenarios, limited hydrogen consumption increase to 8.3 g/100 km and decreased power tracking error by 65.6%.

Abstract

This paper proposes a collaborative energy management strategy based on power prediction and deep reinforcement learning (DRL) to address the trade-offs among economic efficiency, durability, and dynamic performance in fuel cell hybrid power systems (FCHPS) under dynamic driving conditions. First, a hybrid prediction model termed LSTM-LSSVM with Cascade Correction (LSTM-LSSVM-CC) is developed. The cascade correction (CC) mechanism adopts a hierarchical structure to capture both low-frequency steady-state trends and high-frequency dynamic fluctuations, which are typically challenging for single models to represent. By integrating an online residual correction mechanism, this model generates accurate future power demand sequences. Second, a Dynamic Spatio-Temporal Fusion (DSTF) method is introduced to construct a high-dimensional DRL state space. This approach integrates predicted data, historical residuals, and real-time system states, enabling the agent to perform anticipatory decision-making. Third, a Dynamic Hierarchical Adaptive Multi-Objective Optimization Framework (DHAMOF) is designed. This framework dynamically adjusts objective weights and constraint boundaries based on real-time operating characteristics, enabling adaptive switching of optimization priorities across diverse scenarios. Furthermore, a closed-loop control architecture comprising “prediction–decision–execution–feedback” is established. By incorporating rolling horizon optimization and a proportional-integral (PI) residual compensation mechanism, the proposed architecture effectively suppresses prediction error accumulation and mitigates communication delays. Simulation results under combined CLTC-P and WLTP driving cycles demonstrate that, compared to conventional fixed-weight strategies, the proposed method achieves an 11.3% reduction in hydrogen consumption, a 30.9% decrease in SOC fluctuation range, and a 55.3% reduction in power tracking error. Moreover, under disturbance scenarios involving prediction errors, sensor noise, and a 200 ms communication delay, the system exhibits superior robustness: the increase in hydrogen consumption is limited to within 8.3 g/100 km, and the power tracking error is reduced by 65.6% relative to uncorrected baselines. This collaborative optimization approach overcomes the limitations of traditional open-loop prediction and fixed-weight control, offering a novel technical pathway for the high-efficiency and stable operation of fuel cell hybrid power systems.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Li et al. (Fri,) studied this question.

synapsesocial.com/papers/69d49f44b33cc4c35a227bc3 https://doi.org/https://doi.org/10.3390/pr14071150

Bookmark

View Full Paper