What does this research mean for the field?

A safety-guided deep reinforcement learning framework improves fuel economy by 8.36% and lithium-ion battery thermal safety by 10.14% over state-of-the-art baselines on a fuel cell bus platform under full-load conditions, while maintaining a zero unsafe duration ratio across real-world scenarios. Novelty: novel finding. Consensus alignment: neutral.

What question did this study set out to answer?

This research aims to improve the safety and efficiency of energy management systems in fuel cell vehicles using a novel framework.

March 4, 2026 · Open Access

Decoupled safety supervision empowering efficient and safe energy management for fuel cell vehicles

Key Points

Developed a safety-guided deep reinforcement learning (DRL) framework
Introduced an independent safety-guided network for enforcing safety constraints
Validated on a fuel cell bus platform
Compared with state-of-the-art baselines
Improved fuel economy by 8.36%
Enhanced lithium-ion battery thermal safety by 10.14%
Achieved zero unsafe duration ratio in real-world scenarios
Reduced violation severity by up to 21.88% under extreme thermal conditions
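
The two safety metrics named in the key points can be illustrated with a small sketch. This page does not give the paper's exact definitions, so the code below assumes that the unsafe duration ratio is the fraction of time samples above a temperature limit, and that violation severity is the mean overshoot above that limit. The function names, the 40 °C limit, and the sample trace are all hypothetical.

```python
from typing import Sequence


def unsafe_duration_ratio(temps_c: Sequence[float], limit_c: float) -> float:
    """Fraction of samples above the limit (assumed definition, not the paper's)."""
    if not temps_c:
        raise ValueError("temperature trace is empty")
    unsafe_samples = sum(1 for t in temps_c if t > limit_c)
    return unsafe_samples / len(temps_c)


def violation_severity(temps_c: Sequence[float], limit_c: float) -> float:
    """Mean overshoot above the limit over unsafe samples (assumed definition)."""
    overshoots = [t - limit_c for t in temps_c if t > limit_c]
    return sum(overshoots) / len(overshoots) if overshoots else 0.0


# Hypothetical battery temperature trace in degrees Celsius.
trace = [30.0, 35.0, 41.0, 43.0]
ratio = unsafe_duration_ratio(trace, limit_c=40.0)   # 2 of 4 samples exceed 40
severity = violation_severity(trace, limit_c=40.0)   # mean of 1.0 and 3.0
```

Under these assumed definitions, a zero unsafe duration ratio means no sample in the trace ever exceeds the limit, while violation severity measures how far the excursions go when they do occur.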

Abstract

Simultaneously ensuring operational efficiency and safety of energy systems remains a critical challenge for fuel cell vehicle energy management. Mainstream deep reinforcement learning (DRL) approaches often inadequately address explicit safety constraints, especially concerning lithium-ion battery (LIB) thermal management. This study proposes a safety-guided DRL framework introducing an independent safety-guided network to explicitly and reliably enforce safety constraints. By decoupling safety assurance from objective optimization, our architecture overcomes the mutual interference and reward-tuning difficulties inherent in existing reward-penalty methods. Validated on a fuel cell bus platform, our method outperforms state-of-the-art baselines, improving fuel economy by 8.36% and LIB thermal safety by 10.14% under full-load conditions. Notably, it maintains a zero unsafe duration ratio across real-world scenarios and reduces violation severity by up to 21.88% under extreme thermal conditions. These results demonstrate the proposed method’s robust safety assurance and generalization capability, positioning it as a practical solution for intelligent vehicle energy management.
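
The decoupling idea in the abstract can be sketched in a few lines. This is not the authors' method: the learned safety-guided network is replaced here by a hand-written rule, and the thermal model, its coefficients, the 40 °C limit, and every function name are hypothetical. The sketch only shows the structure, in which an objective-driven policy proposes a fuel cell power and a separate supervisor corrects it when the predicted battery temperature would exceed the limit, so that no safety penalty needs to be tuned into the reward.

```python
import math

HEAT_COEFF = 0.001   # hypothetical heating per kW^2 of battery power, per step
COOL_COEFF = 0.01    # hypothetical cooling rate toward ambient, per step
AMBIENT_C = 25.0     # hypothetical ambient temperature, deg C


def predict_temp(temp_c: float, batt_kw: float) -> float:
    """Toy one-step battery temperature model (an assumption for illustration)."""
    return temp_c + HEAT_COEFF * batt_kw ** 2 - COOL_COEFF * (temp_c - AMBIENT_C)


def supervise(proposed_fc_kw: float, demand_kw: float, temp_c: float,
              limit_c: float = 40.0) -> float:
    """Return the proposed fuel cell power, corrected only if it looks unsafe."""
    batt_kw = demand_kw - proposed_fc_kw   # battery covers the remaining demand
    if predict_temp(temp_c, batt_kw) <= limit_c:
        return proposed_fc_kw              # safe: leave the policy's action alone
    # Largest battery power magnitude that keeps the predicted temperature at the limit.
    headroom = limit_c - temp_c + COOL_COEFF * (temp_c - AMBIENT_C)
    max_batt_kw = math.sqrt(max(headroom, 0.0) / HEAT_COEFF)
    safe_batt_kw = max(-max_batt_kw, min(batt_kw, max_batt_kw))
    return demand_kw - safe_batt_kw        # fuel cell picks up the difference


# Hot battery: the supervisor shifts load from the battery to the fuel cell.
hot_action = supervise(proposed_fc_kw=10.0, demand_kw=50.0, temp_c=39.9)
# Cool battery: the proposed action passes through unchanged.
cool_action = supervise(proposed_fc_kw=15.0, demand_kw=20.0, temp_c=30.0)
```

Fuel cell power limits and ramp-rate constraints are omitted to keep the sketch short; a real supervisor would also have to respect them.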

Cite This Study

Jia et al. studied this question.

synapsesocial.com/papers/69a7ccd5d48f933b5eed89fe https://doi.org/10.1038/s44333-026-00087-3
