Início
Explorar
nav.journalClub
Tendências
Mais
synapse
⌘+K
Idioma
Português
Português
Dynamic proximal policy optimization: Enhancing PPO with adaptive entropy and smooth clipping | Synapse
March 3, 2026
Dynamic proximal policy optimization: Enhancing PPO with adaptive entropy and smooth clipping
SS
Shiyu Sha
YL
Yanhong Liu
BH
Benyan Huo
Zhengzhou University
Key Points
Enhanced proximal policy optimization utilizes adaptive entropy and smooth clipping, improving stability.
Adaptive entropy adjustment leads to a significant performance increase in dynamic environments.
The approach applies a modified reinforcement learning framework to optimize policy gradient methods.
Results indicate potential for broader applications in complex decision-making tasks.
Mark Helpful
Like
Save
Bookmark
Relay
Share
Mark Helpful
Like
Save
Bookmark
Relay
Share
Cite This Study
Copy
Sha et al. (Sat,) studied this question.
synapsesocial.com/papers/69a75ef3c6e9836116a29fa7
https://doi.org/https://doi.org/10.1016/j.neucom.2026.132861