Inicio
Explorar
nav.journalClub
Tendencias
Más
synapse
⌘+K
Idioma
Español
Español
Dynamic proximal policy optimization: Enhancing PPO with adaptive entropy and smooth clipping | Synapse
March 3, 2026
Dynamic proximal policy optimization: Enhancing PPO with adaptive entropy and smooth clipping
SS
Shiyu Sha
YL
Yanhong Liu
BH
Benyan Huo
Zhengzhou University
Puntos clave
Enhanced proximal policy optimization utilizes adaptive entropy and smooth clipping, improving stability.
Adaptive entropy adjustment leads to a significant performance increase in dynamic environments.
The approach applies a modified reinforcement learning framework to optimize policy gradient methods.
Results indicate potential for broader applications in complex decision-making tasks.
Mark Helpful
Me gusta
Save
Guardar
Relay
Compartir
Cite This Study
Copy
Sha et al. (Sat,) studied this question.
synapsesocial.com/papers/69a75ef3c6e9836116a29fa7
https://doi.org/https://doi.org/10.1016/j.neucom.2026.132861
Mark Helpful
Me gusta
Save
Guardar
Relay
Compartir