Inicio
Explorar
nav.journalClub
Tendencias
Más
synapse
⌘+K
Idioma
Español
Español
Adaptive discount factor for accelerating policy learning considering long-term returns in reinforcement learning with non-stationary environments | Synapse
March 3, 2026
Adaptive discount factor for accelerating policy learning considering long-term returns in reinforcement learning with non-stationary environments
KO
Kazuki Ogawa
TG
Takeru Goto
Honda (Japan)
TA
Tatsuhito AIHARA
Ver todo
Puntos clave
Adaptive discount factor enhances policy learning in reinforcement learning, especially in changing environments.
A focus on long-term returns shows improved decision-making strategies across various scenarios.
Observational analysis across environments reveals benefits for dynamic reinforcement learning applications.
Highlights the need for continuous adaptation to optimize performance over time in changing circumstances.
Mark Helpful
Me gusta
Save
Guardar
Relay
Compartir
Mark Helpful
Me gusta
Save
Guardar
Relay
Compartir
Cite This Study
Copy
Ogawa et al. (Mon,) studied this question.
synapsesocial.com/papers/69a76603badf0bb9e87db4b1
https://doi.org/https://doi.org/10.1016/j.jocs.2026.102787