Start
Entdecken
nav.journalClub
Trends
Mehr
synapse
⌘+K
Sprache
Deutsch
Deutsch
An offline actor-critic policy improvement algorithm with historical state-action pairs | Synapse
March 3, 2026
An offline actor-critic policy improvement algorithm with historical state-action pairs
HZ
Huaqing Zhang
Ministry of Education
XZ
Xiaofei Zhang
Beijing Institute of Technology
JJ
Jixiang Jiang
Ministry of Education
See all
Key Points
Key points are not available for this paper at this time.
Mark Helpful
Like
Save
Bookmark
Relay
Share
Mark Helpful
Like
Save
Bookmark
Relay
Share
Cite This Study
Copy
Zhang et al. (Thu,) studied this question.
synapsesocial.com/papers/69a759f0c6e9836116a1f571
https://doi.org/https://doi.org/10.1007/s13042-025-02963-9