September 3, 2024

Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse

Key Points

Key points are not available for this paper at this time.

Abstract

We develop a new class of model-free deep reinforcement learning algorithms for data-driven, learning-based control.Our Generalized Policy Improvement algorithms combine the policy improvement guarantees of on-policy methods with the efficiency of sample reuse, addressing a trade-off between two important deployment requirements for real-world control: (i) practical performance guarantees and (ii) data efficiency.We demonstrate the benefits of this new class of algorithms through extensive experimental analysis on a broad range of simulated control tasks.

Generalized Policy Improvement Algorithms with Theoretically Supported Sample Reuse

Key Points

Abstract

Cite This Study