We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on tasks in the DeepMind Control Suite and Atari Games, showing 1.9x and 1.2x performance gains at the 100K environment and interaction steps benchmarks. On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency of methods that use state-based features. Our code is open-sourced and available at: https://github.com/MishaLaskin/curl.
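To make the idea of contrastive feature learning concrete, here is a minimal sketch of a generic InfoNCE-style contrastive loss in NumPy. It is an illustration only and is not taken from the CURL code base: the function name `info_nce_loss`, the cosine similarity, and the `temperature` parameter are assumptions chosen for this example, and the abstract does not specify the exact objective the authors use. The sketch assumes that row i of `queries` and row i of `keys` are features of two views of the same observation, and that all other rows in the batch act as negatives.

```python
import numpy as np


def info_nce_loss(queries, keys, temperature=0.1):
    """Generic InfoNCE loss (illustrative, not the authors' implementation).

    queries, keys: arrays of shape (batch, dim). Row i of `keys` is the
    positive for row i of `queries`; every other row is a negative.
    """
    # L2-normalise so the dot product below is a cosine similarity.
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    k = keys / np.linalg.norm(keys, axis=1, keepdims=True)

    # (batch, batch) matrix of similarities between every query and key.
    logits = q @ k.T / temperature

    # Numerically stable log-softmax over each row.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))

    # Positives sit on the diagonal; average their negative log-likelihood.
    return float(-np.mean(np.diag(log_probs)))


if __name__ == "__main__":
    feats = np.eye(4)  # toy features: four mutually orthogonal vectors
    print("aligned pairs: ", info_nce_loss(feats, feats))
    print("shuffled pairs:", info_nce_loss(feats, np.roll(feats, 1, axis=0)))
```

In this toy setting the loss is small when each query is paired with its own key and large when the pairing is shuffled, which is the signal an encoder would be trained on. In a pixel-based agent, the features would come from an image encoder whose output is also fed to the off-policy control algorithm.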
Srinivas et al. studied this question.