The Three Regimes of Offline-to-Online Reinforcement Learning | Synapse