Efficient and Stable Offline-to-online Reinforcement Learning via Continual Policy Revitalization | Synapse