Data-driven Offline Reinforcement Learning for HVAC-systems | Synapse