Entropy-driven deep reinforcement learning for HVAC system optimization | Synapse