Optimal Demand Response Using Device-Based Reinforcement Learning | Synapse