Heterogeneous reinforcement learning for defending power grids against attacks | Synapse