Random environment simulation-based multi-stage reinforcement learning for short-term scheduling of cascade hydropower stations