An online hyper‐volume action bounding approach for accelerating the process of deep reinforcement learning from multiple controllers | Synapse