General multi-agent reinforcement learning integrating heuristic-based delay priority strategy for demand and capacity balancing | Synapse