A novel method-based reinforcement learning with deep temporal difference network for flexible double shop scheduling problem