Research on Head Deviation Control Method of Finishing Strip Based on Deep Reinforcement Learning | Synapse