Mutual Information Tracks Policy Coherence in Reinforcement Learning | Synapse