Persistent Latent Identity in Multi-Agent Reinforcement Learning: Bilateral Replay, Sleep Consolidation, and Cross-Environment Generalization | Synapse