Eliminating Primacy Bias in Online Reinforcement Learning by Self-Distillation | Synapse