Evolving LLMs from Next-Token Prediction to Multi-Token Prediction via Self-Distillation | Synapse