Data Mixing for Large Language Models Pretraining: A Survey and Outlook | Synapse