LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism | Synapse