Optimizing Large Language Model Scaling with Micro Batch Pipeline and Inference Parallelism | Synapse