Efficient large-scale language model training on GPU clusters using Megatron-LM | Synapse