Efficient training of large neural networks for language modeling | Synapse