Thorough Characterization and Analysis of Large Transformer Model Training At-Scale | Synapse