Modernizing GPT-2-Scale Models Under a 10B-Token Training Budget | Synapse