ST-MoE: Designing Stable and Transferable Sparse Expert Models | Synapse