Optimal Distributed Training With Co-Adaptive Data Parallelism in Heterogeneous Environments | Synapse