Enabling Efficient Batch Serving for LMaaS via Generation Length Prediction | Synapse