Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective | Synapse