Efficient Model Compression and Knowledge Distillation on LLama 2: Achieving High Performance with Reduced Computational Cost | Synapse