Compress or Route? Task-Dependent Strategies for Cost-Efficient Large Language Model Inference | Synapse