Thinking Fast and Right: Balancing Accuracy and Reasoning Length with Adaptive Rewards | Synapse