Rethinking Reasoning Quality in Large Language Models through Enhanced Chain-of-Thought via RL | Synapse