Mitigating Think-Answer Mismatch in LLM Reasoning Through Noise-Aware Advantage Reweighting | Synapse