Evaluating Mathematical Reasoning Beyond Accuracy | Synapse