Generalizable Process Reward Models via Formally Verified Training Data | Synapse