Dynamic and Generalizable Process Reward Modeling | Synapse