A new perspective on optimizers: leveraging Moreau-Yosida approximation in gradient-based learning | Synapse