Multi-objective reinforcement learning based on nonlinear scalarization and long-short-term optimization | Synapse