Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking | Synapse