Optimal Policy Learning for Multi-Action Treatment with Risk Preference using Stata | Synapse