Reinforcement learning with parameterized action space and sparse reward for UAV navigation | Synapse