Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis | Synapse