Deep Reinforcement Learning-Driven Adaptive Prompting for Robust Medical LLM Evaluation | Synapse