DARLR: Dual-Agent Offline Reinforcement Learning for Recommender Systems with Dynamic Reward