Toward Optimizing Reinforcement Learning Workload Placement at the Cloud-Edge Continuum in 6G Networks: A Scaled RL Framework | Synapse