Low-Rank Reinforcement Learning With Heterogeneous Human Feedback: From Recommendation to Large Language Models | Synapse