Key points are not available for this paper at this time.
Planning for both immediate and long-term benefits becomes increasingly important in recommendation. Existing methods apply Reinforcement Learning (RL) to learn planning capacity by maximizing cumulative reward for long-term recommendation. However, the scarcity of recommendation data presents challenges such as instability and susceptibility to overfitting when training RL models from scratch, resulting in sub-optimal performance. In this light, we propose to leverage the remarkable planning capabilities over sparse data of Large Language Models (LLMs) for long-term recommendation. The key to achieving the target lies in formulating a guidance plan following principles of enhancing long-term engagement and grounding the plan to effective and executable actions in a personalized manner. To this end, we propose a Bi-level Learnable LLM Planner framework, which consists of a set of LLM instances and breaks down the learning process into macro-learning and micro-learning to learn macro-level guidance and micro-level personalized recommendation policies, respectively. Extensive experiments validate that the framework facilitates the planning ability of LLMs for long-term recommendation. Our code and data can be found at https://github.com/jizhi-zhang/BiLLP.
Building similarity graph...
Analyzing shared references across papers
Loading...
Wentao Shi
Xiangnan He
Yang Zhang
Building similarity graph...
Analyzing shared references across papers
Loading...
Shi et al. (Thu,) studied this question.
www.synapsesocial.com/papers/68e76fa2b6db6435876e567c — DOI: https://doi.org/10.1145/3626772.3657683
Synapse has enriched 3 closely related papers on similar clinical questions. Consider them for comparative context: