Recently, recommendation systems have actively integrated Transformers to capture real-time context. However, these systems often suffer from generalization imbalance, where predictions are biased toward popular (head) items due to the sparsity and volatility inherent in session-based data. To address this challenge, this paper proposes MoE-SLMRec, a Mixture-of-Experts (MoE)-based recommendation model that selects expert networks based on session-level contextual information. The proposed model extracts a session latent representation, h, through a session-aware controller and forms balanced predictive characteristics across the entire data distribution via dynamic routing. Experimental results demonstrate that MoE-SLMRec significantly outperforms the baseline SLMRec, improving accuracy by 1.51 percentage points (from 18.76% to 20.27%). Furthermore, the model achieved state-of-the-art performance in Recall@20 (0.8358) and MRR@20 (0.3455), validating simultaneous improvements in both retrieval capability and ranking quality. Notably, the model effectively stabilized the performance for head items while coordinating the generalization trade-off between head and tail segments. By ensuring a favorable capacity–cost trade-off while maintaining robust performance, this study presents a promising alternative under session-based recommendation settings, facilitating scalable deployment in real-time recommendation services.
Kwak et al. (Sat,) studied this question.