This article comprehensively analyses hotness-based memory page movement strategies in multi-node heterogeneous memory systems. As modern computing environments increasingly adopt diverse memory technologies such as CXL, PIM, and HBM, efficiently managing memory pages across different tiers with varying latencies and bandwidths becomes crucial for system performance. We examine three primary monitoring techniques: PTE-scan methodology, fault induction monitoring, and PMU sampling approaches, evaluating their effectiveness, scalability, and implementation challenges. The article further explores dedicated hardware-based solutions, mainly focusing on CXL monitoring architectures and their advantages over traditional software approaches. Our analysis reveals that while each methodology offers distinct benefits, the future of memory management lies in hybrid solutions that combine hardware precision with software flexibility. We discuss emerging trends in this field, including integrating machine learning techniques and adaptive algorithms for predictive memory management, providing insights into the evolution of memory tiering solutions. The article suggests that as memory hierarchies become more complex, the development of intelligent, self-optimizing memory management systems will be essential for maintaining optimal system performance while balancing capacity and bandwidth requirements.
Peethambaran et al. (Mon,) studied this question.