March 12, 2026 · Open Access

CacheAware: Data Locality-Aware Scheduling for Distributed Memory Systems

Key Points

Targets performance loss caused by memory access latency in high-performance computing.
Introduces CacheAware, a compiler-runtime framework for data locality-aware scheduling.
Uses compiler analysis to annotate tasks with memory access footprints.
Combines this static information with runtime monitoring of cache miss patterns.
Integrates proactive and reactive strategies to minimize cache thrashing and remote memory fetches.
Evaluates performance on scientific benchmarks against Linux AutoNUMA, NUMA-aware schedulers, and task-based runtimes.
Reduces cache misses by up to 30%.
Improves execution time by over 20% compared with those baselines.
Concludes that CacheAware is a practical and scalable approach to enhancing data locality.

Abstract

The widening performance gap between processor speed and memory access latency has made data locality a critical bottleneck in high-performance computing. In Non-Uniform Memory Access (NUMA) and distributed memory systems, remote accesses incur penalties far greater than local operations, degrading the efficiency of scientific and data-intensive workloads. This paper introduces CacheAware, a compiler–runtime framework for data locality-aware scheduling. CacheAware leverages compiler analysis to annotate tasks with memory access footprints and combines this static information with runtime monitoring of cache miss patterns to guide scheduling and dynamic task migration. Unlike existing NUMA balancing or runtime tasking systems, CacheAware integrates both proactive and reactive strategies to minimize cache thrashing and remote memory fetches. Experimental evaluation on scientific benchmarks demonstrates reductions of up to 30% in cache misses and over 20% improvements in execution time compared to Linux AutoNUMA, NUMA-aware schedulers, and task-based runtimes. These results confirm that CacheAware provides a practical and scalable approach for enhancing data locality and accelerating workloads on modern distributed memory systems.
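The abstract does not describe the scheduling policy in detail, so the following is only a minimal sketch of how a proactive placement step and a reactive migration step could fit together. It is not the authors' implementation. The names `Task`, `Node`, `local_bytes`, `place`, and `maybe_migrate`, the footprint representation (region id mapped to bytes), and the 0.3 miss-rate threshold are all hypothetical and chosen purely for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Task:
    name: str
    # Hypothetical stand-in for a compiler annotation:
    # memory region id -> number of bytes the task touches in that region.
    footprint: dict[str, int]


@dataclass
class Node:
    name: str
    # Region ids whose data is resident in this node's local memory (assumed known).
    regions: set[str] = field(default_factory=set)


def local_bytes(task: Task, node: Node) -> int:
    """Bytes of the task's footprint that reside in the node's local memory."""
    return sum(size for region, size in task.footprint.items() if region in node.regions)


def place(task: Task, nodes: list[Node]) -> Node:
    """Proactive step: choose the node holding the largest share of the footprint."""
    return max(nodes, key=lambda n: local_bytes(task, n))


def maybe_migrate(task: Task, current: Node, nodes: list[Node],
                  miss_rate: float, threshold: float = 0.3) -> Node:
    """Reactive step: migrate only if the observed miss rate exceeds an
    (arbitrary, illustrative) threshold and another node has better locality."""
    if miss_rate <= threshold:
        return current
    best = place(task, nodes)
    return best if local_bytes(task, best) > local_bytes(task, current) else current


if __name__ == "__main__":
    nodes = [Node("n0", {"A", "B"}), Node("n1", {"C"})]
    task = Task("stencil", {"A": 4096, "C": 1024})
    home = place(task, nodes)
    print(home.name)
    print(maybe_migrate(task, nodes[1], nodes, miss_rate=0.5).name)
```

In this sketch the static footprint alone decides the initial placement, while the runtime miss rate only triggers a re-evaluation. A real system would also have to weigh migration cost and load balance, which are left out here.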

Authors

Haifa A. Alanazi

Northern Border University

Abdulaziz G. Alanazi

Northern Border University

Nasser Albalawi

Northern Border University
