What question did this study set out to answer?

This research addresses the challenge of managing computation reuse and memory across heterogeneous machine learning backends with different models.

April 25, 2026

Technical Perspective on 'MEMPHIS: Holistic Lineage-based Reuse and Memory Management for Multi-backend ML Systems'

Key Points

This research addresses the challenge of managing computation reuse and memory across heterogeneous machine learning backends with different models.
Analyzed the execution models of various ML platforms including local CPUs, GPUs, and Apache Spark.
Evaluated the memory hierarchies and caching mechanisms associated with these platforms.
Proposed a holistic approach to streamline memory and computation management across diverse backends.
Identified gaps in current research addressing multi-backend ML systems.
Demonstrated that current methods for memory management are insufficient for heterogeneous environments.
Outlined key strategies for enhancing management efficiency across different execution contexts.

Abstract

ML deployments in real-world settings are often heterogeneous, spanning local multi-core CPU operations, GPU-accelerated computations, and distributed execution on platforms such as Apache Spark. This heterogeneity arises from practical necessity: a feature engineering step might run locally, a large matrix multiplication offloads to Spark when data exceeds driver memory, and DNN layers execute on GPUs. Or an initial homogeneous setup might evolve over time into a heterogeneous one. This multi-backend reality creates a systems challenge that has received surprisingly little research attention: How to holistically manage computation reuse and memory across backends with fundamentally different execution models, memory hierarchies, and caching primitives?

Bookmark

Technical Perspective on 'MEMPHIS: Holistic Lineage-based Reuse and Memory Management for Multi-backend ML Systems'

Key Points

Abstract

Cite This Study