What question did this study set out to answer?

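The study asks how to make dynamically scheduled deep neural network inference on edge devices more efficient during cold start, the period in which a newly deployed model still needs online profiling and policy adaptation before it reaches stable performance.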

May 17, 2026 | Open Access

EdgeOpt-Sched-CS: Cold-Start-Aware Dynamic Scheduling for Efficient DNN Inference at the Edge

Key Points

Aims to improve the efficiency of deep neural network inference on edge devices by reducing the cold-start overhead of dynamic scheduling.
Proposes EdgeOpt-Sched-CS, a cold-start-aware extension of dynamic graph scheduling for edge inference.
Initializes the scheduler of a target computation graph with scheduling knowledge transferred from structurally similar source graphs, instead of starting from a generic policy.
Evaluated across representative device–model scenarios involving lightweight convolutional neural networks, transformer models, and quantized language-model workloads.
Reduces cumulative cold-start latency by 10.6–20.4%.
Shortens time-to-stability by 5.2–21.7%.
Preserves the steady-state latency–memory behavior of the original dynamic scheduler with only small additional scheduling overhead.
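
To make the two headline metrics concrete, the sketch below shows one plausible way to compute them from a per-inference latency trace. The definitions used here (a fixed cold-start window, a tolerance band around the steady-state latency, and a required run of consecutive in-band inferences) are assumptions for illustration only; the summary does not state how the paper defines either metric. All function and parameter names are hypothetical.

```python
def cumulative_cold_start_latency(latencies_ms, window):
    """Sum of latencies over the first `window` inferences (assumed definition)."""
    return sum(latencies_ms[:window])


def time_to_stability(latencies_ms, steady_ms, tolerance=0.05, hold=3):
    """Index of the first inference that starts a run of `hold` consecutive
    latencies within `tolerance` of the steady-state latency (assumed definition).
    Returns None if the trace never stabilizes."""
    limit = steady_ms * (1 + tolerance)
    run = 0
    for i, latency in enumerate(latencies_ms):
        run = run + 1 if latency <= limit else 0
        if run == hold:
            return i - hold + 1
    return None


def relative_reduction(baseline, improved):
    """Percentage reduction of `improved` relative to `baseline`."""
    return 100.0 * (baseline - improved) / baseline
```

Under these assumed definitions, a warm-started scheduler would be compared against a generically initialized one by running both on the same workload, recording the two latency traces, and passing the resulting metric values to relative_reduction.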

Abstract

Dynamic scheduling can improve the latency and memory efficiency of deep neural network inference on edge devices, but it often introduces cold-start overhead when a newly deployed model requires online profiling and policy adaptation before reaching stable performance. This paper proposes EdgeOpt-Sched-CS, a cold-start-aware extension of dynamic graph scheduling for edge inference. The key idea is to initialize the scheduler of a target computation graph using scheduling knowledge transferred from structurally similar source graphs, instead of starting from a generic policy. EdgeOpt-Sched-CS constructs compact graph signatures, retrieves relevant source schedulers, and performs lightweight cold-start-aware online adaptation during early deployment. We evaluate the framework across representative device–model scenarios involving lightweight convolutional neural networks, transformer models, and quantized language-model workloads. The results show that EdgeOpt-Sched-CS reduces cumulative cold-start latency by 10.6–20.4% and shortens time-to-stability by 5.2–21.7%, while preserving the steady-state latency–memory behavior of the original dynamic scheduler with only small additional scheduling overhead. These findings indicate that scheduler initialization is an important optimization dimension for adaptive edge inference and that prior scheduling knowledge can be effectively reused across related computation graphs.
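
The signature-and-retrieval idea described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the signature is assumed to be a normalized histogram of operator types, similarity is assumed to be cosine similarity, and a scheduler policy is represented as a plain dictionary. The names graph_signature, retrieve_initial_policy, and min_similarity are hypothetical.

```python
from collections import Counter
from math import sqrt


def graph_signature(ops):
    """Assumed compact signature: normalized histogram of operator types."""
    counts = Counter(ops)
    total = sum(counts.values())
    return {op: n / total for op, n in counts.items()}


def cosine(a, b):
    """Cosine similarity between two sparse signatures stored as dicts."""
    dot = sum(value * b.get(op, 0.0) for op, value in a.items())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve_initial_policy(target_ops, library, generic_policy, min_similarity=0.8):
    """Pick the policy of the most similar source graph, or fall back to the
    generic policy when no source graph is similar enough.

    `library` maps a source name to a (operator list, policy dict) pair.
    Returns (policy, source name or None, best similarity score)."""
    target_sig = graph_signature(target_ops)
    best_name, best_score = None, 0.0
    for name, (source_ops, _policy) in library.items():
        score = cosine(target_sig, graph_signature(source_ops))
        if score > best_score:
            best_name, best_score = name, score
    if best_name is None or best_score < min_similarity:
        return dict(generic_policy), None, best_score
    # Copy the policy so later online adaptation does not mutate the library.
    return dict(library[best_name][1]), best_name, best_score
```

The returned policy would serve only as a starting point; the lightweight online adaptation that the abstract describes for early deployment is outside the scope of this sketch. An operator histogram ignores graph topology, so a real signature would likely need structural features as well.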
