What question did this study set out to answer?

The aim is to enhance CNN inference performance using In-Network Computing within an Information-Centric Networking framework.

February 14, 2026Open Access

Accelerating CNN Inference via In-Network Computing in Information-Centric Networking

Key Points

The aim is to enhance CNN inference performance using In-Network Computing within an Information-Centric Networking framework.
Developed a collaborative inference acceleration mechanism integrating In-Network Computing in an Information-Centric Networking framework.
Leveraged name-based resolution to utilize underused computational resources across distributed nodes.
Created a distributed decision-making algorithm for dynamic CNN layer assignment based on network conditions.
Conducted extensive simulations on representative CNN models, particularly VGG16 under high concurrency.
Achieved a 43.3% reduction in average task completion time compared to IP-based approaches.
Attained a 60.2% reduction in task completion time relative to Edge-Cloud baselines.
Maintained a load balancing fairness index of over 0.86 during simulations.

Abstract

Although Convolutional Neural Networks (CNNs) have achieved remarkable accuracy in intelligent tasks, their increasing complexity hinders low-latency execution. While edge computing mitigates the wide-area network delays typical of cloud-based inference, it remains constrained by limited computational resources when processing complex models under high concurrency. Collaborative inference has emerged as a promising paradigm to address these limitations; however, existing approaches often struggle with rigid routing, limited scalability, and inefficient resource utilization. In this paper, we propose a novel collaborative inference acceleration mechanism that integrates In-Network Computing (INC) within an Information-Centric Networking (ICN) framework. By leveraging the name-based resolution capability of ICN, our approach dynamically harnesses underutilized computational resources across distributed INC nodes, enabling flexible layer-wise offloading that transcends the limitations of static IP paths. Furthermore, a distributed decision-making and node-selection algorithm is designed to orchestrate CNN layer assignment based on real-time network conditions and node workloads. Extensive simulations on representative models demonstrate the effectiveness of the proposed method. Specifically, for the computationally intensive VGG16 model under high concurrency, the average task completion time is reduced by 43.3% and 60.2% relative to IP-based and Edge-Cloud baselines, respectively, with a load balancing fairness index maintained above 0.86.

Bookmark

View Full Paper

Cite This Study

Hu et al. (Wed,) studied this question.

synapsesocial.com/papers/699012032ccff479cfe58b95 https://doi.org/https://doi.org/10.3390/electronics15040775

Bookmark

View Full Paper