March 6, 2024

Convergence of Deep Learning and Edge Computing using Model Optimization

Abstract

Edge systems are evolving rapidly to support artificial intelligence, deep learning, and other computationally complex algorithms. Running deep learning inference on cloud servers introduces response delays, higher communication costs, and data privacy concerns. Significant effort has therefore gone into moving the processing of deep learning models onto edge systems, giving rise to edge intelligence at the intersection of deep learning and edge computing. Deep learning models, especially deep convolutional neural networks, have achieved strong results in machine vision, delivering high accuracy and predictability at the cost of considerable computing power and memory. If these models can be optimized and deployed on edge systems, the range of real-time applications that edge systems can support will expand substantially. This paper investigates whether a typical convolutional neural network model can be deployed on edge systems with limited computing resources and memory by applying optimization techniques such as quantization, weight pruning, and weight clustering. The results show that a collaborative algorithm yields a model small enough to be deployed even on microcontrollers, at the cost of a slight decrease in accuracy.
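
The abstract names three optimization techniques without showing what they do to a model's weights. The following is a minimal sketch in plain Python, applied to a flat list of weights, and is not the paper's implementation. The function names, the 1-D k-means used for clustering, the choice to leave pruned zeros untouched during clustering, and the int8 affine scheme are all assumptions made for illustration.

```python
# Illustrative sketch only: toy versions of weight pruning, weight clustering,
# and quantization on a flat list of floats. All names here are hypothetical.

def prune_by_magnitude(weights, sparsity):
    """Zero out the fraction `sparsity` of weights with the smallest magnitude."""
    n_prune = int(len(weights) * sparsity)
    # Indices ordered by absolute value, smallest first.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned

def cluster_weights(weights, n_clusters, n_iters=20):
    """Snap each nonzero weight to one of n_clusters shared centroids (1-D k-means).

    Zeros are left untouched so that earlier pruning is preserved (an assumption
    of this sketch).
    """
    nonzero = [w for w in weights if w != 0.0]
    if not nonzero:
        return list(weights)
    lo, hi = min(nonzero), max(nonzero)
    # Spread the initial centroids evenly across the weight range.
    step = (hi - lo) / max(n_clusters - 1, 1)
    centroids = [lo + step * k for k in range(n_clusters)]
    for _ in range(n_iters):
        groups = [[] for _ in centroids]
        for w in nonzero:
            k = min(range(n_clusters), key=lambda j: abs(w - centroids[j]))
            groups[k].append(w)
        # Move each centroid to the mean of its group; keep empty ones in place.
        centroids = [sum(g) / len(g) if g else c for g, c in zip(groups, centroids)]
    return [0.0 if w == 0.0 else min(centroids, key=lambda c: abs(w - c))
            for w in weights]

def quantize_int8(weights):
    """Affine quantization to the int8 range; returns (codes, scale, zero_point)."""
    # Include 0.0 in the range so that zero is represented exactly.
    lo, hi = min(min(weights), 0.0), max(max(weights), 0.0)
    scale = (hi - lo) / 255 or 1.0
    zero_point = round(-128 - lo / scale)
    codes = [max(-128, min(127, round(w / scale) + zero_point)) for w in weights]
    return codes, scale, zero_point

def dequantize(codes, scale, zero_point):
    """Map int8 codes back to approximate float weights."""
    return [(q - zero_point) * scale for q in codes]

# Example pipeline on made-up weights: prune, then cluster, then quantize.
example = [0.9, -0.05, 0.4, -0.7, 0.02, 0.31, -0.33, 0.6]
sparse = prune_by_magnitude(example, sparsity=0.5)
shared = cluster_weights(sparse, n_clusters=2)
codes, scale, zero_point = quantize_int8(shared)
restored = dequantize(codes, scale, zero_point)
```

In this toy pipeline each step shrinks what must be stored: pruning introduces zeros that compress well, clustering reduces the number of distinct values, and quantization stores each weight as one byte plus a shared scale and zero point. How much accuracy is lost depends on the model and data, which this sketch does not measure.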

Cite This Study

Peyman Babaei studied this question.

synapsesocial.com/papers/68e758b6b6db6435876d03d5 https://doi.org/10.1109/mvip62238.2024.10491145