What question did this study set out to answer?

May 6, 2026 | Open Access

Optimization and Benchmarking of Lightweight Neural Networks for Efficient Embedded AI Deployment

Key Points

Aims to establish an optimization and benchmarking framework for lightweight neural networks in embedded AI.
Develops an optimization framework for neural networks targeting heterogeneous embedded hardware (CPU, GPU, TPU, and MCU).
Applies model compression techniques such as quantization, pruning, and mixed-precision computation.
Evaluates performance using inference latency, model size, and power consumption.
Finds that optimized lightweight models improve computational efficiency.
Reports improved energy performance across edge deployment platforms.

Abstract

The rapid growth of artificial intelligence (AI) has accelerated its integration into embedded systems, enabling real‐time intelligence at the edge with reduced latency, improved privacy, and lower power consumption. This paradigm, known as Embedded AI, deploys machine learning and deep learning models directly on resource‐constrained platforms such as microcontrollers, FPGAs, and system‐on‐chip devices. This study presents a structured optimization and benchmarking framework for lightweight neural networks targeting heterogeneous embedded hardware platforms, including CPU, GPU, TPU, and MCU architectures. Model compression techniques such as quantization, pruning, and mixed‐precision computation are applied to reduce memory footprint and computational complexity while preserving classification accuracy. Performance evaluation is conducted using inference latency, model size, and power consumption as benchmarking metrics under consistent experimental conditions. Results demonstrate that optimized lightweight models significantly improve computational efficiency and energy performance across edge deployment platforms. The proposed framework provides practical guidance for selecting suitable neural network configurations for real‐time embedded AI applications.
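To make the compression and benchmarking steps named in the abstract concrete, here is a minimal sketch in plain Python. It is not the authors' framework: the function names, the toy weight vector, the 50% sparsity level, and the symmetric per-tensor int8 scheme are all assumptions chosen for illustration. It shows magnitude pruning, int8 quantization, a storage-size comparison, and a median-latency measurement. Power consumption is omitted because it needs platform-specific instrumentation.

```python
import time

def prune_by_magnitude(weights, sparsity):
    """Zero out the `sparsity` fraction of weights with the smallest magnitude."""
    k = int(len(weights) * sparsity)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    drop = set(order[:k])
    return [0.0 if i in drop else w for i, w in enumerate(weights)]

def quantize_int8(weights):
    """Symmetric per-tensor quantization to integers in [-127, 127] (assumed scheme)."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Map quantized integers back to approximate float values."""
    return [v * scale for v in q]

def median_latency_ms(fn, runs=50, warmup=5):
    """Median wall-clock time of fn() in milliseconds, after a few warm-up calls."""
    for _ in range(warmup):
        fn()
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1e3)
    samples.sort()
    return samples[len(samples) // 2]

# Hypothetical toy weights standing in for one layer of a model.
weights = [0.82, -0.05, 0.31, -1.27, 0.002, 0.64, -0.48, 0.09]
inputs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]

pruned = prune_by_magnitude(weights, sparsity=0.5)
q, scale = quantize_int8(pruned)
restored = dequantize(q, scale)

fp32_bytes = 4 * len(weights)  # 32-bit floats
int8_bytes = 1 * len(q)        # 8-bit integers (scale factor not counted)

# Latency of a toy "inference": one dot product with the quantized weights.
latency_ms = median_latency_ms(lambda: sum(a * b for a, b in zip(q, inputs)))

print("quantized weights:", q)
print("size (bytes): fp32 =", fp32_bytes, "int8 =", int8_bytes)
print("median latency (ms):", latency_ms)
```

In practice the same three measurements would be taken with the target toolchain on each device, with identical inputs and run counts, so that the numbers are comparable across platforms.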

Cite This Study

Fridous et al. (Fri,) studied this question.

synapsesocial.com/papers/69faa2b504f884e66b5334f8 https://doi.org/10.1002/eng2.70814
