What question did this study set out to answer?

May 6, 2026Open Access

Optimization and Benchmarking of Lightweight Neural Networks for Efficient Embedded AI Deployment

Key Points

The aim is to establish an optimization and benchmarking framework for lightweight neural networks in embedded AI.
Developed an optimization framework for neural networks targeting embedded hardware.
Applied model compression techniques like quantization and pruning.
Evaluated performance based on inference latency, model size, and power consumption.
Optimized lightweight models show improved computational efficiency.
Enhanced energy performance on various edge deployment platforms.

Abstract

ABSTRACT The rapid growth of artificial intelligence (AI) has accelerated its integration into embedded systems, enabling real‐time intelligence at the edge with reduced latency, improved privacy, and lower power consumption. This paradigm, known as Embedded AI, deploys machine learning and deep learning models directly on resource‐constrained platforms such as microcontrollers, FPGAs, and system‐on‐chip devices. This study presents a structured optimization and benchmarking framework for lightweight neural networks targeting heterogeneous embedded hardware platforms, including CPU, GPU, TPU, and MCU architectures. Model compression techniques such as quantization, pruning, and mixed‐precision computation are applied to reduce memory footprint and computational complexity while preserving classification accuracy. Performance evaluation is conducted using inference latency, model size, and power consumption as benchmarking metrics under consistent experimental conditions. Results demonstrate that optimized lightweight models significantly improve computational efficiency and energy performance across edge deployment platforms. The proposed framework provides practical guidance for selecting suitable neural network configurations for real‐time embedded AI applications.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Vidapankal Mohammad Fridous

Hindustan Institute of Technology and Science

Abhishek Agarwal

Royal University of Bhutan

K B Bhaskar

Global College

Journals

Engineering Reports

Actions

Institutions

Sri Venkateswara University

Hindustan Institute of Technology and Science

Royal University of Bhutan

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Optimization and Benchmarking of Lightweight Neural Networks for Efficient Embedded AI Deployment

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study