Neural architecture search (NAS) for edge devices is often time-consuming because candidate networks must be deployed and tested on the devices themselves, which incurs long latency. Accurately predicting the computation cost and memory requirements of convolutional neural networks (CNNs) in advance is therefore of substantial value. Existing work relies primarily on analytical models, which can produce high prediction errors. This article proposes a resource-aware NAS (RaNAS) model based on various features. It also introduces a new graph neural network to predict the inference latency and maximum memory requirements of CNNs on edge devices. Experimental results show that, within an error bound of ±1%, RaNAS improves accuracy by approximately 8% for inference latency prediction and by about 25% for maximum memory occupancy prediction over state-of-the-art approaches.
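The abstract reports accuracy "within the error bound of ±1%" but does not define the metric. One common reading, assumed here, is the fraction of predictions whose relative error against the measured value is at most 1%. The sketch below illustrates that reading only; the function name `accuracy_within_bound` and the sample latency values are hypothetical and are not taken from the paper.

```python
def accuracy_within_bound(predicted, measured, bound=0.01):
    """Fraction of predictions whose relative error is within +/- bound.

    Assumption: "accuracy within an error bound" means
    |predicted - measured| <= bound * |measured| for each sample.
    The paper may define the metric differently.
    """
    if len(predicted) != len(measured):
        raise ValueError("predicted and measured must have the same length")
    if not measured:
        raise ValueError("need at least one sample")
    hits = sum(
        1
        for p, m in zip(predicted, measured)
        if abs(p - m) <= bound * abs(m)
    )
    return hits / len(measured)


# Hypothetical latency values in milliseconds, made up for illustration.
measured_ms = [10.0, 20.0, 40.0, 80.0]
predicted_ms = [10.05, 20.5, 39.8, 80.0]

# Three of the four predictions fall within 1% of the measurement.
print(accuracy_within_bound(predicted_ms, measured_ms))
```

Under this reading, an "8% improvement" would mean that the share of predictions falling inside the ±1% band is roughly eight percentage points higher than for the baseline, though the abstract does not say whether the gain is absolute or relative.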
Jianhua Gao
Zeming Liu
Yizhuo Wang
ACM Transactions on Architecture and Code Optimization
Beijing Institute of Technology
Beijing Normal University
Gao et al. studied this question.
www.synapsesocial.com/papers/69694713099e72f3f5c8fab1 — DOI: https://doi.org/10.1145/3703353