NFDI4DS | UHH-SEMS - Publication Details

DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference

Edge device Frequency scaling

DOI: 10.48550/arxiv.2306.01811 Publication Date: 2023-01-01

Abstract Supplemental Material References Cited by

AUTHORS (4)

Ziyang Zhang

Yang Zhao

Huan Li

Jie Liu

ABSTRACT

Due to limited resources on edge and different characteristics of deep neural network (DNN) models, it is a big challenge optimize DNN inference performance in terms energy consumption end-to-end latency devices. In addition the dynamic voltage frequency scaling (DVFS) technique, edge-cloud architecture provides collaborative approach for efficient inference. However, current methods have not optimized various compute Thus, we propose DVFO, novel DVFS-enabled framework, which co-optimizes DVFS offloading parameters via reinforcement learning (DRL). Specifically, DVFO automatically 1) CPU, GPU memory frequencies devices, 2) feature maps be offloaded cloud servers. addition, leverages thinking-while-moving concurrent mechanism accelerate DRL process, spatial-channel attention extract secondary importance workload offloading. This improves models under conditions. Extensive evaluations using two datasets six widely-deployed three heterogeneous devices show that significantly reduces by 33% average, compared state-of-the-art schemes. Moreover, achieves up 28.6%-59.1% reduction, while maintaining accuracy within 1% loss average.

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENALEX - Publications OPENAIRE - Products

PlumX Metrics

DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....