4.6 Article

Partitioning DNNs for Optimizing Distributed Inference Performance on Cooperative Edge Devices: A Genetic Algorithm Approach

Journal

APPLIED SCIENCES-BASEL
Volume 12, Issue 20, Pages -

Publisher

MDPI
DOI: 10.3390/app122010619

Keywords

DNN partitioning; deployment optimization; genetic algorithm; distributed DNN inferencing

Funding

  1. Key Project of the National Natural Science Foundation of China [U1908212]

Ask authors/readers for more resources

To fully utilize the potential of edge devices, this paper proposes a layer-based DNN partitioning approach and an improved genetic algorithm to address the problem. Experimental results show that the proposed algorithm can achieve faster running time and better deployment performance.
To fully unleash the potential of edge devices, it is popular to cut a neural network into multiple pieces and distribute them among available edge devices to perform inference cooperatively. Up to now, the problem of partitioning a deep neural network (DNN), which can result in the optimal distributed inferencing performance, has not been adequately addressed. This paper proposes a novel layer-based DNN partitioning approach to obtain an optimal distributed deployment solution. In order to ensure the applicability of the resulted deployment scheme, this work defines the partitioning problem as a constrained optimization problem and puts forward an improved genetic algorithm (GA). Compared with the basic GA, the proposed algorithm can result in a running time approximately one to three times shorter than the basic GA while achieving a better deployment.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available