GitHub - rohitshubham/special_assignment_2020: Special Assignment

Dynamic Adaptive DNN Surgery for Inference Acceleration on the Edge

This is the implementation of DNN Surgery Light (DSL) algorithm originally published by Chuang Hu et. al. in IEEE INFOCOM 2019 [1].

Implementation details

It has been implemented in python3 using pytorch DNN library. This implementation uses VGG-16 as the underlying architecture and the function calls have been made using gRPC protocol (as per the original paper). It automatically uses CUDA cores if available on the device.

The implementation essentially converts the original graph to a minimum s-t cut problem. This implementation performs the cut using Boykov Kolmogorov algorithm. The bandwidth has been measure using iperf3 tool.

Architecture

A rough architecture and the overall data-flow has been summarized in the figure below.

Figure 1 : Architecture and data flow of the implementation

Requirements

We require the following dependencies on the edge device:

pytorch
gRPCio and gRPCio-tools
networkx
pyYaml and
matplotlib (optional)

Additionally, for running the server code, we require:

pytorch
gRPCio and gRPCio-tools

How to run

Install the required dependencies on the respective system
Generate the required metadata using generate_metadata.py
Update the configuration.yaml file
For edge device, run partial_inference_edge.py
For cloud device, run grpc_server.py in background

Figure 2: Running the code

Sample configuration file

The configuration.yaml file reads the necessary configuration about the server address and required model config. A sample configuration.yaml file should look like :

model:
    architecture: VGG16
    path: "/home/rohit/vgg16-397923af.pth"
    classes: 1000
server:
    address: "cloud"
    port: "50051"

Results

Figure 3: Representation of VGG-16 graph to the minimum s-t cut problem

Figure 4 : Size of each layer

Figure 5 : Latency of each layer

References

[1] C. Hu, W. Bao, D. Wang and F. Liu, "Dynamic Adaptive DNN Surgery for Inference Acceleration on the Edge," IEEE INFOCOM 2019 - IEEE Conference on Computer Communications, Paris, France, 2019, pp. 1423-1431, doi: 10.1109/INFOCOM.2019.8737614.

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
Project Plan		Project Plan
images		images
src		src
.gitignore		.gitignore
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dynamic Adaptive DNN Surgery for Inference Acceleration on the Edge

Implementation details

Architecture

Requirements

How to run

Sample configuration file

Results

References

About

Releases

Packages

Languages

rohitshubham/special_assignment_2020

Folders and files

Latest commit

History

Repository files navigation

Dynamic Adaptive DNN Surgery for Inference Acceleration on the Edge

Implementation details

Architecture

Requirements

How to run

Sample configuration file

Results

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages