# Neurosurgeon: Collaborative intelligence between the cloud and mobile edge

## Meta Info

Presented in [ASPLOS 2017](https://doi.org/10.1145/3037697.3037698).

Authors: Yiping Kang, Johann Hauswald, Cao Gao, Austin Rovinski, Trevor Mudge, Jason Mars, Lingjia Tang (UMich).

## Understanding the paper

### TL;DRs

This paper presents **Neurosurgeon**, a lightweight scheduler to *automatically partition DNN computation* between mobile devices and data centers *at the granularity of neural network layers*.

It doesn't require per-application profiling.

### Overview

<figure><img src="https://819228986-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-MkzeiawY8SkBarQBDVm-659326392%2Fuploads%2FQ7CYuyFnKptoGlNusG9g%2Fimage.png?alt=media&#x26;token=e41899d2-aae3-43b8-b7fe-8c580c01ec59" alt=""><figcaption><p>System Overview.</p></figcaption></figure>

### Dynamic DNN Partitioning

1. Analysis of the target DNN (use the prediction models to estimate)
2. Partition point selection (for *best end-to-end latency* or *best mobile energy consumption*)

### Related Work

Previous research efforts focus on *offloading computation from the mobile to the cloud*.
