Interference-aware multiplexing for deep learning in GPU clusters: A middleware approach

Meta Info

Presented at SC 2023.

Understanding the paper

Opportunities in co-locating DL training tasks

  • Tune training configurations (e.g., batch size) across all co-located tasks

  • Choose appropriate tasks to multiplex on a GPU device (both opportunities are illustrated in the sketch after this list)
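
A minimal sketch of the two opportunities combined: enumerating which pair of tasks to multiplex on a GPU while jointly tuning their batch sizes. This is an illustration, not the paper's middleware; Task, estimate_throughput, and pick_pair are hypothetical names, and the toy interference model stands in for the profiling a real system would perform.

```python
from dataclasses import dataclass
from itertools import combinations, product

@dataclass
class Task:
    name: str
    batch_sizes: tuple[int, ...]  # candidate batch-size configurations

def estimate_throughput(bs_a: int, bs_b: int) -> float:
    """Toy interference model: co-located tasks slow each other down,
    with slowdown growing in the product of their batch sizes. A real
    middleware would profile or learn this per GPU."""
    slowdown = 1.0 + (bs_a * bs_b) / 2048
    return (bs_a + bs_b) / slowdown

def pick_pair(tasks: list[Task]):
    """Enumerate task pairs and their batch-size configurations; keep
    the combination with the best estimated aggregate throughput."""
    best = None
    for a, b in combinations(tasks, 2):
        for bs_a, bs_b in product(a.batch_sizes, b.batch_sizes):
            score = estimate_throughput(bs_a, bs_b)
            if best is None or score > best[0]:
                best = (score, a.name, bs_a, b.name, bs_b)
    return best

tasks = [Task("resnet50", (32, 64)), Task("bert", (8, 16)), Task("vgg16", (16, 32))]
print(pick_pair(tasks))
```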

Challenges

  • Trade-off between mitigating interference and accelerating training progress (e.g., a larger batch size speeds up a task's own progress but intensifies contention on the shared GPU) when minimizing overall training time

  • Vast search space of task configurations (see the back-of-the-envelope count after this list)

  • Coupling between adjusting task configurations and designing task placement policies
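
To make the "vast search space" point concrete, here is a back-of-the-envelope count under assumed numbers (my illustration; the parameters are not from the paper): with n tasks partitioned into GPU pairs and C candidate batch sizes per task, placements alone number n!/((n/2)! · 2^(n/2)), and each placement multiplies with C^n joint configuration choices. The coupling in the last bullet is why the two factors cannot be searched independently.

```python
from math import factorial

def pairings(n: int) -> int:
    """Ways to partition n tasks into n/2 unordered GPU pairs:
    n! / ((n/2)! * 2**(n/2))."""
    half = n // 2
    return factorial(n) // (factorial(half) * 2 ** half)

n_tasks, n_configs = 16, 4          # assumed cluster and config sizes
placements = pairings(n_tasks)      # 2,027,025 pairings
configs = n_configs ** n_tasks      # 4^16 ~= 4.3e9 joint batch-size choices
print(f"{placements:,} placements x {configs:,} configs = {placements * configs:,}")
```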
