githubEdit

Resource Scheduler

circle-info

I am actively maintaining this list.

Scheduling for DL Training Workloads

Scheduling for General ML Training Workloads

  • SLAQ: Quality-Driven Scheduling for Distributed Machine Learning (SoCC 2017) [Personal Notes] [Paperarrow-up-right]

    • Princeton

    • Fine-grained job-level scheduler

    • Leverage the iterative nature of general ML training algorithms

Trace Analysis

Survey

Acronyms

  • DL: Deep Learning

  • ML: Machine Learning

Last updated