SC 2023

Meta Info

Homepage: https://sc23.supercomputing.org/

Paper list: https://dl.acm.org/doi/proceedings/10.1145/3581784

Papers

Distributed Training

  • EasyScale: Elastic Training with Consistent Accuracy and Improved Utilization on GPUs [Paper] [Code]

    • BUAA & Alibaba

  • Hanayo: Harnessing Wave-like Pipeline Parallelism for Enhanced Large Model Training Efficiency [Paper] [Code]

    • NUS

GPU Sharing

  • Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach [Personal Notes] [Paper] [Code]

    • UMacau & SIAT, CAS

    • IADeep — a cluster scheduler to co-locate DL training tasks

Serverless Functions

  • Rethinking Deployment for Serverless Functions: A Performance-first Perspective [Paper] [Code]

    • TJU

    • Chiron

Last updated