githubEdit

SoCC 2024

Meta Info

Homepage: https://acmsocc.org/2024/index.htmlarrow-up-right

Paper list: https://acmsocc.org/2024/schedule.htmlarrow-up-right

Acceptance Rate

30.1% (= 63 / 209)

Papers

Large Language Models (LLMs)

  • LLM inference

    • Queue Management for SLO-Oriented Large Language Model Serving [Paperarrow-up-right]

      • UIUC & IBM Research

  • LLM training

Mixture of Experts (MoEs)

GPU Sharing

  • KACE: Kernel-Aware Colocation for Efficient GPU Spatial Sharing [Paperarrow-up-right]

    • Stony Brook University

Serverless Computing

  • On-demand and Parallel Checkpoint/Restore for GPU Applications [Paperarrow-up-right]

    • SJTU IPADS & Shanghai Artificial Intelligence Research Institute

    • gCROP: GPU Checkpoint/Restore made On-demand and Parallel

Resource Scheduler

  • Scheduler for deep learning training workloads

    • Hops: Fine-grained heterogeneous sensing, efficient and fair Deep Learning cluster scheduling system [Paperarrow-up-right]

      • Anhui University & Institute of Artificial Intelligence, Hefei Comprehensive National Science Center

Distributed Training

  • Generative Adversarial Networks (GANs)

    • ParaGAN: A Scalable Distributed Training Framework for Generative Adversarial Networks [Paperarrow-up-right]

      • NUS

Last updated