Homepage: https://sc25.supercomputing.org
Paper list: https://sc25.conference-program.com
LLM Inference
Hetis: Serving LLMs in Heterogeneous GPU Clusters with Fine-grained and Dynamic Parallelism [Paper] [arXiv]
University of Macau & SYSU
LLM: Large Language Model
Last updated 5 months ago