OSDI 2025
Last updated
Was this helpful?
Last updated
Was this helpful?
Homepage:
14.6% (= 48 / 327)
LLM Training
WLB-LLM: Workload-Balanced 4D Parallelism for Large Language Model Training
UCSD
LLM Inference
Fast and Live Model Auto Scaling with O(1) Host Caching
SJTU IPADS
KPerfIR: Towards a Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads
UCSD
CMU
Preemptive Scheduling for Diverse XPUs using Multi-level Hardware Model
SJTU IPADS
Enabling Efficient GPU Communication over Multiple NICs with FuseLink
HKUST iSING Lab
Decouple and Decompose: Scaling Resource Allocation through a Different Lens
Harvard
EMT: An OS Framework for New Memory Translation Architectures
UIUC
Quake: Adaptive Indexing for Vector Search
UW-Madison
Fast and Synchronous Crash Consistency with Metadata Write-Once File System
HIT-SZ
Tigon: A Distributed Database for a CXL Pod
UT-Austin
UC Berkeley
Mirage: A Multi-Level Superoptimizer for Tensor Programs [] []
Picsou: Enabling Efficient Cross-Consensus Communication []