Homepage: https://conferences.sigcomm.org/sigcomm/2023/
Janus: A Unified Distributed Training Framework for Sparse Mixture-of-Experts Models [Paper]
THU & ByteDance