πŸ“œ
Awesome Papers
search
⌘Ctrlk
πŸ“œ
Awesome Papers
  • Introduction
  • Paper List
    • Systems for ML
    • ML for Systems
    • Artificial Intelligence (AI)
    • Hardware Virtualization
    • Resource Disaggregation
    • Resource Fragmentation
    • Cloud Computing
    • Remote Direct Memory Access (RDMA)
    • Research Skills
    • Miscellaneous
  • Reading Notes
    • Conference
    • Journal
    • Miscellaneous
      • arXiv
        • 2024
        • 2023
          • HexGen: Generative inference of foundation model over heterogeneous decentralized environment
          • High-throughput generative inference of large language models with a single GPU
        • 2022
        • 2016
      • MSR Technical Report
  • About Myself
    • Academic Profilearrow-up-right
    • Personal Blog (in Chinese)arrow-up-right
gitbookPowered by GitBook
block-quoteOn this pagechevron-down
githubEdit
  1. Reading Noteschevron-right
  2. Miscellaneouschevron-right
  3. arXiv

2023

HexGen: Generative inference of foundation model over heterogeneous decentralized environmentchevron-rightHigh-throughput generative inference of large language models with a single GPUchevron-right

Last updated 2 years ago