📜
Awesome Papers

2023

• HexGen: Generative inference of foundation model over heterogeneous decentralized environment
• High-throughput generative inference of large language models with a single GPU

Last updated 2 years ago