Diffusion Models

Serving Diffusion Models

  • PipeFusion: Displaced Patch Pipeline Parallelism for Inference of Diffusion Transformer Models (arXiv:2405.14430) [arXiv] [Code]

    • Tencent & HKU

  • Cache Me if You Can: Accelerating Diffusion Models through Block Caching (CVPR 2024) [Paper] [Homepage]

    • Meta & TUM & MCML & Oxford

  • CAT-DM: Controllable Accelerated Virtual Try-on with Diffusion Model (CVPR 2024) [Paper] [Code]

    • TJU & Tencent

  • DeepCache: Accelerating Diffusion Models for Free (CVPR 2024) [Paper] [Code]

    • NUS

  • DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models (CVPR 2024) [Paper]

    • MIT & Princeton & Lepton AI & NVIDIA

  • Approximate Caching for Efficiently Serving Text-to-Image Diffusion Models (NSDI 2024) [Paper] [Slides]

    • Adobe Research & UIUC

Supporting Add-on Modules

  • X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model (CVPR 2024) [Paper] [Homepage] [Code]

    • NUS & Tencent & FDU

Domain-Specific Accelerator (DSA)

  • Cambricon-D: Full-Network Differential Acceleration for Diffusion Models (ISCA 2024)

    • ICT, CAS
