# ICML 2023

## Meta Info

Homepage: <https://icml.cc/Conferences/2022>

Paper List: <https://icml.cc/virtual/2023/papers.html?filter=titles>

## Papers

### LLM Inference

* Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time \[[Paper](https://proceedings.mlr.press/v202/liu23am.html)]
* FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU \[[Personal Notes](https://paper.lingyunyang.com/reading-notes/miscellaneous/arxiv/2023/flexgen)] \[[Paper](https://proceedings.mlr.press/v202/sheng23a.html)]
