# Resource Fragmentation

## GPU Fragmentation

* Beware of Fragmentation: Scheduling GPU-Sharing Workloads with Fragmentation Gradient Descent ([ATC 2023](/reading-notes/conference/atc-2023.md)) \[[Paper](https://www.usenix.org/conference/atc23/presentation/weng)] \[[Code](https://github.com/hkust-adsl/kubernetes-scheduler-simulator)]
  * HKUST & Alibaba
  * Quantify GPU fragmentation in GPU-sharing clusters.
  * Guided scheduling with *fragmentation gradient descent.*
* HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees ([OSDI 2020](/reading-notes/conference/osdi-2020.md)) \[[Personal Notes](/reading-notes/conference/osdi-2020/hived.md)] \[[Paper](https://www.usenix.org/conference/osdi20/presentation/zhao-hanyu)] \[[Code](https://github.com/microsoft/hivedscheduler)]
  * PKU & HKU & MSRA
  * Consider GPU affinity; resource reservation.

## General Fragmentation

* Large-scale cluster management at Google with Borg (EuroSys 2015) \[[Paper](https://dl.acm.org/doi/10.1145/2741948.2741964)]
  * Google
  * Reduce *stranded resources* that cannot be used because another resource on the machine is fully allocated.
* Multi-Resource Packing for Cluster Schedulers (SIGCOMM 2014) \[[Paper](https://dl.acm.org/doi/10.1145/2619239.2626334)]
  * Microsoft
  * Tetris: *Pack* jobs to avoid resource fragmentation and over-allocation.
* Evaluating job packing in warehouse-scale computing ([CLUSTER 2014](https://github.com/mental2008/awesome-papers/blob/develop/reading-notes/conference/CLUSTER-2014/README.md)) \[[Personal Notes](/reading-notes/conference/cluster-2014/evaluating-job-packing.md)] \[[Paper](https://static.googleusercontent.com/media/research.google.com/en/pubs/archive/43103.pdf)]
  * Google
  * **Four** metrics for evaluating the packing efficiency of schedulers: *aggregate utilization*, *hole filling*, *workload inflation*, and *cluster compaction*.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://paper.lingyunyang.com/paper-list/resource-fragmentation.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
