Serving unseen deep learning model with near-optimal configurations: A fast adaptive search approach

Meta Info

Presented in SoCC 2022.

Authors: Yuewen Wu, Heng Wu, Diaohan Luo, Yuanjia Xu, Yi Hu, Wenbo Zhang, Hua Zhong, Institute of Software, Chinese Academy of Sciences

Code: https://github.com/dos-lab/Falcon

Understanding the paper

TL;DRs

This paper presents Falcon, a novel configuration recommender system that can quickly adapt to unseen DL models.

Existing Problem

There exists a cold start problem when searching a configuration of DL models.

Key Insights

  • Some Key Operators (KOPs) can be used to estimate the performance of DL models.

  • The resource sensitivity can be represented by four typical Key Operator Resource Curves (KOP-RCs), including Slope, Convex, Concave, and Plane.

Solutions

  • A DL model can be characterized by its KOPs and corresponding KOP-RCs.

  • Use a combination of Monte Carlo Tree Search and Bayesian Optimization (MCTS-BO) to search the configurations.

Evaluation

  • Compared to Morphling.

Last updated