Deep Learning Recommendation Model (DLRM)
Optimizing Embedding Tables
UGache: A Unified GPU Cache for Embedding-based Deep Learning (SOSP 2023) [Personal Notes] [Paper]
SJTU
A unified multi-GPU cache system.
Used for GNN training and DLR inference.
EVStore: Storage and Caching Capabilities for Scaling Embedding Tables in Deep Recommendation Systems (ASPLOS 2023) [Personal Notes] [Paper] [Code]
UChicago & Beijing University of Technology & Bandung Institute of Technology, Indonesia & Seagate Technology & Emory
A caching layer optimized for embedding access patterns.
DisaggRec: Architecting Disaggregated Systems for Large-Scale Personalized Recommendation (arXiv 2212.00939) [Personal Notes] [Paper]
Meta AI & WashU & UPenn & Cornell & Intel
Disaggregated system; decouple CPUs and memory resources; partition embedding tables.
Faster Model Update
DLRM
Efficient Long Sequential User Data Modeling for Click-Through Rate Prediction (DLP-KDD 2022) [Paper]
Alibaba
ETA: Efficient target attention network
Locality-sensitive hashing
Deployed on Taobao.
Wide & Deep Learning for Recommender Systems (DLRS 2016) [Personal Notes] [Paper]
Google
WDL: Wide & Deep model
Acronyms
DLRM: Deep Learning Recommendation Model
Last updated