木叶吟
木叶吟
Home
Experience
Publications
Posts
CV
Light
Dark
Automatic
2
UniSched: A Unified Scheduler for Deep Learning Training Jobs with Different User Demands
We present UniSched, a unified scheduler to optimize different types of scheduling objectives (e.g., guaranteeing the deadlines of SLO jobs, minimizing the latency of best-effort jobs).
Wei Gao
,
Zhisheng YE
,
Peng Sun
,
Tianwei Zhang
,
Yonggang Wen
Preprint
PDF
Cite
DOI
Deep Learning Workload Scheduling in GPU Datacenters: A Survey
Deep learning (DL) shows its prosperity in a wide variety of fields. The development of a DL model is a time-consuming and …
Zhisheng YE
,
Wei Gao
,
Qinghao Hu
,
Peng Sun
,
Xiaolin Wang
,
Yingwei Luo
,
Tianwei Zhang
,
Yonggang Wen
Preprint
PDF
Cite
Project
DOI
ASTRAEA: A Fair Deep Learning Scheduler for Multi-tenant GPU Clusters
We design a new and practical GPU scheduler, ASTRAEA, to enforce the desired fairness among tenants and jobs for deep learning training clusters.
Zhisheng YE
,
Peng Sun
,
Wei Gao
,
Tianwei Zhang
,
Xiaolin Wang
,
Shengen Yan
,
Yingwei Luo
Preprint
Cite
Code
DOI
Cite
×