Optimizing makespan and resource utilization for multi-DNN training in GPU cluster

出版年份 2021 全文链接

标题

作者

关键词

Deep neural network (DNN) training, Ring-Allreduce, Job scheduling, Resource allocation, Linear scaling rule (LSR), GPU cluster

出版物

Volume 125, Issue -, Pages 206-220

出版商

Elsevier BV

发表日期

2021-06-24

DOI

10.1016/j.future.2021.06.021

参考文献

查看 5 条相关文献

联系作者

Do you already have a recorded webinar? Grow your audience and get more views by easily listing your recording on Peeref.

Upload Now

The Peeref Institute provides free reviewer training that teaches the core competencies of the academic peer review process.

Get Started