Distributed Deep Learning Series

cliff_zf

Article link: https://blog.csdn.net/shixiangyun2/article/details/52775135

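This series surveys the current state of research on distributed deep learning systems and the key points for applying them in practice.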

1 Introduction to distributed training of neural networks

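Distributed training falls mainly into two approaches, model parallelism and data parallelism, and the two can also be combined.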

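Data parallelism is the recommended choice: it leaves more room for theoretical research, whereas model parallelism looks thin by comparison.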
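To make the two approaches concrete, here is a minimal single-process sketch in plain Python. It is an illustration only, not code from any of the systems in the reference list: the "workers" are simulated by list slices, and every function name (`data_parallel_grad`, `model_parallel_forward`, and so on) is hypothetical. It assumes the batch size and the number of weight rows divide evenly among the workers.

```python
import random

def matvec(W, x):
    """Multiply a matrix (a list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def grad_squared_error(w, batch):
    """Gradient of the mean squared error of the linear model w.x over a batch."""
    g = [0.0] * len(w)
    for x, y in batch:
        err = sum(wi * xi for wi, xi in zip(w, x)) - y
        for i, xi in enumerate(x):
            g[i] += 2.0 * err * xi / len(batch)
    return g

def data_parallel_grad(w, batch, n_workers):
    """Data parallelism: every worker has the full model and an equal slice of the data."""
    shard = len(batch) // n_workers  # assumes len(batch) is divisible by n_workers
    grads = [grad_squared_error(w, batch[k * shard:(k + 1) * shard])
             for k in range(n_workers)]
    # Averaging the per-worker gradients recovers the full-batch gradient.
    return [sum(col) / n_workers for col in zip(*grads)]

def model_parallel_forward(W, x, n_workers):
    """Model parallelism: every worker has a slice of the weights and the full input."""
    rows = len(W) // n_workers  # assumes len(W) is divisible by n_workers
    out = []
    for k in range(n_workers):
        out.extend(matvec(W[k * rows:(k + 1) * rows], x))
    return out

rng = random.Random(0)
w = [rng.uniform(-1, 1) for _ in range(3)]
batch = [([rng.uniform(-1, 1) for _ in range(3)], rng.uniform(-1, 1)) for _ in range(8)]
W = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(4)]
x = batch[0][0]

full_grad = grad_squared_error(w, batch)
dp_grad = data_parallel_grad(w, batch, n_workers=4)   # matches full_grad up to rounding
mp_out = model_parallel_forward(W, x, n_workers=2)    # matches matvec(W, x)
```

In the data-parallel case the workers only exchange gradients of the same shape as the model; in the model-parallel case they exchange activations, which is one reason the two are analysed so differently.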

The concepts involved include stochastic gradient descent (SGD) and the parameter server. …
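As a rough picture of how SGD and a parameter server fit together, the sketch below simulates a synchronous parameter server in a single process: workers pull the current weights, compute a minibatch gradient on their own data shard, and push it back, and the server averages the gradients and takes one SGD step. This is an assumption-laden toy, not the design of any particular system; the `ParameterServer` class, its `pull`/`push` methods, the learning rate, and the synthetic data are all made up for illustration. Real systems run workers concurrently and often asynchronously.

```python
import random

class ParameterServer:
    """Hypothetical in-process server that owns the shared weights."""

    def __init__(self, dim, lr):
        self.w = [0.0] * dim
        self.lr = lr

    def pull(self):
        """Return a copy of the current weights for a worker to use."""
        return list(self.w)

    def push(self, grads):
        """Synchronous update: average the workers' gradients, then take one SGD step."""
        n = len(grads)
        for i in range(len(self.w)):
            self.w[i] -= self.lr * sum(g[i] for g in grads) / n

def worker_gradient(w, minibatch):
    """Mean squared-error gradient of a linear model on one worker's minibatch."""
    g = [0.0] * len(w)
    for x, y in minibatch:
        err = sum(wi * xi for wi, xi in zip(w, x)) - y
        for i, xi in enumerate(x):
            g[i] += 2.0 * err * xi / len(minibatch)
    return g

def train(shards, dim, lr=0.1, steps=500, batch_size=4, seed=0):
    """Run synchronous data-parallel SGD with one simulated worker per shard."""
    rng = random.Random(seed)
    server = ParameterServer(dim, lr)
    for _ in range(steps):
        w = server.pull()
        grads = [worker_gradient(w, rng.sample(shard, batch_size)) for shard in shards]
        server.push(grads)
    return server.w

# Synthetic, noise-free data from y = 2*x0 - 3*x1 (an assumption for the demo).
data_rng = random.Random(1)
true_w = [2.0, -3.0]
data = []
for _ in range(64):
    x = [data_rng.uniform(-1, 1), data_rng.uniform(-1, 1)]
    data.append((x, sum(t * xi for t, xi in zip(true_w, x))))

shards = [data[k::4] for k in range(4)]  # 4 workers, 16 examples each
learned_w = train(shards, dim=2)         # should end up close to true_w
```

Waiting for every worker before each update keeps the gradients consistent but lets the slowest worker set the pace, which is the trade-off that asynchronous and staleness-aware variants try to relax.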

The details will follow in later posts.

References

Distributed Deep Learning

[1] Kai Chen and Qiang Huo. Scalable training of deep learning machines by incremental block training with intra-block parallel optimization and blockwise model-update filtering. In 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 5880–5884. IEEE, 2016.

[2] Jeffrey Dean, Greg Corrado, Rajat Monga, Kai Chen, Matthieu Devin, Mark Mao, Andrew Senior, Paul Tucker, Ke Yang, Quoc V Le, et al. Large scale distributed deep networks. In Advances in Neural Information Processing Systems, pages 1223–1231, 2012.

[3] Suyog Gupta, Wei Zhang, and Josh Milthorpe. Model accuracy and runtime tradeoff in distributed deep learning. arXiv preprint arXiv:1509.04210, 2015.

[4] Qirong Ho, James Cipar, Henggang Cui, Seunghak Lee, Jin Kyu Kim, Phillip B. Gibbons, Garth A Gibson, Greg Ganger, and Eric P Xing. More effective distributed ml via a stale synchronous parallel parameter server. In C. J. C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Q. Weinberger, editors, Advances in Neural Information Processing Systems 26, pages 1223–1231. Curran Associates, Inc., 2013.

[5] Forrest N Iandola, Khalid Ashraf, Matthew W Moskewicz, and Kurt Keutzer. Firecaffe: near-linear acceleration of deep neural network training on compute clusters. arXiv preprint arXiv:1511.00175, 2015.

[6] Augustus Odena. Faster asynchronous sgd. arXiv preprint arXiv:1601.04033, 2016.

[7] Nikko Strom. Scalable distributed dnn training using commodity gpu cloud computing. In Sixteenth Annual Conference of the International Speech Communication Association, 2015. http://nikkostrom.com/publications/interspeech2015/strom_interspeech2015.pdf.

[8] Hang Su and Haoyu Chen. Experiments on parallel training of deep neural network using model averaging. arXiv preprint arXiv:1507.01239, 2015.

[9] Wei Zhang, Suyog Gupta, Xiangru Lian, and Ji Liu. Staleness-aware async-sgd for distributed deep learning. IJCAI, 2016.
