并行计算课程自学2

白泽白泽呀

已于 2024-01-03 10:22:12 修改

阅读量754

点赞数 12

分类专栏：并行计算学习文章标签：服务器

于 2024-01-02 15:38:39 首次发布

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_45498320/article/details/135342359

版权

并行计算学习专栏收录该内容

2 篇文章 0 订阅

订阅专栏

Parameter Server's Architecture

Parameter Server's Architecture(李沐大神和Alex提出的)(与mapreduce的主要区别在于本方法为异步mapreduce为同步）

The Parameter Server

The parameter server was proposed by [1] for scalable machine learningCharacters: client-server architecture, message-passing communication,and asynchronous.

(Note that MapReduce is bulk synchronous.)

Ray [2], an open-source software system, supports parameter server.

Reference

1. Li and others: Scaling distributed machine learning with the parameter server. In OSDl, 2014

2. Moritz and others: Ray: A distributed framework for emerging Al applications. In OSDl, 2018

Synchronous algorithm vs Asynchronous algorithm

同步通信效率非常低

异步通信不需要等待其他worker因此高效

Asynchronous Gradient Descent 异步梯度下降

The i-th worker repeats:

1. Pull the up-to-date modelparameters w from the server.

2. Compute gradient $\color{red}~{g}_i$ using its local data and w.

3. Push $\color{red}~{g}_i$ to the server.

The server performs:

1. Receive gradient from aworker.

2. Update the parameters by:

$\color{blue}w\color{black}\gets \color{blue}w\color{black}-\alpha*\color{red}~{g}_i$

Reference

1. Niu and others: Hogwild: A lock-free approach to parallelizing stochastic gradient descent. In NIPS, 2011

Pro and Con of Asynchronous Algorithms

In practice, asynchronous algorithms are faster than the synchronous.

In theory, asynchronous algorithms has slower convergence rate.

Asynchronous algorithms have restrictions, e.g., a worker cannot bemuch slower than the others.(Why?)

白泽白泽呀

关注

12
点赞
踩
18

收藏

觉得还不错? 一键收藏
0
评论
并行计算课程自学2

并行计算中的同步通信以及异步通信
复制链接

扫一扫

专栏目录

白泽白泽呀

CSDN认证博客专家 CSDN认证企业博客

码龄5年

10: 原创

62万+: 周排名

14万+: 总排名

4302: 访问

: 等级

177: 积分

45: 粉丝

68: 获赞

2: 评论

67: 收藏

私信

关注

热门文章

分类专栏

最新评论

attention is all you need 论文解读视频文字版
CSDN-Ada助手: 非常感谢您分享这篇博文，解读论文确实需要一定的技能和耐心，您的分享让我更好地理解了attention机制的应用。除了这些，我认为对于深度学习的初学者来说，了解一些基础的线性代数、概率论和微积分知识也是非常有帮助的。希望您能继续分享更多优质的内容，谢谢！如何写出更高质量的博客，请看该博主的分享：https://blog.csdn.net/lmy_520/article/details/128686434
Binary Neural Networks notes
CSDN-Ada助手: Python入门技能树或许可以帮到你：https://edu.csdn.net/skill/python?utm_source=AI_act_python

大家在看

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。