SGD平行算法 - Downpour SGD （单机python多线程版）

最新推荐文章于 2022-04-03 21:20:27 发布

weixin_39195527

最新推荐文章于 2022-04-03 21:20:27 发布

阅读量1.2k

点赞数 1

分类专栏：最优化问题文章标签： python Downpour 平行运算样例代码机器学习

本文链接：https://blog.csdn.net/weixin_39195527/article/details/76099206

版权

本文介绍了Downpour SGD的核心思想，并提供了单机Python多线程实现的样例代码。Downpour SGD是一种并行优化算法，通过将数据和模型变量分散到多个线程，加快了机器学习中的梯度下降过程。每个线程独立训练，最终在主节点汇总更新模型参数。

摘要由CSDN通过智能技术生成

SGD 被广泛运用到机器学习(machine learning)中最优化等问题中，学术界一直热衷于提升SGD在优化问题中的收敛速率，并行计算也是热点研究的方向(包括Hogwild! [1], Delay-tolerant Algorithm for SGD [2], Elastic Average SGD [3])，本篇实现了现在比较火的Downpour SGD [4]的样例代码 (选择这个的原因引用量最大)。

理论思想

核心思想，将数据随机划分成数个子数据集sub-data, 将模型变量划分数个机器/进程/线程，基于子集合数据更新各个机器/进程/线程内的变量n次，然后到master 节点更新模型变量，各个机器/进程/线程训练独立互不干扰, 而且各个机器/进程/线程内的模型变量在训练中也互不干扰。引用原文原话 [4]:

We divide the training data into a number of subsets and run a copy of the model on
each of these subsets. The models communicate updates through a centralized parameter server,
which keeps the current state of all parameters for the model, sharded across many machines (e.g.,
if we have 10 parameter server shards, each shard is responsible for storing and applying updates
to 1/10th of the model parameters) (Figure 2). This approach is asynchronous in two distinct aspects:
the model replicas run independently of each other, and the parameter server shards also run
independently of one another

代码部分

# -*- encoding: utf-8 -*-
import re
import sys
import numpy as np
import copy
import time
import threading

def timeConsumption(func):

最低0.47元/天解锁文章

weixin_39195527

关注

1
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录