RBM中的Gibbs,CD-K,PCD三种抽样方式

本文介绍了RBM(受限玻尔兹曼机)中的三种抽样方法:Gibbs采样、 Contrastive Divergence (CD-K) 和 Persistent Contrastive Divergence (PCD)。重点讲述了PCD算法的原理和实现,强调了PCD在参数更新时不重置马尔科夫链,利用上一轮抽样结果作为初始化状态,以提高效率。
摘要由CSDN通过智能技术生成

首先看RBM教程推导:http://blog.csdn.net/itplus/article/details/19207371,推导到下图时,对中括号中的第二项进行计算,是

通过采样的到的,那么采样有三种方法:Gibbs,CD-K,PCD,下面分别讲三种抽样方法。

          

1:Gibbs采样
2:CD-K

前两种可参考张春霞《受限波尔兹曼机简介》和http://blog.csdn.net/mytestmy/article/details/9150213,下面主要讲我理解的PCD算法。

3:PCD

关于PCD的论文《Training Restricted Boltzmann Machines using Approximations to the Likelihood Gradient》中,有下面几段话:

The standard way to get it is by using a Markov Chain, but running a chain for many steps is too time-consuming. However, between parameter updates, the model changes only slightly.We can take advantage of that by initializing a Markov Chain at the state in which it ended for the previous model. This initialization is often fairly close to the model distribution, even though the model has changed a bit in the parameter update. Neal uses this approach with Sigmoid Belief Networks to approximately sample from the posterior distribution over hidden layer states given the visible layer state. For RBMs, the situation is a bit simpler: there is only one distribution from which we need samples, as opposed to one distribution per training data point. Thus, the algorithm can be used to produce gradient estimates online or using mini-batches, using only a few training data points for the positive part of each gradient estimate, and only a few ’fantasy’ points for the negative part. The fantasy points are updated by one full step of the Markov Chain each time a mini-batch is processed.


Of course this still is an approximation, because the model does change slightly with each parameter update. With infinitesimally small learning rate it becomes exact, and in general it seems to work best with small learning rates.
We call this algorithm Persistent Contrastive Divergence (PCD), to emphasize that the Markov Chain is not reset between parameter updates.


评论 4
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值