scan——Theano中循环的实现

最新推荐文章于 2019-03-05 16:00:24 发布

五道口纳什

最新推荐文章于 2019-03-05 16:00:24 发布

阅读量3.8k

点赞数

分类专栏：深度学习 python

本文链接：https://blog.csdn.net/lanchunhui/article/details/50256423

版权

python 同时被 2 个专栏收录

210 篇文章 26 订阅

订阅专栏

深度学习

61 篇文章 9 订阅

订阅专栏

引子

在开始scan函数的设计之前，我们从一个实例出发，首先来看，一个循环需要必备哪些成分：简单的循环累积相乘，计算 $A^k$ ，使用numpy，代码如下。

def power(A, k):
    result = 1
    for i in range(k):
        result *= A
    return result

计算 $A^k$ 的过程中，我们需要对其中三个主要成分进行操纵，

恒定量A：对应于theano.scan函数中的non_sequences参数
为result设置的初值，由outputs_info指定
result的累积相乘值，自动进行

theano.scan函数接口介绍

results, updates = theano.scan(fn=lambda y, p, x_tm2, x_tm1, A: y+p+x_tm2+x_tm1+A,
                    sequences=[Y, P[::-1]],
                    outputs_info=[dict(initial=X, taps=[-2, -1])],
                    non_sequences=A)

参数 $fn$ 表示每次迭代的操作，简单的通过lambda匿名函数对象来定义，注意lambda函数的参数顺序是有要求的，
- 先是sequences提供的(y, p)，
- 然后是outputs_info的参数(x_tm2, x_tm1)，
- 最后是non_sequences的参数(A)
- 三者默认都是None
sequences表示小迭代的序列，序列的第一维（leading dimension）即是需要迭代的次数，如果不显示指定sequences（也即是默认为None），需要通过n_steps显式指定其迭代次数，Y和P[::-1]的第一维的大小应该相同，如果不同，取两者的较小值，多项式求和的例子中我们会看到这一机制的应用。
outputs_info描述的是需要用到前多少次迭代输出的结果，dict(initial=X, taps=[-2, -1])表示使用前一次和倒数第二次输出的结果。如果当前迭代输出为x(t)，则计算中会使用x(t-1), x(t-2)。
与sequences相对，non_sequences描述了非序列的输入，即A是一个固定的输入，每次迭代相加的A都是固定的。

实例

计算 $A^k$

所以，等价的theano代码如下：

import theano
import theano.tensor as T

k = T.iscalar('k')
A = T.vector('A')
result, updates = theano.scan(fn=lambda prior_result, A: prior_result*A, 
        outputs_info=T.ones_like(A), non_sequences=A, n_steps=k)
        # 这里的对应关系为，outputs_info=T.ones_like(A) -> prior_result，类比 result *= A 
        # 将prior_result初始化为1，每次输出的结果继续传递给outputs_info
        # non_sequence=A -> A 
power = theano.function(inputs=[A, k], outputs=result[-1], updates=updates)
power(range(10), 3)

输出为：

Out[16]: array([   0.,    1.,    8.,   27.,   64.,  125.,  216.,  343.,  512.,  729.])

计算多项式的和

f (x) = a 0 + a 1 x + a 2 x 2 + \dots + a n - 1 x n - 1 + a n x n

$f(x)=a_0+a_1x+a_2x^2+\cdots+a_{n-1}x^{n-1}+a_nx^n$

我们用sequences标识系数部分 $[a_0,a_1,\ldots,a_n]$ ，也包括该系数所在的项对应的幂。
自然使用non_sequences标识自由变量 $x$
多项式求和与前一项的输出无关，

import numpy
import theano
import theano.tensor as T

coefs = T.vector('coefs')
x = T.scalar('x')
max_coef_supported = 10000
components, updates = theano.scan(fn=lambda coef, power, free_var: coef*free_var**power, 
        sequences=[coefs, T.arange(max_coef_supported)],
        outputs_info=None,
        non_sequences=x)
                # sequences=[coefs, T.arange(max_coef_supported)] -> coef, power
                # non_sequences=x -> free_variable
polynomial = components.sum()
calc_poly = theano.function(inputs=[coefs, x], outputs=polynomial)
calc_poly([1, 0, 2], 3)

输出为：

array(19.0)
            # 1*3^0+0*3^1+2*3^2=19

计算序列

x(t)=tanh(x(t−1).dot(W)+y(t).dot(U)+p(T−1).dot(V)) $x(t)=\tanh(x(t-1).dot(W)+y(t).dot(U)+p(T-1).dot(V))$

x = T.vector('x')
W = T.matrix('W')
Y = T.matrix('Y')
U = T.matrix('U')
P = T.matrix('P')
V = T.matrix('V')

results, updates = theanp.scan(fn=lambda y, p, x_tm1: T.tanh(T.dot(x_tm1, W)+T.dot(y, U)+T.dot(p, v)),
            sequences=[Y, P[::-1]],
            outputs_info=[x])
compute_seq = theano.function(inputs=[x, W, Y, U, P, V], outputs=results)

计算矩阵 $X$ 的列的范数

X = T.matrix('X')
results, updates = theano.scan(fn=lambda x: T.sqrt((x**2).sum()), sequences=[X.T])
compute_norm_cols = theano.function(inputs=[X], outputs=[results])

计算矩阵 $X$ 的迹

X = T.matrix('X')
results, updates = theano.scan(fn=lambda i, f, t: T.cast(X[i, j]+t, floatX), 
            sequences=[T.arange(X.shape[0]), T.arange(X.shape[1])],
            outputs_info=np.asarray(0., dtype=floatX))
result = results[-1]        
theano.function(inputs=[X], outputs=[result])

计算序列x(t) = x(t - 2).dot(U) + x(t - 1).dot(V) + tanh(x(t - 1).dot(W) + b)

X = T.matrix('X')
theano.scan(fn=lambda x_tm2, x_tm1: T.dot(x_tm2, U)+T.dot(x_tm1, V)+T.tanh(x_tm1, W)+b,
            outputs_info=[dict(initial=X, taps=[-2, -1])], 
            n_steps=n_sym)

References

[1] Theano2.1.10-基础知识之循环

五道口纳什

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
打赏
2
评论
scan——Theano中循环的实现

在开始scan函数的设计之前，我们从一个实例出发，首先来看，一个循环需要必备哪些成分：简单的循环累积相乘，计算AkA^k，使用numpy，代码如下。def power(A, k): result = 1 for i in range(k): result *= A return result计算AkA^k的过程中，我们需要对其中三个主要成分进行操纵，恒定量A
复制链接

扫一扫