tf.cond 与 tf.control_dependencies 的控制问题

最新推荐文章于 2022-04-03 19:26:59 发布

Yan_Joy

最新推荐文章于 2022-04-03 19:26:59 发布

阅读量8k

点赞数 1

分类专栏： tensorflow 机器学习文章标签： tensorflow

本文链接：https://blog.csdn.net/yan_joy/article/details/70228401

版权

机器学习同时被 2 个专栏收录

27 篇文章 0 订阅

订阅专栏

tensorflow

14 篇文章 0 订阅

订阅专栏

问题引入

在搜索tf.cond的使用方法时，找到了这样的一个问题：

运行下面的一段tensorflow代码：

pred = tf.constant(True)
x = tf.Variable([1])
assign_x_2 = tf.assign(x, [2])
def update_x_2():
  with tf.control_dependencies([assign_x_2]):
    return tf.identity(x)
y = tf.cond(pred, update_x_2, lambda: tf.identity(x))
with tf.Session() as session:
  session.run(tf.initialize_all_variables())
  print(y.eval())

从代码上看，tf.cond经过判断pred的值对x进行更新。但实际上无论在pred = Ture 还是 False，输出的结果都是2，都是pred = tf.constant(True)的情况。

Confused by the behavior of tf.cond

这是怎么回事呢？

顺序执行

先不进行解释，有人在回复中给出了一个可以正确运行的代码，看一下有什么区别：

pred = tf.placeholder(tf.bool, shape=[])
x = tf.Variable([1])
def update_x_2():
  with tf.control_dependencies([tf.assign(x, [2])]):
    return tf.identity(x)
y = tf.cond(pred, update_x_2, lambda: tf.identity(x))
with tf.Session() as session:
  session.run(tf.initialize_all_variables())
  print(y.eval(feed_dict={pred: False}))  # ==> [1]
  print(y.eval(feed_dict={pred: True}))   # ==> [2]

区别也不大，只是把assign_x_2 = tf.assign(x, [2])这句整体移动到了tf.control_dependencies([tf.assign(x, [2])])的内部。
给出的解释是：

如果要让tf.cond()在其中一个分支中执行命令（如分配），你必须在你要传递给的函数创建执行副命令的操作。
If you want to perform a side effect (like an assignment) in one of the branches, you must create the op that performs the side effect inside the function that you pass to .
因为在TensorFlow图中的执行是依次向前流过图形的，所以在任一分支中引用的所有操作必须在条件进行求值之前执行。这意味着true和false分支都接受对tf.assign() op 的控制依赖。
Because execution in a TensorFlow graph flows forward through the graph, all operations that you refer to in either branch must execute before the conditional is evaluated. This means that both the true and the false branches receive a control dependency on the tf.assign() op.

翻译的可能不够准确，大意就是assign_x_2 = tf.assign(x, [2])这句话在tf.cond已经执行过了，因此无论执行update_x_2（让x=2）或lambda: tf.identity(x)（保持x不变），得到的结果都是x=2。
这么来看其实是一个很简单的问题，定义时不仅定义了模型，也隐含着定义了执行顺序。

tf.control_dependencies()

这个函数加不加看起来没有什么区别，比如：

import tensorflow as tf                                                                                                                                
pred = tf.placeholder(tf.bool, shape=[])
x = tf.Variable([1])
# x_2 = tf.assign(x, [2])
def update_x_2():
     # with tf.control_dependencies([x_2]): #[tf.assign(x, [2])]):
     return tf.assign(x, [2])
y = tf.cond(pred, update_x_2, lambda: tf.identity(x))
with tf.Session() as session:
     session.run(tf.global_variables_initializer())
     print(y.eval(feed_dict={pred: False}))  # ==> [1]
     print(y.eval(feed_dict={pred: True}))   # ==> [2]

去掉之后运行结果和正确的相同。具体作用还是看一下官网吧……
直接搜tf.control_dependencies得到的信息并不多：

Wrapper for Graph.control_dependencies() using the default graph.
See tf.Graph.control_dependencies for more details.

在tf.Graph.control_dependencies这里确实讲得很详细，其作用简单来说就是控制计算顺序。

with g.control_dependencies([a, b, c]):
  # `d` and `e` will only run after `a`, `b`, and `c` have executed.
  d = ...
  e = ...

有了这句话，with中的语句就会在control_dependencies()中的操作执行之后运行，并且也支持嵌套操作。在给出的错误例子中，很像开头提出的问题：

# WRONG
def my_func(pred, tensor):
  t = tf.matmul(tensor, tensor)
  with tf.control_dependencies([pred]):
    # The matmul op is created outside the context, so no control
    # dependency will be added.
    return t

# RIGHT
def my_func(pred, tensor):
  with tf.control_dependencies([pred]):
    # The matmul op is created in the context, so a control dependency
    # will be added.
    return tf.matmul(tensor, tensor)