TensorFlow Study Notes (1): Defining a Custom Function Gradient with the @tf.custom_gradient Decorator

寂乐居士

Category: Tensorflow. Tags: Tensorflow, gradient computation

Article link: https://blog.csdn.net/qq_39216794/article/details/86183668

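Summary (auto-generated by CSDN): This article introduces the @tf.custom_gradient decorator in TensorFlow, which addresses gradient computations that fail because of numerical instability. Example code shows how to define a function together with its gradient so that the derivative is computed correctly. Finally, a composite-function example verifies that the custom gradient is correct.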

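TensorFlow v1.12, the version used in these notes, provides a decorator function, tf.custom_gradient, that wraps a user-defined pair of a function and its derivative.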

Sometimes we want TensorFlow to compute the gradient of a function, but run into a situation like this:

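import tensorflow as tf  # TensorFlow 1.x API (tf.log, tf.Session)

def log1pexp(x):
    # log(1 + exp(x)), written naively
    e = tf.exp(x)
    return tf.log(1 + e)

x = tf.constant(100.)
y = log1pexp(x)
# tf.gradients returns a list with one gradient tensor per input
dy = tf.gradients(y, x)

with tf.Session() as sess:
    print(sess.run(dy))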

Running this code prints:

[nan]

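The cause is numerical instability. In float32, tf.exp(100.) overflows to inf, so when the gradient is assembled by the chain rule it multiplies 1/(1+e) = 0 by e = inf, and 0 times inf is nan. For the computer to still return this derivative, we have to give it an expression it can handle. Code first, explanation after: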

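@tf.custom_gradient
def log1pexp(x):
    e = tf.exp(x)
    def grad(dy):
        # The source is cut off after this line; the two return statements
        # follow the log1pexp example in the tf.custom_gradient documentation.
        return dy * (1 - 1 / (1 + e))  # dy is the upstream gradient
    return tf.log(1 + e), grad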
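The nan can be reproduced without TensorFlow, which may make the role of a hand-written gradient easier to see. The sketch below is plain Python, not the author's code: `exp32`, `naive_grad`, `stable_grad` and the constant `FLOAT32_EXP_LIMIT` are hypothetical names introduced here, and `exp32` only imitates float32 overflow by returning inf above roughly ln(float32 max). It compares the chain-rule product (1/(1+e))·e with the algebraically equivalent form 1 - 1/(1+e), which stays finite when e is inf.

```python
import math

# Assumption: float32 exp overflows to inf a little above x = 88.72,
# which is roughly ln(3.4e38). Used only to imitate float32 behaviour.
FLOAT32_EXP_LIMIT = 88.72

def exp32(x):
    """Imitate float32 exp: return inf on overflow instead of raising."""
    return math.inf if x > FLOAT32_EXP_LIMIT else math.exp(x)

def naive_grad(x):
    """Chain rule applied step by step: d log(1+e)/de times de/dx."""
    e = exp32(x)
    return (1.0 / (1.0 + e)) * e  # 0.0 * inf when e overflows

def stable_grad(x):
    """Same derivative rewritten as 1 - 1/(1+e), finite even if e is inf."""
    e = exp32(x)
    return 1.0 - 1.0 / (1.0 + e)

print(naive_grad(100.0))   # nan
print(stable_grad(100.0))  # 1.0
print(naive_grad(0.0), stable_grad(0.0))  # 0.5 0.5
```

For moderate x the two forms agree, so the rewritten gradient changes nothing except at the inputs where the naive product breaks down.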
