[torch] nn internal functions

functions

https://bigaidream.gitbooks.io/subsets_ml_cookbook/content/dl/lua/lua_module.html

[output] forward(input)

Takes an input object, and computes the corresponding output of the module.

After a forward(), the output state variable should have been updated to the new state.

We do NOT override this function. Instead, we implement the updateOutput(input) function. The forward function in the abstract parent class nn.Module will call updateOutput(input).
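
In the Torch7 source, the abstract forward is just a thin dispatcher. A simplified sketch of what nn.Module.forward does (paraphrased from the torch/nn source):

function Module:forward(input)
  -- delegate to the method the subclass actually overrides
  return self:updateOutput(input)
end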

[gradInput] backward(input, gradOutput)
Performs a backpropagation step through the module, w.r.t. the given input.

A backpropagation step consists of computing two kinds of gradients at input, given gradOutput (the gradients w.r.t. the output of the module). This function simply performs this task using two function calls:

a function call to updateGradInput(input, gradOutput)
a function call to accGradParameters(input, gradOutput)

We do NOT override this function. Instead, we override the updateGradInput(input, gradOutput) and accGradParameters(input, gradOutput) functions.
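
A simplified sketch of what nn.Module.backward does (paraphrased from the torch/nn source; note the optional third scale argument, which scales the accumulated parameter gradients):

function Module:backward(input, gradOutput, scale)
  scale = scale or 1
  self:updateGradInput(input, gradOutput)        -- gradient w.r.t. the input
  self:accGradParameters(input, gradOutput, scale) -- gradient w.r.t. the parameters
  return self.gradInput
end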

[output] updateOutput(input)
When defining a new module, this method should be overloaded.

Computes the output using the current parameter set of the class and input. This function returns the result which is stored in the output field.
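
As a minimal sketch, consider a hypothetical parameter-free module nn.Doubler (the name and the module are illustrative, not from the post) that computes output = 2 * input:

require 'nn'

local Doubler, parent = torch.class('nn.Doubler', 'nn.Module')

function Doubler:updateOutput(input)
  -- fill the output state variable and return it
  self.output:resizeAs(input):copy(input):mul(2)
  return self.output
end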

[gradInput] updateGradInput(input, gradOutput)
When defining a new module, this method should be overloaded.

Computes the gradient of the module w.r.t. its own input. This is returned in gradInput. Also, the gradInput state variable is updated accordingly.
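
Continuing the hypothetical nn.Doubler sketch: since output = 2 * input, the gradient w.r.t. the input is 2 * gradOutput:

function Doubler:updateGradInput(input, gradOutput)
  -- fill the gradInput state variable and return it
  self.gradInput:resizeAs(gradOutput):copy(gradOutput):mul(2)
  return self.gradInput
end

With these two overrides in place, doubler:forward(x) and doubler:backward(x, gradOut) work through the inherited nn.Module methods.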

[gradInput] accGradParameters(input, gradOutput)
When defining a new module, this method should be overloaded if the module has trainable parameters.

Computes the gradient of the module w.r.t. its own parameters. Many modules do NOT perform this step as they do NOT have any trainable parameters. The module is expected to accumulate the gradients w.r.t. the trainable parameters in some variables.
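
A minimal sketch for a module with one trainable parameter: a hypothetical nn.MyScale computing output = w * input for a single learnable scalar w (the real nn.Mul does essentially this), so dLoss/dw is the dot product of input and gradOutput. Note the accumulation (+) rather than assignment, and the optional scale argument that Torch7 passes as the third parameter:

local MyScale, parent = torch.class('nn.MyScale', 'nn.Module')

function MyScale:__init()
  parent.__init(self)
  self.weight = torch.Tensor(1):fill(1)     -- trainable parameter w
  self.gradWeight = torch.Tensor(1):zero()  -- its gradient accumulator
end

function MyScale:updateOutput(input)
  self.output:resizeAs(input):copy(input):mul(self.weight[1])
  return self.output
end

function MyScale:updateGradInput(input, gradOutput)
  self.gradInput:resizeAs(gradOutput):copy(gradOutput):mul(self.weight[1])
  return self.gradInput
end

function MyScale:accGradParameters(input, gradOutput, scale)
  scale = scale or 1
  -- accumulate, do not overwrite
  self.gradWeight[1] = self.gradWeight[1]
      + scale * torch.dot(input:contiguous():view(-1), gradOutput:contiguous():view(-1))
end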

Zeroing this accumulation is achieved with zeroGradParameters(), and updating the trainable parameters according to this accumulation is done with updateParameters().
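
Putting it together, one plain SGD step looks like this (a minimal sketch using nn.Linear and nn.MSECriterion; the learning rate 0.01 is arbitrary):

require 'nn'

local model = nn.Linear(10, 1)
local criterion = nn.MSECriterion()
local input, target = torch.randn(10), torch.randn(1)

model:zeroGradParameters()                     -- clear gradWeight/gradBias
local output = model:forward(input)
local loss = criterion:forward(output, target)
local gradOutput = criterion:backward(output, target)
model:backward(input, gradOutput)              -- accumulate parameter gradients
model:updateParameters(0.01)                   -- w <- w - 0.01 * gradient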

summary

--[[
output = model:forward(input)
gradInput = model:backward(input, gradOutput)
--]]
out1 = model1:forward(input)
out2 = model2:forward(out1)
loss = criterion:forward(out2, label)
grad_out2 = criterion:backward(out2, label)
grad_out1 = model2:backward(out1, grad_out2)  -- backward, not forward
grad_input = model1:backward(input, grad_out1)

practice

https://github.com/apsvvfb/VQA_jan

--train.lua
-- forward: the word-level module takes {questions, images} and returns five outputs
word_feat, img_feat, w_ques, w_img, mask = unpack(protos.word:forward({data.questions, new_data_images}))

-- backward: the module inputs plus the gradients w.r.t. each of the five outputs
dummy = protos.word:backward({data.questions, data.images}, {d_conv_feat, d_w_ques, d_w_img, d_conv_img, d_ques_img})
--misc/word_level.lua
function layer:updateOutput(input)
  local seq = input[1]
  local img = input[2]
  ...
  return {self.embed_output, self.img_feat, w_embed_ques, w_embed_img, self.mask}
end

function layer:updateGradInput(input, gradOutput)
  local seq = input[1]
  local img = input[2]
  ...
  return self.gradInput
end

Note that because this module takes a table of inputs, updateOutput returns a table of outputs, and gradOutput/gradInput mirror those table structures (one gradient per output and per input, respectively).