Notes on PyTorch Functions

hello_world_banni

Categories: python, torch

Article link: https://blog.csdn.net/hello_world_banni/article/details/116334502

Categorical.log_prob()

log_prob returns the natural log of the probability that the distribution assigns to a given value (here, a sampled action). Example:

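import torch
import torch.nn.functional as F
from torch.distributions import Categorical

# Random scores in [0, 1) stand in for the action logits.
action_logits = torch.rand(5)
# Softmax turns the logits into probabilities that sum to 1.
action_probs = F.softmax(action_logits, dim=-1)
print(action_probs)
# Categorical distribution over the 5 action indices.
dist = Categorical(action_probs)
action = dist.sample()
print(action)
# Both expressions print the same value: the log of the sampled action's probability.
print(dist.log_prob(action), torch.log(action_probs[action]))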

Output:

tensor([0.1419, 0.3035, 0.1763, 0.1427, 0.2355])
tensor(2)
tensor(-1.7358) tensor(-1.7358)

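That is, both printed values are the natural log of the probability of the sampled action (index 2), i.e. $\log_e(0.1763)$, up to the rounding of the printed probability.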
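As a small extension of the example above, the sketch below builds the distribution directly from the logits (via the `logits=` argument of `Categorical`) and checks that `log_prob` agrees with `F.log_softmax`. The fixed seed, the hand-picked `actions` tensor, and the variable names are assumptions added for illustration; they are not part of the original example.

```python
import torch
import torch.nn.functional as F
from torch.distributions import Categorical

torch.manual_seed(0)  # assumption: fixed seed only to make the run repeatable

action_logits = torch.rand(5)

# Build the distribution straight from unnormalized logits,
# skipping the explicit softmax step.
dist = Categorical(logits=action_logits)
action = dist.sample()

# log_prob should match the log-softmax entry at the sampled index.
expected = F.log_softmax(action_logits, dim=-1)[action]
matches_single = torch.allclose(dist.log_prob(action), expected)

# log_prob also accepts a tensor holding several actions at once
# (indices chosen by hand here, purely for illustration).
actions = torch.tensor([0, 2, 4])
batch_log_probs = dist.log_prob(actions)
expected_batch = F.log_softmax(action_logits, dim=-1)[actions]
matches_batch = torch.allclose(batch_log_probs, expected_batch)

print(matches_single, matches_batch)
```

Passing logits avoids taking the log of an already-normalized probability, which is usually the more numerically stable route when the network outputs raw scores.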
