阅读ViT代码

最新推荐文章于 2024-05-24 20:11:08 发布

ustcjinggg

最新推荐文章于 2024-05-24 20:11:08 发布

阅读量683

点赞数

本文链接：https://blog.csdn.net/ustcjinggg/article/details/118800921

版权

应该说最先读的一段应该是forward这一段，用来理解整个网络框架

def forward_features(self, x):
pdb.set_trace()# x.shape=[256,3,384,384]
x = self.patch_embed(x)#x.shape=[256,576,384]
cls_token = self.cls_token.expand(x.shape[0], -1, -1)
if self.dist_token is None:
x = torch.cat((cls_token, x), dim=1)
else:
x = torch.cat((cls_token, self.dist_token.expand(x.shape[0], -1, -1), x), dim=1)
x = self.pos_drop(x + self.pos_embed)#x.shape=[256,577,384]
x = self.blocks(x)#self_att & ffn,x.shape=[384,577,384]
x = self.norm(x)
if self.dist_token is None:
return self.pre_logits(x[:, 0])
else:
return x[:, 0], x[:, 1]

def forward(self, x):
#pdb.set_trace()
x = self.forward_features(x)
if self.head_dist is not None:
x, x_dist = self.head(x[0]), self.head_dist(x[1]) # x must be a tuple
if self.training and not torch.jit.is_scripting():
# during inference, return the average of both classifier predictions
return x, x_dist
else:
return (x + x_dist) / 2
else:
x = self.head(x)
return x

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

ustcjinggg

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
阅读ViT代码

应该说最先读的一段应该是forward这一段，用来理解整个网络框架 def forward_features(self, x): pdb.set_trace()# x.shape=[256,3,384,384] x = self.patch_embed(x)#x.shape=[256,576,384] cls_token = self.cls_token.expand(x.shape[0], -1, -1) if self.dis...
复制链接

扫一扫