Converting Tensorpack inference to PyTorch

I recently ran an OCR model written with Tensorpack. Its inference was unsatisfactory in both GPU memory usage and speed; after porting the inference to PyTorch, both improved considerably. This post records some of the pitfalls I hit during the port.

  • Converting the pb file to a pth file
import torch
from collections import OrderedDict
import tensorflow as tf
from tensorflow.python.framework import tensor_util

def view_params():
    pb_file = 'ocr/checkpoint/text_recognition_377500.pb'
    graph = tf.Graph()
    with graph.as_default():
        with tf.gfile.FastGFile(pb_file, 'rb') as f:
            graph_def = tf.GraphDef()
            graph_def.ParseFromString(f.read())
            _ = tf.import_graph_def(graph_def, name='')
            # in a frozen pb the weights are stored as Const nodes
            graph_nodes = [n for n in graph_def.node]
            wts = [n for n in graph_nodes if n.op == 'Const']

    odic = OrderedDict()
    for n in wts:
        param = tensor_util.MakeNdarray(n.attr['value'].tensor)
        if param.size != 0:  # skip empty constants
            odic[n.name] = param
    torch.save(odic, 'pb_377500.pth')
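
The dict saved above still uses TensorFlow's node names and weight layouts. Before it can be fed to load_state_dict, conv kernels have to go from TF's HWIO layout to PyTorch's OIHW, and fully connected weights from (in, out) to (out, in); the name mapping itself depends on the concrete graph, so the name_map below is only a placeholder. A minimal sketch:

import torch
from collections import OrderedDict

def convert_to_pytorch_layout(tf_state, name_map):
    """tf_state: node name -> numpy array taken from the pb file.
    name_map:  tf node name -> pytorch state_dict key (model specific, hypothetical here)."""
    new_state = OrderedDict()
    for tf_name, torch_name in name_map.items():
        w = torch.from_numpy(tf_state[tf_name])
        if w.dim() == 4:      # conv kernel: HWIO -> OIHW
            w = w.permute(3, 2, 0, 1).contiguous()
        elif w.dim() == 2:    # dense weight: (in, out) -> (out, in)
            w = w.t().contiguous()
        new_state[torch_name] = w
    return new_state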
  • Model code
class TextRecognition(nn.Module):
    def __init__(self):
        super(TextRecognition, self).__init__()
        # BasicConv2d wraps conv with TensorFlow-style 'SAME' padding (see the padding note below)
        self.features = nn.Sequential(OrderedDict([
            ('Conv2d_1a_3x3', BasicConv2d(3, 32, kernel_size=3, stride=2, padding='SAME')),
            ('Conv2d_2a_3x3', BasicConv2d(32, 32, kernel_size=3, stride=1, padding='SAME')),
            ...
            ('Mixed_6h', Inception_B()),
        ]))
        self.attention_lstm = AttentionLstm()

    def forward(self, x):
        x = self.features(x)
        x = self.attention_lstm(x)
        return x
        
class LinearBias(nn.Module):
    def __init__(self, size):
        super(LinearBias, self).__init__()
        self.param = nn.Parameter(torch.Tensor(size))

    def forward(self, x):
        x = x + self.param
        return x
        
class AttentionLstm(nn.Module):
    def __init__(self, seq_len=33, is_training=False, num_classes=7569,
                 wemb_size=256, channel=1024, lstm_size=512):
        super(AttentionLstm, self).__init__()
        self.seq_len = seq_len  # 33
        ...
        self.W_wemb = nn.Linear(self.num_classes, self.wemb_size, bias=False)
        self.lstm_b = LinearBias(self.lstm_size*4)
        self.tanh = nn.Tanh()
        self.softmax_1d = nn.Softmax(dim=1)
        self.sigmoid = nn.Sigmoid()
        self.dropout_1d = nn.Dropout(0.)

    def forward(self, cnn_feature):  # bs, 1024, h, w
        _, _, self.height, self.width = cnn_feature.size()
        ...
        return output_array, attention_array

A pitfall I hit when writing the convolution layers: PyTorch and TensorFlow differ in how Conv2d padding works. PyTorch pads the same amount on the top, bottom, left and right, while TensorFlow's 'SAME' padding can pad different amounts on each side (when the total padding is odd, the extra pixel goes to the bottom/right).
Reference blog 1
Reference blog 2
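
As a minimal sketch of how the TF behaviour can be reproduced, assuming BasicConv2d is the usual conv + BN + ReLU block (the actual BasicConv2d in the model is not shown, so this form is an assumption): the padding is computed per side and applied with F.pad before a Conv2d that does no padding of its own.

import torch
import torch.nn as nn
import torch.nn.functional as F

class BasicConv2d(nn.Module):
    """Conv + BN + ReLU with TensorFlow-style 'SAME' padding (assumed block structure).

    TF picks the total padding so that out = ceil(in / stride); when that total is odd,
    the extra pixel goes to the bottom/right. PyTorch's built-in padding is symmetric,
    so the padding is applied explicitly with F.pad instead.
    """
    def __init__(self, in_ch, out_ch, kernel_size, stride=1, padding='SAME'):
        super(BasicConv2d, self).__init__()
        assert padding == 'SAME'
        self.kernel_size = kernel_size
        self.stride = stride
        self.conv = nn.Conv2d(in_ch, out_ch, kernel_size, stride=stride, padding=0, bias=False)
        self.bn = nn.BatchNorm2d(out_ch)
        self.relu = nn.ReLU(inplace=True)

    def _same_pad(self, size):
        # total padding needed along one spatial dimension
        if size % self.stride == 0:
            pad = max(self.kernel_size - self.stride, 0)
        else:
            pad = max(self.kernel_size - size % self.stride, 0)
        return pad // 2, pad - pad // 2  # (before, after): the extra pixel goes after

    def forward(self, x):
        h, w = x.shape[2], x.shape[3]
        pad_top, pad_bottom = self._same_pad(h)
        pad_left, pad_right = self._same_pad(w)
        x = F.pad(x, (pad_left, pad_right, pad_top, pad_bottom))
        return self.relu(self.bn(self.conv(x)))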

In AttentionLstm there is a LinearBias class, which adds self.lstm_b to pack. If the bias were kept as a plain tensor and the addition written directly in forward, self.lstm_b would not be saved in the state_dict; wrapping it in a small module whose bias is an nn.Parameter means all of the model's parameters can be loaded in a single load_state_dict call.

class AttentionLstm(nn.Module):
    def __init__(self):
        super(AttentionLstm, self).__init__()
        self.seq_len = 33
        self.W_wemb = nn.Linear(10, 20, bias=False)
        self.lstm_b = LinearBias(4)
        self.a = nn.Parameter(torch.Tensor(1))  # registered parameter: saved in state_dict
        self.b = torch.randn(1, 3)              # plain tensor attribute: NOT saved

test = AttentionLstm()
# odict_keys(['a', 'W_wemb.weight', 'lstm_b.param'])
# self.b does not appear in the state_dict, while self.lstm_b does
print(test.state_dict().keys())

# In the real forward pass the bias is applied through the LinearBias module:
pack = self.lstm_W(wemb_prev) + self.lstm_U(h_prev) + self.lstm_Z(attention_feature)  # bs, 2048
pack_with_bias = self.lstm_b(pack)

Most of the original code uses TensorFlow functions, so they have to be replaced with their PyTorch counterparts:

TensorFlow                  PyTorch
tf.matmul                   torch.matmul
tf.multiply                 torch.mul
tf.sigmoid                  torch.nn.Sigmoid
tf.nn.dropout               torch.nn.Dropout
tf.nn.softmax               torch.nn.Softmax
tf.tanh                     torch.tanh
tf.split                    torch.split
tf.shape                    torch.size
tf.reshape / tf.transpose   torch.reshape / view
tf.expand_dims              torch.unsqueeze
tf.add_n / tf.add           torch.add
tf.reduce_sum               torch.sum
tf.reduce_mean              torch.mean
tf.transpose                torch.permute
tf.concat                   torch.cat
tf.nn.embedding_lookup      torch.index_select
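
A few of these replacements side by side, as an illustrative sketch (the shapes are made up for the example; 7569 and 256 are the model's num_classes and wemb_size):

import torch

x = torch.randn(2, 1024, 8, 32)               # bs, channel, h, w

# tf.shape(x)                     -> x.size()
bs, c, h, w = x.size()

# tf.transpose(x, [0, 2, 3, 1])   -> x.permute(0, 2, 3, 1)
x_nhwc = x.permute(0, 2, 3, 1)

# tf.reshape(x, [bs, -1, c])      -> x.reshape(bs, -1, c) (or .view on contiguous tensors)
feat = x_nhwc.reshape(bs, -1, c)

# tf.expand_dims(v, 1)            -> v.unsqueeze(1)
v = torch.randn(bs, c).unsqueeze(1)

# tf.reduce_sum(t, axis=1)        -> t.sum(dim=1); tf.reduce_mean -> t.mean(dim=1)
s = feat.sum(dim=1)

# tf.concat([a, b], axis=-1)      -> torch.cat([a, b], dim=-1)
cat = torch.cat([s, s], dim=-1)

# tf.nn.embedding_lookup(emb, ids) -> torch.index_select(emb, 0, ids)
embedding = torch.randn(7569, 256)
ids = torch.tensor([3, 5, 7])
wemb = torch.index_select(embedding, 0, ids)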
  • Loading the parameters

Finally, load the parameters and verify:

net = attention_ocr_pytorch.TextRecognition()
net.load_state_dict(torch.load('log/pytorch/pb_377500_fl.pth'))
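
If the key names do not line up yet, loading with strict=False reports the mismatch instead of raising, and a quick forward pass in eval mode confirms the converted model runs; the dummy input size below is an assumption for illustration:

import torch
import attention_ocr_pytorch

net = attention_ocr_pytorch.TextRecognition()
state = torch.load('log/pytorch/pb_377500_fl.pth', map_location='cpu')

# strict=False returns the missing/unexpected keys instead of raising,
# which helps while the TF -> PyTorch name mapping is still being debugged
missing, unexpected = net.load_state_dict(state, strict=False)
print('missing keys:', missing)
print('unexpected keys:', unexpected)

net.eval()                                  # inference mode: fixes BN/dropout behaviour
with torch.no_grad():                       # no autograd graph, lower memory use
    dummy = torch.randn(1, 3, 64, 256)      # assumed NCHW input size, for illustration only
    output_array, attention_array = net(dummy)
    print(output_array.shape, attention_array.shape)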