Pytorch -- ONNX -- ONNXRuntime

zzzllllllll

已于 2022-09-23 15:32:54 修改

阅读量500

点赞数

分类专栏：模型部署文章标签： pytorch 深度学习 python

于 2022-09-23 15:32:52 首次发布

原文链接：https://zhuanlan.zhihu.com/p/477743341#:~:text=1%20%E6%A8%A1%E5%9E%8B%E9%83%A8%E7%BD%B2%EF%BC%8C%E6%8C%87%E6%8A%8A%E8%AE%AD%E7%BB%83%E5%A5%BD%E7%9A%84%E6%A8%A1%E5%9E%8B%E5%9C%A8%E7%89%B9%E5%AE%9A%E7%8E%AF%E5%A2%83%E4%B8%AD%E8%BF%90%E8%A1%8C%E7%9A%84%E8%BF

版权

模型部署专栏收录该内容

3 篇文章 0 订阅

订阅专栏

本文介绍了如何将一个PyTorch训练好的超分辨率模型进行部署，包括模型转换为ONNX格式以及使用ONNXRuntime进行推理。首先，展示了如何加载预训练模型并进行推理，然后详细讲解了利用torch.onnx.export将PyTorch模型转换为ONNX模型的过程。最后，通过ONNXRuntime进行模型推理验证，确保模型转换无误。

摘要由CSDN通过智能技术生成

开始学习模型部署了。学了一个超简单的部署

模型部署入门教程（一）：模型部署简介 - 知乎 (zhihu.com)

总结

模型部署，指把训练好的模型在特定环境中运行的过程。模型部署要解决模型框架兼容性差和模型运行速度慢这两大问题。
模型部署的常见流水线是“深度学习框架-中间表示-推理引擎”。其中比较常用的一个中间表示是 ONNX。
深度学习模型实际上就是一个计算图。模型部署时通常把模型转换成静态的计算图，即没有控制流（分支语句、循环语句）的计算图。
PyTorch 框架自带对 ONNX 的支持，只需要构造一组随机的输入，并对模型调用 torch.onnx.export 即可完成 PyTorch 到 ONNX 的转换。
推理引擎 ONNX Runtime 对 ONNX 模型有原生的支持。给定一个 .onnx 文件，只需要简单使用 ONNX Runtime 的 Python API 就可以完成模型推理。

import os
import cv2
import numpy as np
import requests
import torch
import torch.onnx
from torch import nn


class SuperResolutionNet(nn.Module):
    def __init__(self, upscale_factor):
        super().__init__()
        self.upscale_factor = upscale_factor
        self.img_upsampler = nn.Upsample(
            scale_factor=self.upscale_factor,
            mode='bicubic',
            align_corners=False)

        self.conv1 = nn.Conv2d(3, 64, kernel_size=9, padding=4)
        self.conv2 = nn.Conv2d(64, 32, kernel_size=1, padding=0)
        self.conv3 = nn.Conv2d(32, 3, kernel_size=5, padding=2)

        self.relu = nn.ReLU()

    def forward(self, x):
        x = self.img_upsampler(x)
        out = self.relu(self.conv1(x))
        out = self.relu(self.conv2(out))
        out = self.conv3(out)
        return out

    # Download checkpoint and test image


urls = ['https://download.openmmlab.com/mmediting/restorers/srcnn/srcnn_x4k915_1x16_1000k_div2k_20200608-4186f232.pth',
        'https://raw.githubusercontent.com/open-mmlab/mmediting/master/tests/data/face/000001.png']
names = ['srcnn.pth', 'face.png']
for url, name in zip(urls, names):
    if not os.path.exists(name):
        open(name, 'wb').write(requests.get(url).content)


def init_torch_model():
    torch_model = SuperResolutionNet(upscale_factor=3)

    state_dict = torch.load('srcnn.pth')['state_dict']

    # Adapt the checkpoint
    for old_key in list(state_dict.keys()):
        new_key = '.'.join(old_key.split('.')[1:])
        state_dict[new_key] = state_dict.pop(old_key)

    torch_model.load_state_dict(state_dict)
    torch_model.eval()
    return torch_model


model = init_torch_model()
input_img = cv2.imread('face.png').astype(np.float32)

# HWC to NCHW
input_img = np.transpose(input_img, [2, 0, 1])
input_img = np.expand_dims(input_img, 0)

# Inference
torch_output = model(torch.from_numpy(input_img)).detach().numpy()

#为了让模型输出成正确的图片格式，我们把模型的输出转换成 HWC 格式，并保证每一通道的颜色值都在 0~255 之间。
# NCHW to HWC
torch_output = np.squeeze(torch_output, 0)
torch_output = np.clip(torch_output, 0, 255)
torch_output = np.transpose(torch_output, [1, 2, 0]).astype(np.uint8)

#如果脚本正常运行的话，一幅超分辨率的人脸照片会保存在 “face_torch.png” 中
# Show image
cv2.imwrite("face_torch.png", torch_output)


#PyTorch的模型转换成ONNX格式的模型
x = torch.randn(1, 3, 256, 256)
#前三个参数分别是要转换的模型、模型的任意一组输入、导出的 ONNX 文件的文件名
#剩下的参数中，opset_version 表示 ONNX 算子集的版本
#如果上述代码运行成功，目录下会新增一个"srcnn.onnx"的 ONNX 模型文件
with torch.no_grad():
    torch.onnx.export(
        model,
        x,
        "srcnn.onnx",
        opset_version=11,
        input_names=['input'],
        output_names=['output'])

#我们可以用下面的脚本来验证一下模型文件是否正确
import onnx
#其中，onnx.load 函数用于读取一个 ONNX 模型。
#onnx.checker.check_model 用于检查模型格式是否正确，如果有错误的话该函数会直接报错。
#我们的模型是正确的，控制台中应该会打印出"Model correct"
onnx_model = onnx.load("srcnn.onnx")
try:
    onnx.checker.check_model(onnx_model)
except Exception:
    print("Model incorrect")
else:
    print("Model correct")

#用 ONNX Runtime 运行一下模型，完成模型部署的最后一步
#如果代码正常运行的话，另一幅超分辨率照片会保存在"face_ort.png"中
import onnxruntime

ort_session = onnxruntime.InferenceSession("srcnn.onnx")
ort_inputs = {'input': input_img}
ort_output = ort_session.run(['output'], ort_inputs)[0]

ort_output = np.squeeze(ort_output, 0)
ort_output = np.clip(ort_output, 0, 255)
ort_output = np.transpose(ort_output, [1, 2, 0]).astype(np.uint8)
cv2.imwrite("face_ort.png", ort_output)