Deploying a Modified MobileNetV1 with ESP-DL, Part 2: Model Quantization

zxfeng~

Last modified 2024-10-04 11:24:36


Category: AI. Tags: C, microcontrollers, AI, artificial intelligence

First published 2024-08-30 19:10:57

Article link: https://blog.csdn.net/qq_30479517/article/details/141725226


With the model trained and exported in the previous part, this part uses the quantization toolkit provided by esp-dl to quantize it.

Quantization tool

esp-dl

Operators supported by esp-dl

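First we need to know which operators esp-dl supports, because an unsupported operator raises an error and quantization fails. If the model contains unsupported operators, try replacing them with supported ones.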

Conv2d: handles three-dimensional tensors only
Gemm: still deployed as a Conv2d
Relu
AvgPool2d
MaxPool2d
Add
Mul
Sub
Softmax
Tanh
Sigmoid
Concat
Expand
Flatten
Max
Min
Pad
Reshape
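Squeeze: ignore it when constructing the model; otherwise the conv2d that follows it raises an error
Transpose: ONNX operates on tensors in (C, H, W) order, but our model was trained with HWC, so this layer at the input can be ignored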
Slice
Shape
Resize
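
Before running the quantizer it can help to list which operator types in a model fall outside this list. The following is a minimal sketch, not part of esp-dl: SUPPORTED_OPS is copied from the list above, while ONNX_ALIASES and unsupported_ops are hypothetical names introduced here. ONNX graphs name some operators differently from the list (for example Conv, AveragePool, MaxPool), so the alias table is an assumption to adjust for your own model.

```python
# Operator names copied from the list above.
SUPPORTED_OPS = {
    "Conv2d", "Gemm", "Relu", "AvgPool2d", "MaxPool2d", "Add", "Mul", "Sub",
    "Softmax", "Tanh", "Sigmoid", "Concat", "Expand", "Flatten", "Max", "Min",
    "Pad", "Reshape", "Squeeze", "Transpose", "Slice", "Shape", "Resize",
}

# Assumed mapping from ONNX op_type names to the names used in the list.
ONNX_ALIASES = {"Conv": "Conv2d", "AveragePool": "AvgPool2d", "MaxPool": "MaxPool2d"}


def unsupported_ops(op_types, aliases=ONNX_ALIASES):
    """Return the sorted op types that are not found in SUPPORTED_OPS."""
    missing = set()
    for op in op_types:
        if aliases.get(op, op) not in SUPPORTED_OPS:
            missing.add(op)
    return sorted(missing)


# With the onnx package installed, the op types of a model can be read with
# (the file name is a placeholder):
#   op_types = [node.op_type for node in onnx.load("model.onnx").graph.node]
print(unsupported_ops(["Conv", "Relu", "MaxPool", "LeakyRelu", "Gemm"]))
```

The conversion log later in this article shows a GlobalAveragePool layer being converted, so the list above may not be exhaustive. Treat a reported operator as something to verify rather than a certain failure.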

Environment setup

Requirements

Python == 3.7
Numba == 0.53.1
ONNX == 1.9.0
ONNX Runtime == 1.7.0
ONNX Optimizer == 0.2.6
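
Since the toolkit pins exact versions, a quick comparison of the pins against what is installed can save a failed run. The sketch below only does the comparison; version_mismatches is a hypothetical helper, and the dictionary keys are assumed to be the PyPI distribution names of the packages listed above.

```python
# Pins copied from the requirements above; the keys are assumed to be the
# PyPI distribution names of those packages.
REQUIRED = {
    "numba": "0.53.1",
    "onnx": "1.9.0",
    "onnxruntime": "1.7.0",
    "onnxoptimizer": "0.2.6",
}


def version_mismatches(installed, required=REQUIRED):
    """Return (package, required, found) for each pin that is not met.

    `installed` maps package name to version string; a package that is
    missing from it is reported with found=None.
    """
    problems = []
    for name in sorted(required):
        found = installed.get(name)
        if found != required[name]:
            problems.append((name, required[name], found))
    return problems


# Inside the environment, `installed` could be built from `pip freeze` output.
# The dictionary below is made-up example data.
example = {"numba": "0.53.1", "onnx": "1.12.0", "onnxruntime": "1.7.0"}
for name, want, got in version_mismatches(example):
    print("{}: need {}, found {}".format(name, want, got))
```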

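Configuring pyenv

Installing pyenv

Installing pyenv directly with apt on Ubuntu fails because the package cannot be found. Install it with the following command instead: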

curl -L https://github.com/pyenv/pyenv-installer/raw/master/bin/pyenv-installer | bash

Once the installer finishes, follow its printed instructions and append the following lines to the end of ~/.bashrc:

export PYENV_ROOT="$HOME/.pyenv"
export PATH="$PYENV_ROOT/bin:$PATH"
eval "$(pyenv init --path)"
eval "$(pyenv virtualenv-init -)"

Then run source ~/.bashrc, after which pyenv works normally.

Installing a new Python version

pyenv install 3.7.17

To see all installed Python versions afterwards, run:

pyenv versions

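It lists every Python version that pyenv has installed.

Creating a new environment

Run pyenv local followed by a Python version from inside your working directory to select that version for the directory; pyenv records the choice in a .python-version file there.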

pyenv local 3.7.17

After the change, check the current Python version with:

python --version
Python 3.7.17

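The Python version is now 3.7.17, as required.

Next, in the working directory, create the Python virtual environment. The environment itself lives in .venv; the trailing "esp-dl-quant" is the name shown in the shell prompt, and you can change it to whatever you like.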

python -m venv .venv --prompt esp-dl-quant
echo "*" > .venv/.gitignore

Installing dependencies

Once the environment is created, activate it and install the required packages.

source .venv/bin/activate
pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple
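
To confirm from Python itself that the activated environment is the one in use, a short check like the following can be run. It is a minimal sketch; in_virtualenv and is_required_python are hypothetical helper names, and the check relies on the standard venv behaviour that sys.prefix differs from sys.base_prefix inside an environment.

```python
import sys


def in_virtualenv():
    """True when the interpreter runs inside a venv-created environment."""
    # venv points sys.prefix at the environment, while sys.base_prefix keeps
    # pointing at the base interpreter installation.
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def is_required_python(version_info=None, required=(3, 7)):
    """True when the major.minor version equals the required one."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) == tuple(required)


print("virtual environment active:", in_virtualenv())
print("Python 3.7:", is_required_python())
```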

Finally, to leave the environment, run:

deactivate

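Model quantization and conversion

The calibrator in the quantization toolkit quantizes a floating-point model into an integer model that ESP-DL can run. For post-training quantization, prepare a calibration set as in the following example; it can be a subset of the training set or the validation set: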

import pickle

import numpy as np
import onnx
from calibrator import Calibrator  # shipped with the esp-dl quantization tool

# cal_path (pickled calibration data) and optimized_model_path (optimized
# ONNX model) must be defined before this point.

# Load the calibration data
with open(cal_path, 'rb') as f:
    (test_images, test_labels) = pickle.load(f)

# Prepare the calibration dataset: a single image with a batch dimension added
calib_dataset = np.expand_dims(test_images[0], axis=0)
pickle_file_path = 'cal_data.pickle'

# Calibration
model_proto = onnx.load(optimized_model_path)
print('Generating the quantization table:')

# Initialize a calibrator that quantizes the optimized model to int16 using the per-tensor minmax method
calib = Calibrator('int16', 'per-tensor', 'minmax')
calib.set_providers(['CPUExecutionProvider'])

# Obtain the quantization parameters and save them to pickle_file_path
calib.generate_quantization_table(model_proto, calib_dataset, pickle_file_path)

# Generate the coefficient files for esp32s3 from the same quantization table;
# the file name matches the output files shown in the log below
calib.export_coefficient_to_cpp(model_proto, pickle_file_path, 'esp32s3', '.', 'cat_vs_dog_coefficient', True)
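
The script above expects cal_path to point to a pickle that holds an (images, labels) tuple. A minimal sketch of how such a file could be produced from a subset of a validation set follows. The helper name save_calibration_subset, the stride of 50, the file name, and the plain lists standing in for image arrays are all illustrative assumptions; the same slicing also works on NumPy arrays.

```python
import os
import pickle
import tempfile


def save_calibration_subset(images, labels, path, step=50):
    """Keep every `step`-th sample and pickle it as an (images, labels) tuple."""
    subset = (images[::step], labels[::step])
    with open(path, "wb") as f:
        pickle.dump(subset, f)
    return len(subset[0])


# Stand-in data: 200 fake "images" with alternating labels.
images = [[i, i + 1] for i in range(200)]
labels = [i % 2 for i in range(200)]

path = os.path.join(tempfile.mkdtemp(), "calibration.pickle")
kept = save_calibration_subset(images, labels, path)

# Reload the file the same way the calibration script does.
with open(path, "rb") as f:
    cal_images, cal_labels = pickle.load(f)
print(kept, len(cal_images), len(cal_labels))
```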

Run the prepared conversion script with:

python example.py

If the conversion succeeds, the model coefficient files are generated.

Information about each layer of the model is also printed to the command line:

Generating the quantization table:
Converting coefficient to int16 per-tensor quantization for esp32s3
Exporting finish, the output files are: ./cat_vs_dog_coefficient.cpp, ./cat_vs_dog_coefficient.hpp

Quantized model info:
model input name: input_1, exponent: -15
Transpose layer name: StatefulPartitionedCall/model/conv1/Conv2D__6, output_exponent: -15
Conv layer name: StatefulPartitionedCall/model/conv1/Conv2D, output_exponent: -11
DepthwiseConv layer name: StatefulPartitionedCall/model/conv_dw_1/depthwise, output_exponent: -10
Conv layer name: StatefulPartitionedCall/model/conv_pw_1/Conv2D, output_exponent: -9
DepthwiseConv layer name: StatefulPartitionedCall/model/conv_dw_2/depthwise, output_exponent: -10
Conv layer name: StatefulPartitionedCall/model/conv_pw_2/Conv2D, output_exponent: -10
DepthwiseConv layer name: StatefulPartitionedCall/model/conv_dw_4/depthwise, output_exponent: -10
Conv layer name: StatefulPartitionedCall/model/conv_pw_4/Conv2D, output_exponent: -10
DepthwiseConv layer name: StatefulPartitionedCall/model/conv_dw_5/depthwise, output_exponent: -10
Conv layer name: StatefulPartitionedCall/model/conv_pw_5/Conv2D, output_exponent: -11
DepthwiseConv layer name: StatefulPartitionedCall/model/conv_dw_6/depthwise, output_exponent: -11
Conv layer name: StatefulPartitionedCall/model/conv_pw_6/Conv2D, output_exponent: -12
GlobalAveragePool layer name: StatefulPartitionedCall/model/global_average_pooling2d/Mean, output_exponent: -12
Squeeze layer name: StatefulPartitionedCall/model/global_average_pooling2d/Mean_Squeeze__118, output_exponent: -12
Gemm layer name: fused_gemm_0, output_exponent: -10
Softmax layer name: StatefulPartitionedCall/model/softmax/Softmax, output_exponent: -14

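This output includes the quantization exponents of the model input and of each layer's output, which are needed later when deploying the model.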
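
To make the exponents concrete: under the power-of-two fixed-point convention, a stored integer q with exponent e represents the real value q * 2**e, so an int16 tensor with exponent -15 covers roughly [-1, 1) and one with exponent -10 covers roughly [-32, 32). The sketch below illustrates that convention in plain Python. The function names are hypothetical, and the rule in exponent_for is one common min-max choice, not necessarily the exact rule the esp-dl calibrator applies.

```python
import math

INT16_MIN, INT16_MAX = -32768, 32767


def exponent_for(max_abs):
    """Smallest exponent e such that max_abs / 2**e fits in int16 (max_abs > 0)."""
    return math.ceil(math.log2(max_abs / INT16_MAX))


def quantize(x, exponent):
    """Float -> int16 under the convention real = q * 2**exponent."""
    q = int(round(x / 2.0 ** exponent))
    return max(INT16_MIN, min(INT16_MAX, q))  # saturate to the int16 range


def dequantize(q, exponent):
    """int16 -> float under the same convention."""
    return q * 2.0 ** exponent


def value_range(exponent):
    """Smallest and largest real values an int16 tensor can represent."""
    return dequantize(INT16_MIN, exponent), dequantize(INT16_MAX, exponent)


# Exponents taken from the log above: the model input and the Gemm output.
for name, e in [("input_1", -15), ("fused_gemm_0", -10)]:
    low, high = value_range(e)
    print(name, "step", 2.0 ** e, "range", low, high)

q = quantize(0.5, -15)
print(q, dequantize(q, -15))
```

A smaller (more negative) exponent gives finer resolution but a narrower range, which is why layers with larger activations in the log end up with exponents such as -9 or -10.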

Finally, the script also prints the model's accuracy on the quantization dataset.