I have been working on quantization recently, and read this paper on my advisor's recommendation. It is a 2018 Google publication, very well done: the explanation is thorough, and working code is available.
一、References
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
The rising popularity of intelligent mobile devices and the daunting computational cost of deep learning-based models call for efficient and accurate on-device inference schemes. We propose a quantization scheme that allows inference to be carried out using integer-only arithmetic, which can be implemented more efficiently than floating point inference on commonly available integer-only hardware. We also co-design a training procedure to preserve end-to-end model accuracy post quantization. As a result, the proposed quantization scheme improves the tradeoff between accuracy and on-device latency. The improvements are significant even on MobileNets, a model family known for run-time efficiency, and are demonstrated in ImageNet classification and COCO detection on popular CPUs.
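The key idea behind "integer-only arithmetic" in the abstract is that with the affine mapping r = S·(q − Z), a real-valued dot product can be accumulated entirely in integers and rescaled by a float only once at the end. A minimal sketch (the function name and parameters are illustrative, not from the paper's code or any TensorFlow API):

```python
# Hedged sketch: integer-only dot product of two quantized vectors.
# Each real value is represented as r = s * (q - z); the products
# (q1 - z1) * (q2 - z2) are accumulated as plain integers (int32 on
# real hardware), and the combined scale s1 * s2 is applied once.

def int_dot(q1, q2, z1, z2, s1, s2):
    acc = 0  # integer accumulator
    for a, b in zip(q1, q2):
        acc += (a - z1) * (b - z2)  # integer-only multiply-accumulate
    return s1 * s2 * acc            # single floating-point rescale
```

For example, with scales 0.1 and zero-points 128 on both sides, the quantized vectors [138, 118] and [148, 128] represent [1.0, −1.0] and [2.0, 0.0], and `int_dot` recovers their real dot product 2.0.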
Article walkthrough (in Chinese):
Google CVPR 2018 paper: CNN quantization techniques
Additionally, the minimum and maximum values for activations are determined during training. This allows a model trained with quantization in the loop to be converted to a fixed point inference model with little effort, eliminating the need for a separate calibration step.
(Calibration is the step that determines the value ranges of the parameters.)
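The min/max ranges recorded during training are exactly what is needed to derive the quantization parameters: a scale S and a zero-point Z such that r ≈ S·(q − Z). A minimal sketch under that scheme (function names are illustrative, not from TensorFlow):

```python
# Hedged sketch of affine (asymmetric) uint8 quantization:
# derive scale and zero-point from an observed [min, max] range,
# then quantize/dequantize real values against them.

def quant_params(rmin, rmax, num_bits=8):
    """Derive scale S and zero-point Z from a recorded real range."""
    qmin, qmax = 0, 2 ** num_bits - 1
    # The range must include 0 so that real zero is exactly representable.
    rmin, rmax = min(rmin, 0.0), max(rmax, 0.0)
    scale = (rmax - rmin) / (qmax - qmin)
    zero_point = round(qmin - rmin / scale)
    return scale, zero_point

def quantize(r, scale, zero_point, num_bits=8):
    q = round(r / scale) + zero_point
    return max(0, min(2 ** num_bits - 1, q))  # clamp to the uint8 range

def dequantize(q, scale, zero_point):
    return scale * (q - zero_point)
```

For example, an activation range of [−1, 1] gives a scale of 2/255 and a zero-point of 128; quantizing then dequantizing any value in that range recovers it to within one scale step.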
二、Implementation
GitHub code:
https://github.com/tensorflow/models/blob/master/research/slim/nets/mobilenet_v1.md
The linked model tar files contain the following:
- Trained model checkpoints:
mobilenet_v1_1.0_224.ckpt.data-00000-of-00001 (stores the variables and their values)
mobilenet_v1_1.0_224.ckpt.index
mobilenet_v1_1.0_224.ckpt.meta (stores the graph structure)
- Eval graph text protos (to be easily viewed): mobilenet_v1_1.0_224_eval.pbtxt
- Frozen trained models: mobilenet_v1_1.0_224_frozen.pb (model size: 17,173,742 bytes)
- Info file containing input and output information:mobilenet_v1_1.0_224_info.txt
- Converted TensorFlow Lite flatbuffer model: mobilenet_v1_1.0_224.tflite (model size: 4,276,000 bytes)
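The two sizes above are consistent with the paper's scheme: quantizing float32 weights down to uint8 should shrink the model by roughly 4x, with the remainder attributable to format overhead and any layers left unquantized. A quick check of the ratio (this is my own back-of-the-envelope reading of the numbers, not a claim from the repository):

```python
# Compare the frozen float graph against the TFLite flatbuffer.
# Sizes are the byte counts listed for the two files above.
frozen_pb_bytes = 17_173_742  # mobilenet_v1_1.0_224_frozen.pb (float32)
tflite_bytes = 4_276_000      # mobilenet_v1_1.0_224.tflite
ratio = frozen_pb_bytes / tflite_bytes
print(f"size ratio: {ratio:.2f}x")  # prints "size ratio: 4.02x"
```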
Note that