Reading Note: ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices

TITLE: ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices

AUTHOR: Xiangyu Zhang, Xinyu Zhou, Mengxiao Lin, Jian Sun

ASSOCIATION: Megvii Inc (Face++)

FROM: arXiv:1707.01083

CONTRIBUTIONS

  1. Two operations, pointwise group convolution and channel shuffle, are proposed to greatly reduce computation cost while maintaining accuracy.

MobileNet Architecture

In MobileNet and other works, efficient depthwise separable convolutions or group convolutions strike an excellent trade-off between representation capability and computational cost. However, neither design fully takes the 1×1 convolutions (also called pointwise convolutions in MobileNet) into account, and these layers contribute considerable complexity.
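A quick back-of-the-envelope calculation (not from the paper; the layer size below is hypothetical) shows how the pointwise part dominates the cost of a depthwise separable block:

```python
# FLOPs (multiply-adds) of a depthwise separable convolution on an h x w feature map
# with c input channels, c output channels, and a 3x3 depthwise kernel.
h, w, c = 28, 28, 512                     # hypothetical layer size

depthwise_flops = h * w * c * 3 * 3       # one 3x3 filter per channel
pointwise_flops = h * w * c * c           # 1x1 conv mixing all channels

total = depthwise_flops + pointwise_flops
print(f"depthwise: {depthwise_flops:,}")                    # ~3.6M
print(f"pointwise: {pointwise_flops:,}")                    # ~205.5M
print(f"pointwise share: {pointwise_flops / total:.1%}")    # ~98%
```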

Channel Shuffle for Group Convolutions

In order to address this issue, a straightforward solution is to apply group convolutions on the 1×1 layers, just as has been done on the 3×3 layers in MobileNet. However, stacking multiple group convolutions has a side effect: the outputs of a certain channel group are derived from only a small fraction of the input channels. This blocks information flow between channel groups and weakens representation. To allow a group convolution to obtain input data from different groups, the channels in each group of the feature map produced by the previous group layer can first be divided into several subgroups, and each group in the next layer is then fed with different subgroups. This can be implemented by reshaping the output channel dimension into (g, n), transposing, and flattening it back as the input of the next layer. This is the channel shuffle operation, illustrated in the following figure (a minimal code sketch follows the figure).

Channel Shuffle
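As a minimal sketch of the reshape-transpose-flatten trick described above (NumPy is used here for illustration; this is not the authors' implementation):

```python
import numpy as np

def channel_shuffle(x, groups):
    """Channel shuffle: reshape channels into (g, n), transpose, flatten back."""
    batch, channels, h, w = x.shape
    n = channels // groups                      # channels per group
    x = x.reshape(batch, groups, n, h, w)       # split channel dim into (g, n)
    x = x.transpose(0, 2, 1, 3, 4)              # swap the group and subgroup axes
    return x.reshape(batch, channels, h, w)     # flatten back to the original layout

# Example: 8 channels in 2 groups; after shuffling, each new group of 4
# contains channels drawn from both original groups.
x = np.arange(8).reshape(1, 8, 1, 1)
print(channel_shuffle(x, groups=2).flatten())   # [0 4 1 5 2 6 3 7]
```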

ShuffleNet Unit

The following figure shows the ShuffleNet Unit.

ShuffleNet Unit

In the figure, (a) is the building block in ResNeXt, and (b) is the building block in ShuffleNet. Given the input size c×h×w and the bottleneck channels m, the ResNeXt unit requires hw(2cm + 9m²/g) FLOPs, while the ShuffleNet unit needs only hw(2cm/g + 9m) FLOPs.
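Plugging hypothetical numbers into the two formulas (the values below are chosen only for illustration, not taken from the paper) makes the gap concrete:

```python
# FLOPs of one unit for an input of size c x h x w, bottleneck channels m, and g groups.
h, w, c, m, g = 28, 28, 240, 60, 3          # hypothetical settings

resnext_flops    = h * w * (2 * c * m + 9 * m ** 2 / g)   # hw(2cm + 9m^2/g)
shufflenet_flops = h * w * (2 * c * m / g + 9 * m)        # hw(2cm/g + 9m)

print(f"ResNeXt unit:    {resnext_flops / 1e6:.1f} MFLOPs")     # ~31.0
print(f"ShuffleNet unit: {shufflenet_flops / 1e6:.1f} MFLOPs")  # ~7.9
```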

Network Architecture

Network Architecture

Comparison

Comparison
