Why network compression: some devices, such as wearables, have limited resources (memory and compute), so we need to compress these networks to fit on such devices.
Q: Why is a larger network easier to optimize?
A plausible explanation (the lottery ticket hypothesis) is that a large network contains many small sub-networks, and every group of initial parameters is a lottery ticket. If you use a small network, you hold few tickets and the chance of drawing a winning one is small; if you use a large network, you hold many more tickets, so the probability that optimization finds a good result is larger.
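A minimal PyTorch sketch of the lottery-ticket experiment, under assumptions not in the notes: the tiny model, random data, and 80% pruning ratio are made up for illustration. The procedure is: train the large network, prune the small-magnitude weights, then rewind the surviving weights to their original initial values (the "winning ticket" is the sub-network plus its initialization).

```python
import copy
import torch
import torch.nn as nn

# Hypothetical tiny model and data, just for illustration.
model = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 2))
init_state = copy.deepcopy(model.state_dict())  # remember the "tickets"

x, y = torch.randn(128, 20), torch.randint(0, 2, (128,))
opt = torch.optim.SGD(model.parameters(), lr=0.1)

# 1) Train the large network normally.
for _ in range(100):
    opt.zero_grad()
    nn.functional.cross_entropy(model(x), y).backward()
    opt.step()

# 2) Prune: keep only the largest-magnitude weights (here, the top 20%).
masks = {}
for name, p in model.named_parameters():
    if p.dim() > 1:  # prune weight matrices, not biases
        threshold = p.abs().flatten().kthvalue(int(0.8 * p.numel())).values
        masks[name] = (p.abs() > threshold).float()

# 3) Rewind the surviving weights to their ORIGINAL initial values.
with torch.no_grad():
    for name, p in model.named_parameters():
        if name in masks:
            p.copy_(init_state[name] * masks[name])

# (The experiment then retrains this sub-network,
#  re-applying the masks after each update.)
```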
Another hypothesis: a small network can also be trained to good results directly, which is diametrically opposed to the lottery ticket hypothesis.
GPUs only speed up regular matrix computation; an irregular (sparse) structure is hard to compute efficiently. So in practice we keep the original architecture and just set the pruned weights to zero. But then the actual network is still the same size, so this kind of weight pruning yields no GPU speed-up.
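A small sketch of why that is; the 512x512 size and the magnitude threshold of 0.5 are arbitrary stand-ins for a real layer and a real pruning criterion. Zeroing entries does not shrink the weight tensor, so the dense matmul the GPU executes costs exactly the same.

```python
import torch

W = torch.randn(512, 512)          # dense weight matrix of one layer
mask = (W.abs() > 0.5).float()     # hypothetical pruning criterion
W_pruned = W * mask                # "pruned" weights are just set to zero

x = torch.randn(64, 512)
y = x @ W_pruned.t()               # still a full dense 512x512 matmul:
                                   # the GPU multiplies the zeros too

print(W_pruned.shape)                         # torch.Size([512, 512]) -- unchanged
print((W_pruned == 0).float().mean().item())  # fraction zeroed: sparsity,
                                              # but no structural saving
```

To actually get a speed-up, the pruning must be structured (e.g. removing whole neurons or channels) so the weight matrices genuinely shrink.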