Knowledge Distillation Papers, Categorized (2014-2020)

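This post collects research on knowledge distillation published between 2014 and 2020. It covers different forms of knowledge (logits, intermediate layers, graph-based, mutual information, and others) and how knowledge distillation combines with other techniques such as GANs, meta-learning, unsupervised learning, RL, and multi-teacher learning. It also covers applications of knowledge distillation in NLP, model compression, and quantization.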

Awesome Knowledge-Distillation

Different forms of knowledge

Knowledge from logits

  1. Distilling the knowledge in a neural network. Hinton et al. arXiv:1503.02531
  2. Learning from Noisy Labels with Distillation. Li, Yuncheng et al. ICCV 2017
  3. Training Deep Neural Networks in Generations: A More Tolerant Teacher Educates Better Students. arXiv:1805.05551
  4. Knowledge distillation by on-the-fly native ensemble. Lan, Xu et al. NIPS 2018
  5. Learning Metrics from Teachers: Compact Networks for Image Embedding. Yu, Lu et al. CVPR 2019
  6. Relational Knowledge Distillation. Park, Wonpyo et al. CVPR 2019
  7. Like What You Like: Knowledge Distill via Neuron Selectivity Transfer. Huang, Zehao and Wang, Naiyan. 2017
  8. On Knowledge Distillation from Complex Networks for Response Prediction. Arora, Siddhartha et al. NAACL 2019
  9. On the Efficacy of Knowledge Distillation. Cho, Jang Hyun and Hariharan, Bharath. arXiv:1910.01348. ICCV 2019
  10. [novel] Revisit Knowledge Distillation: a Teacher-free Framework. Yuan, Li et al. arXiv:1909.11723
  11. Improved Knowledge Distillation via Teacher Assistant: Bridging the Gap Between Student and Teacher. Mirzadeh et al. arXiv:1902.03393
  12. Ensemble Distribution Distillation. ICLR 2020
  13. Noisy Collaboration in Knowledge Distillation. ICLR 2020
  14. On Compressing U-net Using Knowledge Distillation. arXiv:1812.00249
  15. Distillation-Based Training for Multi-Exit Architectures. Phuong, Mary and Lampert, Christoph H. ICCV 2019
  16. Self-training with Noisy Student improves ImageNet classification. Xie, Qizhe et al. (Google) CVPR 2020
  17. Variational Student: Learning Compact and Sparser Networks in Knowledge Distillation Framework. arXiv:1910.12061
  18. Preparing Lessons: Improve Knowledge Distillation with Better Supervision. arXiv:1911.07471
  19. Adaptive Regularization of Labels. arXiv:1908.05474
  20. Positive-Unlabeled Compression on the Cloud. Xu, Yixing (HUAWEI) et al. NIPS 2019
  21. Snapshot Distillation: Teacher-Student Optimization in One Generation. Yang, Chenglin et al. CVPR 2019
  22. QUEST: Quantized embedding space for transferring knowledge. Jain, Himalaya et al. CVPR 2020 (pre)
  23. Conditional teacher-student learning. Z. Meng et al. ICASSP 2019
  24. Subclass Distillation. Müller, Rafael et al. arXiv:2002.03936
  25. MarginDistillation: distillation for margin-based softmax. Svitov, David & Alyamkin, Sergey. arXiv:2003.02586
  26. An Embarrassingly Simple Approach for Knowledge Distillation. Gao, Mengya et al. MLR 2018
  27. Sequence-Level Knowledge Distillation. Kim, Yoon & Rush, Alexander M. arXiv:1606.07947
  28. Boosting Self-Supervised Learning via Knowledge Transfer. Noroozi, Mehdi et al. CVPR 2018
  29. Meta Pseudo Labels. Pham, Hieu et al. ICML 2020

Knowledge from intermediate layers

  1. Fitnets: Hints for thin deep nets. Romero, Adriana et al. arXiv:1412.6550
  2. Paying more attention to attention: Improving the performance of convolutional neural networks via attention transfer. Zagoruyko et al. ICLR 2017
  3. Knowledge Projection for Effective Design of Thinner and Faster Deep Neural Networks. Zhang, Zhi et al. arXiv:1710.09505
  4. A Gift from Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning. Yim, Junho et al. CVPR 2017
  5. Paraphrasing complex network: Network compression via factor transfer. Kim, Jangho et al. NIPS 2018
  6. Knowledge transfer with jacobian matching. ICML 2018
  7. Self-supervised knowledge distillation using singular value decomposition. Lee, Seung Hyun et al. ECCV 2018
  8. Variational Information Distillation for Knowledge Transfer. Ahn, Sungsoo et al. CVPR 2019
  9. Knowledge Distillation via Instance Relationship Graph. Liu, Yufan et al. CVPR 2019
  10. Knowledge Distillation via Route Constrained Optimization. Jin, Xiao et al. ICCV 2019
  11. Similarity-Preserving Knowledge Distillation. Tung, Frederick and Mori, Greg. ICCV 2019
  12. MEAL: Multi-Model Ensemble via Adversarial Learning. Shen,Zhiqiang,