Self-Supervised Learning Series (3): A Simple Framework for Contrastive Learning of Visual Representations

FancyCode Artist

Original article: https://medium.com/%E8%BB%9F%E9%AB%94%E4%B9%8B%E5%BF%83/self-supervised-learning%E5%8F%AF%E4%BB%A5%E5%BE%88%E7%B0%A1%E5%96%AE-bylo%E8%88%87simsiam%E7%9A%84%E8%A7%80%E9%BB%9E-bac4bbdaaaf9

First, the basic information about the paper.

Paper: A simple framework for contrastive learning of visual representations

Link: http://arxiv.org/abs/2002.05709

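Main idea

SimCLR is an important milestone in self-supervised learning (SSL) and contrastive learning. Its most distinctive contribution is a systematic study of data augmentations as the inductive bias for SSL. It also uses the mutual repulsion between different samples to strengthen the learning objective, which keeps contrastive learning from suffering output collapse.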

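The overall procedure has three steps:

1. Sample a batch of images.
2. Apply two different data augmentations to every image in the batch.
3. Pull the results of different augmentations of the same image close together, and push them away from all other results.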
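The three steps above can be sketched in plain Python. This is only an illustration under stated assumptions: the "images" are toy 1-D lists of floats, and `random_flip`, `random_brightness`, `make_views`, and `positive_index` are hypothetical helper names, not part of any SimCLR codebase. The paper itself uses random cropping with resizing, color distortion, and Gaussian blur on real images.

```python
import random

def random_flip(image, rng):
    # Flip the toy 1-D "image" with probability 0.5.
    return image[::-1] if rng.random() < 0.5 else list(image)

def random_brightness(image, rng, max_delta=0.2):
    # Shift every pixel by the same random offset (max_delta is an assumed value).
    delta = rng.uniform(-max_delta, max_delta)
    return [p + delta for p in image]

def augment(image, rng):
    # Compose the toy augmentations into one random transformation.
    return random_brightness(random_flip(image, rng), rng)

def make_views(batch, rng):
    # Step 2: two independently augmented views per image.
    # Views 2k and 2k+1 both come from image k.
    views = []
    for image in batch:
        views.append(augment(image, rng))
        views.append(augment(image, rng))
    return views

def positive_index(i):
    # Step 3: the positive partner of view i is the other view of the same image.
    # Every remaining view in the batch acts as a negative.
    return i ^ 1

rng = random.Random(0)
dataset = [[rng.random() for _ in range(8)] for _ in range(100)]
batch = rng.sample(dataset, 4)      # Step 1: sample a batch of 4 images
views = make_views(batch, rng)
partners = [positive_index(i) for i in range(len(views))]
print(len(views), partners)         # 8 [1, 0, 3, 2, 5, 4, 7, 6]
```

Interleaving the two views of each image keeps the bookkeeping simple: the positive partner is found by flipping the lowest bit of the index, so no separate label array is needed.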

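Architecturally, SimCLR can be roughly divided into two stages: first a large embedding network performs feature extraction to produce y, and then a small network projects y into a space of fixed dimension to produce z.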
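A minimal sketch of this two-stage structure follows. It is a toy stand-in, not the paper's model: the encoder here is a single linear layer with ReLU (the paper uses a ResNet), the layer sizes are arbitrary, and the class names `Encoder` and `ProjectionHead` are assumptions made for illustration. The projection head follows the shape described in the paper, an MLP with one hidden layer.

```python
import random

def linear(x, weights):
    # Matrix-vector product: weights is a list of rows.
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]

def relu(x):
    return [max(0.0, v) for v in x]

def init(rows, cols, rng):
    # Small random weights; the scale 0.1 is an arbitrary choice.
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

class Encoder:
    """Stage 1: maps an augmented view v to the representation y."""
    def __init__(self, in_dim, rep_dim, rng):
        self.w = init(rep_dim, in_dim, rng)

    def __call__(self, v):
        return relu(linear(v, self.w))

class ProjectionHead:
    """Stage 2: small MLP that maps y to the fixed-dimension vector z."""
    def __init__(self, rep_dim, hidden_dim, out_dim, rng):
        self.w1 = init(hidden_dim, rep_dim, rng)
        self.w2 = init(out_dim, hidden_dim, rng)

    def __call__(self, y):
        return linear(relu(linear(y, self.w1)), self.w2)

rng = random.Random(0)
encoder = Encoder(in_dim=8, rep_dim=16, rng=rng)
head = ProjectionHead(rep_dim=16, hidden_dim=16, out_dim=4, rng=rng)

v = [rng.random() for _ in range(8)]   # one augmented view
y = encoder(v)                         # representation
z = head(y)                            # projection used by the contrastive loss
print(len(y), len(z))                  # 16 4
```

In the paper, the projection head is used only during pre-training; it is discarded afterwards, and the encoder output is what downstream tasks consume.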

Appendix: pseudocode provided by the authors

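Discussion

This small projection network is another distinctive feature of SimCLR. For the same x, data augmentation produces different views v. The networks extract features from each view and project them to fixed-dimension vectors, a contrastive loss is computed on z, and both stages of the network are trained jointly and directly with gradient descent.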
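The contrastive loss on z can be sketched as follows. The paper calls its loss NT-Xent (normalized temperature-scaled cross entropy); the version below is a plain-Python illustration of that idea, assuming the 2N projections are ordered so that rows 2k and 2k+1 are the two views of image k. The function name `nt_xent` and the default temperature of 0.5 are assumptions for this sketch, and only the loss value is computed, not its gradient.

```python
import math

def l2_normalize(z):
    # Scale z to unit length so the dot product equals cosine similarity.
    norm = math.sqrt(sum(v * v for v in z)) or 1.0
    return [v / norm for v in z]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def nt_xent(zs, temperature=0.5):
    """Average contrastive loss over 2N projections.

    Rows 2k and 2k+1 of zs are assumed to be two views of image k.
    """
    zs = [l2_normalize(z) for z in zs]
    n = len(zs)
    total = 0.0
    for i in range(n):
        j = i ^ 1  # index of the positive partner
        # Similarities to every other view (the anchor itself is excluded).
        sims = [dot(zs[i], zs[k]) / temperature for k in range(n) if k != i]
        # Numerically stable log-sum-exp over positive and negatives.
        m = max(sims)
        log_denominator = m + math.log(sum(math.exp(s - m) for s in sims))
        positive = dot(zs[i], zs[j]) / temperature
        total += log_denominator - positive
    return total / n

# Two images, two views each. In `aligned`, views of the same image agree;
# in `shuffled`, each view agrees with a view of the other image instead.
aligned = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
shuffled = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
print(nt_xent(aligned) < nt_xent(shuffled))  # True
```

The loss is low when the two views of an image land close together and far from everything else in the batch, which is exactly the attraction and repulsion described above.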

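Although the SimCLR method is simple, one practical drawback is that it needs a large number of online negative samples to provide the repulsive force. The paper uses a batch size of 4096, and such a large batch in turn requires LARS as the optimizer.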
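To make the dependence on batch size concrete: with N images per batch and two views per image, each anchor view has one positive and 2(N - 1) negatives drawn from the same batch. The small helper below (the name `negatives_per_anchor` is hypothetical) just evaluates that count.

```python
def negatives_per_anchor(batch_size):
    # 2N views in total, minus the anchor itself and its one positive partner.
    return 2 * batch_size - 2

for n in (256, 4096, 8192):
    print(n, negatives_per_anchor(n))
# 256 510
# 4096 8190
# 8192 16382
```

Because the negatives come only from the current batch, the only way to get more of them in this setup is to enlarge the batch.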

Results

ImageNet accuracies of linear classifiers

ImageNet accuracy of models trained with few labels

Comparison of transfer learning performance of our self-supervised approach with supervised baselines

Reference

[1] A Simple Framework for Contrastive Learning of Visual Representations [ICML 2020]
