泛读论文:Self-taught Learning: Transfer Learning from Unlabeled Data

Self-taught Learning: Transfer Learning from Unlabeled Data

ICML2007

问题

  • Semi-supervised learning typically makes the additional assumption that the unlabeled data can be labeled with the same labels as the classification task, and that these labels are merely unobserved (Nigam et al., 2000)
  • Transfer learning typically requires further labeled data from a different but related task, and at its heart typically transfers knowledge from one supervised learning task to another; thus it requires additional labeled (and therefore often expensive-to-obtain) data, rather than unlabeled data, for these other supervised learning tasks.1 (Thrun, 1996; Caruana, 1997; Ando & Zhang, 2005)
  • 就是说半监督学习虽然可以用没标签的数据,但是要跟有标签的数据一个类别
  • 迁移学习是用的有标签数据
  • 本文提出的自学习(Self-taught Learning)自学习对于未标记的数据有很少的限制,相比于半监督学习和迁移学习更简单
  • 本算法中基向量的个数可以比输入维数大很多(PCA不行)

方法

  • 启发:作者发现许多从网上随机下载的图像也包含与大象和犀牛类似的基本视觉模式(如边缘)
  • 高层表达学习:用无标签的数据训练获得基向量b
    • m i n i m i z e b , a ∑ i ∣ ∣ x u ( i ) − ∑ j a j ( i ) b j ∣ ∣ 2 2 + β ∣ ∣ a ( i ) ∣ ∣ 1 minimize_{b,a} \sum_i||x_u^{(i)}-\sum_ja_j^{(i)}b_j||_2^2+\beta||a^{(i)}||_1 minimizeb,aix
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值