Taskonomy: Disentangling Task Transfer Learning
Amir R. Zamir, Alexander Sax, William Shen, Leonidas Guibas, Jitendra Malik, Silvio Savarese (instructor of Computer Vision: from 3D reconstruction to recognition (CS 231A))
Abstract
Visual tasks are related: for example, having surface normals simplifies estimating the depth of an image ⇒ a structure exists among visual tasks.
Problem: how do we discover this structure?
Solution: a computational taxonomic map for task transfer learning, i.e. a fully computational approach for modeling the structure of the space of visual tasks, finding (first- and higher-order) transfer-learning dependencies across a dictionary of twenty-six 2D, 2.5D, 3D, and semantic tasks in a latent space.
Use: nontrivial relationships emerge, which can be exploited to reduce the demand for labeled data.
Introduction
Some tasks, such as object detection, depth estimation, and edge detection, have obvious relationships with one another: surface normals are related to depth estimation, and (as shown in the figure) 3D edges help point matching. Other task pairs have no obvious connection. Existing computer vision has gone ever further down the road of ignoring these cross-task relationships; clearly, if we could exploit them, we could reduce the amount of data required for learning.
Difficulty: this task-space structure and its effects are still largely unknown.
Solution: a framework for mapping the space of visual tasks, using neural networks as the adopted computational function class; each layer successively forms more abstract representations of the input, containing the information needed for mapping the input to the output.
Computes an affinity matrix among tasks based on whether the solution for one task can be sufficiently easily read out of the representation trained for another task.
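The affinity idea above can be sketched in a few lines. This is a hypothetical toy, not the paper's actual pipeline: the task names, loss values, and the best-source normalization are illustrative stand-ins (the paper uses an AHP-based ordinal normalization over real transfer networks).

```python
# Toy sketch of a task-affinity matrix built from transfer test losses.
# transfer_loss[s][t]: test loss of a small readout network mapping the
# frozen representation of source task s to target task t (lower is better).
tasks = ["depth", "normals", "edges"]

transfer_loss = {
    "depth":   {"depth": 0.10, "normals": 0.30, "edges": 0.50},
    "normals": {"depth": 0.20, "normals": 0.08, "edges": 0.45},
    "edges":   {"depth": 0.60, "normals": 0.55, "edges": 0.12},
}

def affinity_matrix(tasks, transfer_loss):
    """Affinity of s->t: loss of the best source for t divided by the loss
    of s, so 1.0 marks the best source for each target and lower is worse."""
    A = {}
    for t in tasks:
        best = min(transfer_loss[s][t] for s in tasks)
        for s in tasks:
            A[(s, t)] = best / transfer_loss[s][t]
    return A

A = affinity_matrix(tasks, transfer_loss)
print(A[("normals", "depth")])  # normals is half as good a source for depth
```

With per-target normalization, affinities are comparable across targets even though the raw losses of different tasks live on different scales.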
Binary Integer Programming (BIP) formulation: extracts a globally efficient transfer policy from these pairwise affinities.
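As a stand-in for the BIP step, the selection problem can be illustrated by brute force: choose at most γ source tasks so that, when every target reads from its best chosen source, total affinity is maximized. This exhaustive search is only feasible for toy dictionaries; the paper solves the real instance as a BIP.

```python
from itertools import combinations

def best_transfer_policy(sources, targets, affinity, gamma):
    """Exhaustively search source subsets of size <= gamma; each target
    reads from the subset's best source. Returns (chosen sources, plan)."""
    best_score, best_plan = float("-inf"), None
    for k in range(1, gamma + 1):
        for subset in combinations(sources, k):
            plan = {t: max(subset, key=lambda s: affinity[(s, t)])
                    for t in targets}
            score = sum(affinity[(plan[t], t)] for t in targets)
            if score > best_score:
                best_score, best_plan = score, (set(subset), plan)
    return best_plan

# Illustrative affinities (made-up numbers, higher is better).
affinity = {("depth", "depth"): 1.0, ("depth", "normals"): 0.6,
            ("normals", "depth"): 0.5, ("normals", "normals"): 1.0}
chosen, plan = best_transfer_policy(["depth", "normals"],
                                    ["depth", "normals"], affinity, gamma=1)
print(chosen, plan)  # with budget 1, training only "depth" serves both targets
```

The point of the global (BIP) view is visible even here: the best single source is not necessarily the best per-target source for every target, but the one with the highest collective payoff.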
Related Work
Self-supervised learning: leverages the inherent relationships between tasks to learn a desired expensive one (e.g., object detection) via a cheap surrogate; these methods use a manually-specified local part of the structure of the task space.
Meta-learning: performing the learning at a level higher than where conventional learning occurs
Domain adaptation: renders a function developed on one domain applicable to another.
Method
Maximize the collective performance on a set of target tasks T = {t1, ..., tn}, subject to the constraint that we have a limited supervision budget γ: the maximum allowable number of source tasks S that we are willing to train from scratch.
The task dictionary is T ∪ S: T − S are the tasks that we want solved but cannot train ("target-only"), T ∩ S are the tasks that we want solved and that could also serve as sources, and S − T are the "source-only" tasks which we may not directly care to solve (e.g., jigsaw puzzle) but which can optionally be used if they increase the performance on T.
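The set notation above maps directly onto set operations; a toy illustration (task names are made up):

```python
# Target tasks T (what we want solved) vs. source tasks S (what fits the budget).
T = {"depth", "normals", "segmentation"}
S = {"depth", "normals", "jigsaw"}

target_only = T - S   # want solved, but cannot train from scratch
both = T & S          # want solved, and can also serve as sources
source_only = S - T   # trained only to help transfers (e.g., jigsaw puzzle)

print(target_only, both, source_only)
```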