域适应学习笔记:visual Domain Adaptation

本文详细介绍了域适应学习,特别是在视觉领域的应用。文章首先定义了域适应的概念,然后探讨了几种基本的域适应类型,包括半监督、无监督、多域和异质域适应。接着,文章列举了如协变量漂移、类别不平衡、多任务学习等与域适应相关的学习方法,并讨论了视觉领域中的特征增强、特征转换、参数适应等方法。最后,提到了域适应在人脸识别、人脸合成和物体识别等领域的应用。
摘要由CSDN通过智能技术生成

在迁移学习中, 当源域和目标的数据分布不同 ,但两个任务相同时,这种 特殊 的迁移学习 叫做域适应 (Domain Adaptation,DA )。因为其任务相同,所以根据上篇博客中的内容可知,域适应属于一种直推式迁移学习。它在2006年由Daumeaume等人首次提出[1]。
1.域适应的一些基本公式
源域由充足的带标签数据组成,目标域由不充足的带标签数据或者充足的无标签数据组成。且目标域与源域中数据的分布不相同。X,Y分别表示带标签数据的数据以及标签, P(X,Y) 是X,Y的联合概率分布(joint distribution)(联合概率分布的几何意义为:如果将二维随机变量(X,Y)看成是平面上随机点的坐标,那么分布函数F(x,y)在(x,y)处的函数值就是随机点(X,Y)落在以点(x,y)为顶点而位于该点左下方的无穷矩形域内的概率。)。 P(X,Y)s P(X,Y)t 分别对应源域以及目标域的联合分布函数。 Ps(X) , Ps(Y) , Pt(X) , Pt(Y) 表示源域和目标域中X以及Y的边缘分布(marginal distributions)(某一组概率的加和,叫边缘概率。边缘概率的分布情况,就叫边缘分布)。 Ps(X|Y) , Ps(Y|X) , Pt(X|Y) ,

Domain adaptation is an active, emerging research area that attempts to address the changes in data distribution across training and testing datasets. With the availability of a multitude of image acquisition sen- sors, variations due to illumination, and viewpoint among others, com- puter vision applications present a very natural test bed for evaluating domain adaptation methods. In this monograph, we provide a compre- hensive overview of domain adaptation solutions for visual recognition problems. By starting with the problem description and illustrations, we discuss three adaptation scenarios namely, (i) unsupervised adap- tation where the “source domain” training data is partially labeled and the “target domain” test data is unlabeled, (ii) semi-supervised adaptation where the target domain also has partial labels, and (iii) multi-domain heterogeneous adaptation which studies the previous two settings with the source and/or target having more than one domain, and accounts for cases where the features used to represent the data in each domain are different. For all these topics we discuss existing adaptation techniques in the literature, which are motivated by the principles of max-margin discriminative learning, manifold learning, sparse coding, as well as low-rank representations. These techniques have shown improved performance on a variety of applications such as object recognition, face recognition, activity analysis, concept clas- sification, and person detection. We then conclude by analyzing the challenges posed by the realm of “big visual data”, in terms of the generalization ability of adaptation algorithms to unconstrained data acquisition as well as issues related to their computational tractability, and draw parallels with the efforts from vision community on image transformation models, and invariant descriptors so as to facilitate im- proved understanding of vision problems under uncertainty.
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值