Reproducing Kernel Hilbert Space
Given a nonempty set $\mathcal{X}$, let $\mathcal{H}$ be a Hilbert space of functions $f: \mathcal{X} \to \mathbb{R}$. Then $\mathcal{H}$ is called a Reproducing Kernel Hilbert Space endowed with the inner product $\langle \cdot, \cdot \rangle$ if there exists a function $k: \mathcal{X} \times \mathcal{X} \to \mathbb{R}$ with the following properties:
- $\langle f(\cdot), k(x,\cdot) \rangle = f(x)$; in particular, $\langle k(x,\cdot), k(x',\cdot) \rangle = k(x,x')$ (the reproducing property)
- $k$ spans $\mathcal{H}$: $\mathcal{H} = \operatorname{span}\{ k(x,\cdot) \mid x \in \mathcal{X} \} = \{ f(\cdot) = \sum_{i=1}^m \alpha_i k(x_i,\cdot) : m \in \mathbb{N},\ x_i \in \mathcal{X},\ \alpha_i \in \mathbb{R} \}$
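As a concrete sanity check, the reproducing property can be verified numerically for functions in the span of the kernel. The Gaussian RBF kernel and the specific centers and coefficients below are assumed purely for illustration:

```python
import numpy as np

def k(xs, zs, sigma=1.0):
    # Gaussian RBF kernel (an assumed choice; any positive-definite kernel works)
    xs, zs = np.asarray(xs, float), np.asarray(zs, float)
    return np.exp(-(xs[:, None] - zs[None, :]) ** 2 / (2 * sigma**2))

def span_inner(a, xs, b, zs):
    # <sum_i a_i k(x_i,.), sum_j b_j k(z_j,.)> = sum_ij a_i b_j k(x_i, z_j)
    return a @ k(xs, zs) @ b

# f = sum_i alpha_i k(x_i, .), an element of H built from illustrative centers
alpha = np.array([0.5, -1.0, 2.0])
xs = np.array([-1.0, 0.0, 1.5])

x = np.array([0.7])
lhs = span_inner(alpha, xs, np.array([1.0]), x)  # <f, k(x,.)>
fx = float(alpha @ k(xs, x))                     # pointwise evaluation f(x)
assert np.isclose(lhs, fx)
```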
Hilbert Space Embedding
- $\mathcal{X}$: the domain of observations
- $\mathbf{P}_x$: a probability measure on $\mathcal{X}$
- $\mathcal{Y}$: a second domain of observations
- $\mathbf{P}_y$: a probability measure on $\mathcal{Y}$
- $\mathbf{P}_{x \times y}$: a joint probability measure on $\mathcal{X} \times \mathcal{Y}$
- $\mathcal{H}$: a reproducing kernel Hilbert space (RKHS) of functions on $\mathcal{X}$ with kernel $k(x,x') := \langle \varphi(x), \varphi(x') \rangle$
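For kernels with a known finite-dimensional feature map, the identity $k(x,x') = \langle \varphi(x), \varphi(x') \rangle$ can be checked directly. The sketch below uses the homogeneous degree-2 polynomial kernel on $\mathbb{R}^2$, whose explicit feature map is standard; the specific inputs are assumed for illustration:

```python
import numpy as np

def phi(x):
    # Explicit feature map for the degree-2 homogeneous polynomial kernel on R^2:
    # k(x, x') = (x . x')^2 = <phi(x), phi(x')>
    x1, x2 = x
    return np.array([x1**2, np.sqrt(2) * x1 * x2, x2**2])

x = np.array([1.0, 2.0])
xp = np.array([3.0, -1.0])

k_direct = (x @ xp) ** 2       # kernel evaluated directly: (3 - 2)^2 = 1
k_feature = phi(x) @ phi(xp)   # inner product of explicit features
assert np.isclose(k_direct, k_feature)
```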
The mean map $\mu$ and its empirical estimate are defined as
$$\mu[\mathbf{P}_x] := \mathbf{E}_x[k(x,\cdot)], \qquad \mu[X] := \frac{1}{m} \sum_{i=1}^m k(x_i,\cdot)$$
Then $\mu[\mathbf{P}_x]$ is an element of the Hilbert space, so
$$\langle \mu[\mathbf{P}_x], f \rangle = \mathbf{E}_x[f(x)], \qquad \langle \mu[X], f \rangle = \frac{1}{m} \sum_{i=1}^m f(x_i)$$
where $X = \{x_1, x_2, \cdots, x_m\}$ is assumed to be drawn independently and identically distributed from $\mathbf{P}_x$, and $\mu[X]$ is an estimate of the mean map.
In matrix form,
$$\mu[X] = \frac{1}{m} \sum_{i=1}^m k(x_i,\cdot) = \frac{1}{m} \gamma \mathbf{1}_m, \quad \text{where} \quad \gamma := (k(x_1,\cdot), k(x_2,\cdot), \cdots, k(x_m,\cdot))$$
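A small numerical sketch of the identity $\langle \mu[X], f \rangle = \frac{1}{m} \sum_i f(x_i)$, assuming a Gaussian kernel and a test function $f$ in the span of the kernel (all specific choices are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def k(xs, zs, sigma=1.0):
    # Gaussian RBF kernel (assumed for illustration)
    xs, zs = np.asarray(xs, float), np.asarray(zs, float)
    return np.exp(-(xs[:, None] - zs[None, :]) ** 2 / (2 * sigma**2))

# Sample X = {x_1, ..., x_m}, i.i.d. draws standing in for P_x
m = 200
X = rng.normal(size=m)

# f = sum_j alpha_j k(z_j, .), a test function in H
alpha = np.array([1.0, -0.5])
zs = np.array([0.0, 1.0])

# <mu[X], f> = (1/m) sum_i sum_j alpha_j k(x_i, z_j)
lhs = (k(X, zs) @ alpha).mean()
# (1/m) sum_i f(x_i), evaluating f pointwise
rhs = np.mean([alpha @ k(zs, [x]).ravel() for x in X])
assert np.isclose(lhs, rhs)
```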
Covariance operators
Given a joint probability measure $\mathbf{P}_{x \times y}$ on $\mathcal{X} \times \mathcal{Y}$, the uncentered covariance operator $\mathcal{C}_{XY}$ (Baker, 1973) is defined as $\mathcal{C}_{XY} := \mathbb{E}_{XY}[\varphi(x) \otimes \phi(y)]$, where $\otimes$ denotes the tensor product.
Given $m$ pairs of i.i.d. observations $\{ (x^l, y^l) \}_{l=1}^m$, we denote by $\gamma = (\varphi(x^1), \varphi(x^2), \cdots, \varphi(x^m))$ and $\Phi = (\phi(y^1), \phi(y^2), \cdots, \phi(y^m))$. Conceptually, the covariance operator $\mathcal{C}_{XY}$ can then be estimated as
$$\hat{\mathcal{C}}_{XY} = \frac{1}{m} \gamma \Phi^T$$
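With an explicit finite-dimensional feature map (assumed here for illustration, so that the operator becomes an ordinary matrix), the estimator $\hat{\mathcal{C}}_{XY} = \frac{1}{m} \gamma \Phi^T$ can be computed and sanity-checked directly:

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed explicit feature maps; with finite features, C_XY is a plain matrix
def varphi(x):  # features on X
    return np.array([x, x**2])

def phi(y):     # features on Y
    return np.array([y, np.sin(y)])

m = 500
xs = rng.normal(size=m)
ys = 2 * xs + rng.normal(scale=0.1, size=m)  # toy joint distribution

Gamma = np.stack([varphi(x) for x in xs], axis=1)  # gamma, shape (2, m)
Phi = np.stack([phi(y) for y in ys], axis=1)       # Phi,   shape (2, m)

# \hat C_XY = (1/m) gamma Phi^T, the empirical (uncentered) covariance operator
C_hat = Gamma @ Phi.T / m

# Sanity check: entry (i, j) is the empirical mean of varphi_i(x) * phi_j(y)
assert np.isclose(C_hat[0, 0], np.mean(xs * ys))
```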
Notes on mean embeddings and covariance operators
tensor product: Notes on Tensor Products and the Exterior Algebra
Tensor Product Kernels: Characteristic Property and Universality
Computing tensor products
From functional analysis we have:
Let $(X, \langle \cdot, \cdot \rangle)$ be an inner product space over a field $\mathbb{F}$. For each $x \in X$, define $\Vert x \Vert := \sqrt{\langle x, x \rangle}$. Then $\Vert \cdot \Vert$ defines a norm on $X$; that is, $(X, \Vert \cdot \Vert)$ is a normed linear space over $\mathbb{F}$.
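The induced norm can be spot-checked numerically on random vectors; this is of course no proof, just a sanity check of the axioms for the standard dot product on $\mathbb{R}^n$:

```python
import numpy as np

rng = np.random.default_rng(0)

def norm(x):
    # ||x|| := sqrt(<x, x>), here with the standard dot product on R^n
    return np.sqrt(x @ x)

# Spot-check the norm axioms on random vectors
for _ in range(100):
    x, y = rng.normal(size=5), rng.normal(size=5)
    c = rng.normal()
    assert norm(x) >= 0                                      # non-negativity
    assert np.isclose(norm(c * x), abs(c) * norm(x))         # absolute homogeneity
    assert norm(x + y) <= norm(x) + norm(y) + 1e-12          # triangle inequality (Cauchy-Schwarz)
```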
Question:
The original paper mentions:
Conditional embedding operators
By analogy with the embedding of marginal distributions, the conditional density $\mathbb{P}(Y|x)$ can also be represented as an RKHS element:
$$\mu[Y|x] := \mathbb{E}_{Y|x}[\phi(Y)]$$
with each element corresponding to a particular value of $x$.
These conditional embeddings can be defined via a conditional embedding operator $\mathcal{C}_{Y|X}: \mathcal{F} \to \mathcal{G}$:
$$\mu[Y|x] = \mathcal{C}_{Y|X} \varphi(x) := \mathcal{C}_{YX} \mathcal{C}_{XX}^{-1} \varphi(x)$$
Given $m$ pairs of i.i.d. observations $\{ (x^l, y^l) \}_{l=1}^m$ from $\mathbb{P}_{x \times y}$, the conditional embedding operator can be estimated as
$$\hat{\mathcal{C}}_{Y|X} = \frac{\Phi \gamma^T}{m} \left( \frac{\gamma \gamma^T}{m} + \lambda I \right)^{-1} = \Phi (K + \lambda m I)^{-1} \gamma^T$$
where $K := \gamma^T \gamma$ with $(i,j)$th entry $k(x_i, x_j)$.
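A minimal sketch of this estimator, assuming a toy model $y = 2x + \varepsilon$ and a linear feature map $\phi(y) = y$ on the output side, in which case $\mu[Y|x] = \Phi (K + \lambda m I)^{-1} k_x$ reduces to an estimate of $\mathbb{E}[Y|x]$ (all specific choices here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def gram(xs, zs, sigma=0.5):
    # Gaussian kernel on the input domain (assumed for illustration)
    xs, zs = np.asarray(xs, float), np.asarray(zs, float)
    return np.exp(-(xs[:, None] - zs[None, :]) ** 2 / (2 * sigma**2))

# Training pairs from the assumed toy model y = 2x + noise
m = 300
xs = rng.uniform(-1, 1, size=m)
ys = 2 * xs + rng.normal(scale=0.05, size=m)

lam = 1e-3
K = gram(xs, xs)  # K := gamma^T gamma, entries k(x_i, x_j)
# Weights (K + lam*m*I)^{-1} k_x at the query point x = 0.5
W = np.linalg.solve(K + lam * m * np.eye(m), gram(xs, [0.5])).ravel()

# With phi(y) = y, mu[Y|x] = Phi (K + lam m I)^{-1} k_x is a weighted sum
# of the training y's and estimates E[Y|x] = 2x = 1.0 at x = 0.5
mu_y_given_x = ys @ W
assert abs(mu_y_given_x - 1.0) < 0.1
```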
References
A Hilbert Space Embedding for Distributions PDF
Hilbert Space Embeddings of Hidden Markov Models
Hilbert Space Embeddings of Hidden Markov Models ppt
Generalization and Equilibrium in Generative Adversarial Nets (GANs)
- class of generators: $\{ G_u, u \in \mathcal{U} \}$, where $G$ is a function $\mathbb{R}^l \to \mathbb{R}^d$ and $u$ denotes the parameters of the generator
- $x = G_u(h)$, where $h$ is drawn from an $l$-dimensional spherical Gaussian distribution
- class of discriminators: $\{ D_v, v \in \mathcal{V} \}$, where $D$ is a function $\mathbb{R}^d \to [0,1]$
- $D_v(x)$ is usually interpreted as the probability that the sample $x$ comes from the real distribution $\mathcal{D}_{real}$
Objective function:
$$\min_{u \in \mathcal{U}} \max_{v \in \mathcal{V}} \mathbb{E}_{x \sim \mathcal{D}_{real}}[\log D_v(x)] + \mathbb{E}_{x \sim \mathcal{D}_{G_u}}[\log(1 - D_v(x))]$$
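The objective can be estimated by Monte Carlo for fixed (untrained) players; the one-dimensional generator and logistic discriminator below are assumed purely to make the expression concrete:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy sketch with hand-picked, untrained players (assumed for illustration):
# real data ~ N(2, 1); generator G_u(h) = u + h maps Gaussian noise to R
u = 0.0                                  # generator parameter (scalar here)
G = lambda h: u + h

def D(x, v=1.0):
    # logistic discriminator D_v(x) = sigmoid(v * (x - 1)): probability x is "real"
    return 1.0 / (1.0 + np.exp(-v * (x - 1.0)))

m = 10000
x_real = rng.normal(loc=2.0, size=m)     # samples from D_real
x_fake = G(rng.normal(size=m))           # samples from D_{G_u}, here N(0, 1)

# Monte Carlo estimate of E_real[log D_v(x)] + E_fake[log(1 - D_v(x))]
objective = np.mean(np.log(D(x_real))) + np.mean(np.log(1.0 - D(x_fake)))
# Both log terms are nonpositive, so the estimate is negative
assert objective < 0.0
```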