Rademacher复杂度

最新推荐文章于 2024-06-13 08:41:27 发布

置顶欧阳AI锋

最新推荐文章于 2024-06-13 08:41:27 发布

阅读量3.5k

点赞数

核方法(Kernel method)通常需要选取可函数及其超参数的取值。重采样(如： K-fold cross validation 和 Bootstrap)可以处理这个问题，但是通常很耗时。全局Rademacher复杂度和局部Rademacher复杂度是两个测量假设空间复杂度的量。Rademacher复杂度在多标签学习里已经有一些成果，感兴趣的读者可以自行查阅这方面的文献。

Rademacher 复杂度

它的官方定义为： $R_{n}(\mathcal{H})=\mathbb{E}\sup_{h\in \mathcal{H}}\frac{2}{n}\sum_{i=1}^{n}\sigma_{i}l(h(X_{i},Y_{i}))$ ,

这就是所谓的Global Rademacher复杂度了，它的经验版本为：

$\hat{R_{n}}(\mathcal{H})=\mathbb{E}_{\sigma}\sup_{h\in \mathcal{H}}\frac{2}{n}\sum_{i=1}^{n}\sigma_{i}l(h(X_{i},Y_{i}))$ .

以下是经典结果: 对于损失函数 $l:X\times Y \to [0,1]$ ,任意的假设 $h$ ，以下不等式以概率 $\geq 1-2e^{-x}$ 成立：

$L(h)\leq L_{n}(h)+\hat{R_{n}}(\mathcal{H})+3\sqrt{\frac{x}{2n}}$ .

而local Rademacher复杂度经典结果为：
令 $\hat{\mathcal{H}}_{r,a,x,n}=\{h|h\in \mathcal{H},\hat{L}^{2}_{n}\leq \frac{1}{\alpha^2}(3r+\sqrt{\frac{x}{2n}})\}$ , $\hat{r}_{n}$ 为子根函数

$\hat{\psi}_{n}(r)=\sup_{\alpha \in (0,1]}\alpha \hat{R}_{n}(\hat{\mathcal{H}}_{r,a,x,n})+\sqrt{\frac{2x}{n}}$

的固定点，对于损失函数 $l:X\times Y \to [0,1]$ ,任意的假设 $h$ ，以下不等式以概率 $\geq1-3e^{-x}$ 成立：

$L(h)\leq \min_{K>1}[\frac{K}{K-1}\hat{L}_{n}(h)+K\hat{r}_{n}]+2\sqrt{\frac{x}{2n}}$ .

估算Example

Example 1
$\mathcal{H}=\{h| h(X)=<W,\phi(X)>,||W||\leq H\}$ ,则
一方面，
$\hat{R_{n}}(\mathcal{H})\leq\frac{H}{n}\sqrt{\sum_{i=1}^{n}\lambda_{i}}=\frac{H}{n}\sqrt{\sum_{i=1}^{n}Q_{i,i}}$ ;
另一方面，
$\hat{R_{n}}(\mathcal{H})\geq\frac{H}{\sqrt{2}n}\sqrt{\sum_{i=1}^{n}\lambda_{i}}=\frac{H}{\sqrt{2}n}\sqrt{\sum_{i=1}^{n}Q_{i,i}}$ .

而且，当 $||\phi(X)||\leq 1$ 时, 以下不等式以概率 $\geq 1-2e^{-x}$ 成立：

$L_{H}(h)\leq L_{T,n}(h)+\frac{H}{n}\sqrt{\sum_{i=1}^{n}\lambda_{i}}+3\sqrt{\frac{x}{2n}}$ .

Example 2
$\mathcal{H}=\{h| h(X)=<W,\phi(X)>,||W||\leq 2H, ||\Phi W||^2\leq c_{2}\}$ ,则
一方面，
$\hat{R_{n}}(\mathcal{H})\leq 2H\sqrt{\sum_{i=1}^{n}\min[\frac{c_{2}}{4},\lambda_{i}]}$ ,
另一方面，
$\hat{R_{n}}(\mathcal{H})\geq c_{3}H\sqrt{\sum_{i=1}^{n}\min[\frac{c_{2}}{4},\lambda_{i}]}$ .

欧阳AI锋

关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
Rademacher复杂度

核方法(Kernel method)通常需要选取可函数及其超参数的取值。重采样(如： K-fold cross validation 和 Bootstrap)可以处理这个问题，但是通常很耗时。全局Rademacher复杂度和局部Rademacher复杂度是两个测量假设空间复杂度的量。Rademacher复杂度在多标签学习里已经有一些成果，感兴趣的读者可以自行查阅这方面的文献。Rademacher...
复制链接

扫一扫