论文阅读理解：Understanding Black-box Predictions via Influence Functions

最新推荐文章于 2023-08-30 21:42:27 发布

billy145533

最新推荐文章于 2023-08-30 21:42:27 发布

阅读量620

点赞数

分类专栏：数据科学文章标签：影响函数 influence

本文链接：https://blog.csdn.net/billy145533/article/details/107044694

版权

数据科学专栏收录该内容

38 篇文章 8 订阅

订阅专栏

文章目录

网上关于这篇论文的阅读已经不少，这篇文章主要是想简单说说影响函数的具体意义

Cook Distance

$D_i=\frac{(\hat{y}_{(-i)}-\hat{y})^T(\hat{y}_{(-i)}-\hat{y})}{ps^2}=\frac{(\hat{\theta_{(-i)}}-\hat{\theta})^TX^TX(\hat{\theta_{(-i)}}-\hat{\theta})}{ps^2}$
这里，我们已经显示给出了损失函数
$L=\sum_{i=1}^n(y_i-x_i^T\beta)^2$

$D_i$ 实际上是F分布的值，表明了删除一个样本后，对模型的影响程度

Influence Function

假设输入空间为 $\mathcal{X}$ ，输出空间为 $\mathcal{Y}$ ,假设训练数据为 $z_1,\cdots,z_n$ , $z_i=(x_i,y_i) \in \mathcal{X} \times \mathcal{Y}$
损失函数 $L(z,\theta)$ 为二次可微
$\hat{\theta}= \underset{\theta \in \Theta}{arg \ min}\sum_{i=1}^nL(z_i,\theta)$

删除样本z之后,新的模型为
$\hat{\theta}_{-z}= \underset{\theta \in \Theta}{arg \ min}\sum_{i=1,z_i \neq z}^nL(z_i,\theta)=\underset{\theta \in \Theta}{arg \ min}\frac{1}{n}\sum_{i=1}^nL(z_i,\theta)-\frac{1}{n}L(z,\theta)$

更一般的有
$\hat{\theta}_{z,\epsilon}=\underset{\theta \in \Theta}{arg \ min}\frac{1}{n}\sum_{i=1}^nL(z_i,\theta)+\epsilon L(z,\theta)$

$\epsilon=-\frac{1}{n},\hat{\theta}_{z,\epsilon}=\hat{\theta}_{-z};\epsilon=0,\hat{\theta}_{z,\epsilon}=\hat{\theta}$

令 $R(\theta,\epsilon)=\frac{1}{n}\sum_{i=1}^nL(z_i,\theta)+\epsilon L(z,\theta)$

最优的一阶条件为：
$\frac{dR(\theta,\epsilon)}{d\theta}=\frac{dR(\theta,0)}{d\theta}+\epsilon \frac{dL(z,\theta)}{d\theta}=0\\ \frac{dR(\theta,\epsilon)}{d\epsilon}=L(z,\theta)=0$

令 $f(\theta)=\Delta R(\theta,0)+\epsilon \Delta L(z,\theta)$
因为 $\epsilon\rightarrow0\Rightarrow \hat{\theta}_{z,\epsilon}\rightarrow \hat{\theta}$
$\hat{\theta}_{z,\epsilon}=\hat{\theta} +\Delta \theta$
一阶泰勒展开
$f(\hat{\theta}_{z,\epsilon})=f(\hat{\theta}+\Delta \theta)\approx f(\hat{\theta})+f'(\hat{\theta})\Delta \theta\Rightarrow\\ \Delta \theta = -f'(\hat{\theta})^{-1}f(\hat{\theta})$

$f(\hat{\theta})=\bigtriangledown R(\hat{\theta},0)+\epsilon\bigtriangledown L(z,\hat{\theta})$
由于 $\bigtriangledown R(\hat{\theta},0)=0\Rightarrow f(\hat{\theta})=\epsilon \bigtriangledown L(z,\hat{\theta})$ ，这里的意义是 $\theta$ 关于z的下降梯度
$f'(\hat{\theta})=\bigtriangledown^2 R(\hat{\theta},0)+\epsilon\bigtriangledown^2 L(z,\hat{\theta})\approx \bigtriangledown^2R(\hat{\theta},0)\equiv H_{\hat{\theta}}$

参数影响 $\mathcal{I}_{up,params}(z)$

$\Delta \theta=-\epsilon H_{\hat{\theta}}^{-1} \bigtriangledown L(z,\hat{\theta})$

删除其中一个样本时，设置的 $\epsilon$ 大小是一样的，所以，把决定 $\Delta \theta$ 的主要部分定义为影响函数。
$\mathcal{I}_{up,params}(z)= -H_{\hat{\theta}}^{-1} \bigtriangledown L(z,\hat{\theta})$ ,
所以有
$\Delta \theta = \hat{\theta}_{-z}-\hat{\theta} =-\frac{1}{n}\mathcal{I}_{up,params}(z)$
定义为样本z对参数的影响程度。至于影响是好还是坏，不好说，一般来说，对参数影响大的样本，往往是异常点。这个其实很类似一个标准的牛顿梯度下降方法，只计算第一次下降的方向。得到的结果其实参数的下降的梯度(方向+步长)。梯度乘以海赛矩阵的逆矩阵，使得下降具有二次收敛的速度。

损失影响 $\mathcal{I}_{up,loss}(z,z_{test})$

相比cook距离，这个影响指标的大小并没有什么特别意义。在模型训练中，我们最关心，还是想知道引入或者删除一个样本，是提高了还是降低了模型的精度。最基本是知道一个训练样本z对另外一个样本 $z_{test}$ 的影响。

$f(z,z_{test},\epsilon)=L(z_{test},\hat{\theta}_{z,\epsilon})-L(z_{test},\hat{\theta)}\\ \mathcal{I}_{up,loss}(z,z_{test})=\frac{df(z,z_{test},\epsilon)}{d\epsilon}=\frac{dL(z_{test},\hat{\theta}_{z,\epsilon})}{d\epsilon}\\= \bigtriangledown L(z_{test},\hat{\theta}_{z,\epsilon})^T\frac{d\hat{\theta}_{z,\epsilon}}{d\epsilon}= \bigtriangledown L(z_{test},\hat{\theta}_{z,\epsilon})^T\mathcal{I}_{up,params}(z)$

$f(z,z_{test},\epsilon)$ 表示给样本z加权后对 $z_{test}$ 预测的影响。显然，该指标大于0表示为负影响，小于0则为正影响。
$\frac{f(z,z_{test},\epsilon)}{d\epsilon}$ 表示随着权值变换，f函数的变化梯度。由于 $\hat{\theta}_{z,\epsilon}$ 由一阶泰勒展开地近似表示，因此，该梯度是不会变化的。

$f(z,z_{test},-\frac{1}{n})=-\frac{1}{n}\mathcal{I}_{up,loss}(z,z_{test})$
在n比较大的时候， $\frac{1}{n}\rightarrow0$ ，这个函数精度应该没问题，如果n没那么大，不知道还能不能近似地计算出影响力。

billy145533

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
论文阅读理解：Understanding Black-box Predictions via Influence Functions

文章目录Cook DistanceInfluence Function参数影响Iup,params(z)\mathcal{I}_{up,params}(z)Iup,params(z)损失影响Iup,loss(z,ztest)\mathcal{I}_{up,loss}(z,z_{test})Iup,loss(z,ztest)网上关于这篇论文的阅读已经不少，这篇文章主要是想简单说说影响函数的具体意义Cook DistanceDi=(y^(−i)−y^)T(y^(−i)−y^)ps2=(θ(−i)
复制链接

扫一扫