论文阅读：Auditing differentially private machine learning: How private is private SGD?

最新推荐文章于 2023-07-09 13:20:12 发布

小小咸鱼也要努力的

最新推荐文章于 2023-07-09 13:20:12 发布

阅读量348

点赞数 1

分类专栏：差分隐私学习笔记文章标签：信息安全

本文链接：https://blog.csdn.net/weixin_43641509/article/details/120372646

版权

Audit

JAGIELSKI M, ULLMAN J, OPREA A. Auditing differentially private machine learning: How private is private SGD?[C]//Advances in Neural Information Processing Systems. .
正式版论文链接
 预收论文链接（更加详细）
视频链接

Differential privacy gives a strong worst-case guarantee of individual privacy:

a differentially private algorithm ensures that, for any set of training examples, no attacker, no matter how powerful attack, can not learn much more information about a single training example than they could have learned had that example been excluded from the training data.

So, how closely can we measure privacy loss ?

here upper bounds means more privacy and smaller epsilon

lower bounds means less privacy and larger epsilon

A privacy proof will only give an upper bound for privacy level (smaller epsilon, more privacy). Improvements into the proof will get closer and closer to the true value of the privacy loss. Indeed, the analysis of the algorithm is always pessimistic, and recently theoretical analysis is not tight.

Besides, differential privacy is a worst-case notion. That is it usually provide more privacy guarantee on the realistic datasets and realistic attacks. So, privacy attacks can only get closer and close to the worst-case. That’s also what we do in the after work——construct an efficient attack.

DP-SGD

In this paper, mainly discuss the audit in DP-SGD

DP-SGD is a modifaction of SGD, DP-SGD makes two modifications to the learning process to preserve privacy: clipping gradients and adding noise.

Every iteration it take a random sample L, and for each i ∈L_t ,we compute gradient and clip it. then sum all gradient and add noise

Poisoning Attacks

目的：构造数据集 D₀, D₁

BackGround Attacks

Implement

X_p = GETRANDOMROWS(X, k)

随机选择k个x进行投毒
Pert(x)：置x的前5*5的像素点为1
y_p：置y为1

However

Clipping provides no formal privacy on its own,but many poisoning attacks perform significantly worse in the presence of clipping

The objective of attack is to decrease the loss on (x_p, y_p). That is, we need to increasing gradient at every times iteration. In traditional SGD: g_t = 1/L Σ_ig_t(x_i)

∇_wl(w · x_p+ b, y_p) =l^’(w · x_p+ b, y_p)·x_p

By doubling this quantity of gradient , if |x_p| is fixed, half as many poisoning points are required for the same effect.

However in the presence of clipping ,this relationship broken

Clipping-Aware Poisoning

the attack must produce not only large gradients,but also distinguished gradients. (That is, the distribution of gradients arising from poisoned and cleaned data must be significantly different.)

minimizing Var_(x,y)∈D[l^’(w · x_p+ b, y_p）x_p· l^’(w · x+ b, y）x]

$Var_{(x,y)\in D}[\ell^{'}(w·x_p+b,y_p)x_p · \ell^{'}(w·x+b,y)x]$

Audit

in the case of $\delta = 0$

最低0.47元/天解锁文章

小小咸鱼也要努力的

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
论文阅读：Auditing differentially private machine learning: How private is private SGD?

Audit正式版论文链接预收论文链接（更加详细）视频链接Differential privacy gives a strong worst-case guarantee of individual privacy:a differentially private algorithm ensures that, for any set of training examples, no attacker, no matter how powerful attack, can not learn mu
复制链接

扫一扫