[Paper Reading] Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models (2018)

Abstract

In recent years, deep neural network approaches have been widely adopted for machine learning tasks, including classification. However, they were shown to be vulnerable to adversarial perturbations: carefully crafted small perturbations can cause misclassification of legitimate images. We propose Defense-GAN, a new framework leveraging the expressive capability of generative models to defend deep neural networks against such attacks. Defense-GAN is trained to model the distribution of unperturbed images. At inference time, it finds a close output to a given image which does not contain the adversarial changes. This output is then fed to the classifier. Our proposed method can be used with any classification model and does not modify the classifier structure or training procedure. It can also be used as a defense against any attack, as it does not assume knowledge of the process for generating the adversarial examples. We empirically show that Defense-GAN is consistently effective against different attack methods and improves on existing defense strategies.

Method

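The inference step described in the abstract — finding a generator output close to the (possibly perturbed) input before classification — amounts to minimizing ||G(z) − x||² over the latent vector z by gradient descent, typically from several random starts. Below is a minimal PyTorch sketch of that projection step; the `generator` interface, `latent_dim`, and the restart/step/learning-rate values are illustrative assumptions, not the paper's exact settings.

```python
import torch

def defense_gan_reconstruct(x, generator, latent_dim,
                            n_restarts=10, n_steps=200, lr=0.01):
    """Project x onto the range of the generator: min_z ||G(z) - x||^2.

    Returns G(z*) for the best latent found; this reconstruction lies on
    the learned manifold of clean images and is what the classifier sees.
    """
    best_z, best_loss = None, float("inf")
    for _ in range(n_restarts):                    # random restarts over z
        z = torch.randn(1, latent_dim, requires_grad=True)
        opt = torch.optim.SGD([z], lr=lr)
        for _ in range(n_steps):                   # gradient descent on z only;
            opt.zero_grad()                        # the generator stays frozen
            loss = ((generator(z) - x) ** 2).sum()
            loss.backward()
            opt.step()
        with torch.no_grad():                      # score the final z
            final_loss = ((generator(z) - x) ** 2).sum().item()
        if final_loss < best_loss:                 # keep closest reconstruction
            best_loss, best_z = final_loss, z.detach()
    with torch.no_grad():
        return generator(best_z)
```

In use, the classifier never sees the raw input: `classifier(defense_gan_reconstruct(x, G, latent_dim))` replaces `classifier(x)`. Because this purification step is independent of both the classifier and the attack, it supports the abstract's claim that Defense-GAN works with any classification model and against any attack.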

Paper link

Defense-GAN: Protecting Classifiers Against Adversarial Attacks Using Generative Models
