NFDI4DS | UHH-SEMS - Publication Details

A novel and universal GAN-based countermeasure to recover adversarial examples to benign examples

0202 electrical engineering, electronic engineering, information engineering
02 engineering and technology

DOI: 10.1016/j.cose.2021.102457
Publication Date: 2021-09-04T22:51:21Z


AUTHORS (4)

Rui Yang

Tian-Jie Cao

Xiu-Qing Chen

Feng-Rong Zhang

ABSTRACT

Recent studies have demonstrated that deep neural networks (DNNs) are vulnerable to adversarial examples, which contain subtle, human-imperceptible perturbations. Although numerous countermeasures have been proposed and play a significant role, most have flaws and are effective only against certain types of adversarial examples. In this paper, we present a novel and universal countermeasure that recovers multiple types of adversarial examples to benign examples before they are fed into the deep neural network. The idea is to model the mapping between adversarial examples and benign examples using a generative adversarial network (GAN). The GAN architecture consists of a generator based on UNET, a discriminator based on ACGAN, and a newly added third-party classifier. The UNET enhances the generator's capacity to recover adversarial examples to benign examples. The loss function combines the advantages of ACGAN and WGAN-GP to stabilize the training process and accelerate its convergence. In addition, a classification loss and a perceptual loss, both computed from the third-party classifier, further improve the generator's capacity to eliminate adversarial perturbations. Experiments are conducted on the MNIST, CIFAR10, and IMAGENET datasets. First, we perform ablation experiments to validate the proposed countermeasure. Then, we defend against seven types of state-of-the-art adversarial examples on four deep neural networks and compare the results with six existing countermeasures. The experimental results demonstrate that the proposed countermeasure is universal and outperforms the other countermeasures. The experimental code is available at https://github.com/Afreadyang/IAED-GAN.
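The abstract describes a generator objective that combines an adversarial term with a classification loss and a perceptual loss from the third-party classifier. The following is a minimal sketch of how such a combined objective could be assembled, not the authors' implementation. It assumes a WGAN-style adversarial term (the negated critic score), a cross-entropy classification loss, and a mean-squared feature distance as the perceptual loss. The function names and the weights `lambda_cls` and `lambda_perc` are hypothetical, and the discriminator-side terms (gradient penalty, auxiliary classifier) are omitted.

```python
import math


def classification_loss(class_probs, true_label):
    """Cross-entropy of the classifier's predicted probabilities
    for the recovered example, evaluated at the true label."""
    return -math.log(class_probs[true_label])


def perceptual_loss(feat_recovered, feat_benign):
    """Mean squared distance between classifier feature vectors of the
    recovered example and the benign example (assumed form)."""
    n = len(feat_benign)
    return sum((r - b) ** 2 for r, b in zip(feat_recovered, feat_benign)) / n


def generator_loss(critic_score, class_probs, true_label,
                   feat_recovered, feat_benign,
                   lambda_cls=1.0, lambda_perc=1.0):
    """Weighted sum of the three terms. The weights are hypothetical
    placeholders, not values taken from the paper."""
    # WGAN-style term: the generator wants the critic score to be high.
    adversarial = -critic_score
    cls = classification_loss(class_probs, true_label)
    perc = perceptual_loss(feat_recovered, feat_benign)
    return adversarial + lambda_cls * cls + lambda_perc * perc


# Toy values for one recovered example (illustrative only).
loss = generator_loss(
    critic_score=0.5,
    class_probs=[0.1, 0.8, 0.1],
    true_label=1,
    feat_recovered=[1.0, 2.0],
    feat_benign=[1.0, 2.0],
)
print(loss)
```

In this toy call the perceptual term is zero because the two feature vectors match, so the result is just the adversarial term plus the cross-entropy. A real training loop would compute these quantities on batches with an automatic-differentiation framework.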


REFERENCES (51)

CITATIONS (5)


