eScholarship: Open Access Publications from the University of California
UCLA Electronic Theses and Dissertations
Exploiting Human Perception for Adversarial Attacks

Abstract

There has been a significant amount of recent work on fooling deep-learning-based classifiers, particularly for images, via adversarial inputs that are perceptually similar to benign examples. However, researchers typically use minimization of the $L_p$-norm as a proxy for imperceptibility, an approach that oversimplifies the complexity of real-world images and human visual perception. We exploit the relationship between image features and human perception to propose a Perceptual Loss (PL) metric that better captures human imperceptibility during the generation of adversarial images. By focusing on human-perceptible distortion of image features, the metric yields adversarial images of better visual quality, as our experiments validate. Our results also demonstrate the effectiveness and efficiency of our algorithm.
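To make the contrast with $L_p$-norm-constrained attacks concrete, the sketch below shows one generic way a feature-based perceptual penalty can replace an $L_p$ budget when generating adversarial examples. This is an illustrative sketch only, not the thesis's PL metric or algorithm: the feature extractor, the trade-off weight alpha, the optimizer, and the step counts are all hypothetical choices for demonstration.

# Illustrative sketch: gradient-based adversarial example generation where the
# distortion budget is a (hypothetical) feature-space perceptual penalty rather
# than an L_p-norm projection. Not the thesis's Perceptual Loss (PL) metric.
import torch
import torch.nn.functional as F

def perceptual_distance(feat_net, x, x_adv):
    # Hypothetical stand-in for a perceptual metric: distance between deep
    # features of the clean and candidate adversarial images.
    return F.mse_loss(feat_net(x_adv), feat_net(x))

def perceptual_attack(classifier, feat_net, x, y, steps=40, lr=0.01, alpha=10.0):
    # Untargeted attack: increase the classification loss while penalizing
    # perceptual distortion, instead of clipping the perturbation to an L_p ball.
    x_adv = x.clone().detach().requires_grad_(True)
    optimizer = torch.optim.Adam([x_adv], lr=lr)
    for _ in range(steps):
        optimizer.zero_grad()
        logits = classifier(x_adv)
        loss = -F.cross_entropy(logits, y) \
               + alpha * perceptual_distance(feat_net, x, x_adv)
        loss.backward()
        optimizer.step()
        with torch.no_grad():
            x_adv.clamp_(0.0, 1.0)  # keep pixel values in a valid range
    return x_adv.detach()

# Example usage (assumed pretrained models, e.g. from torchvision):
#   classifier = torchvision.models.resnet18(weights="IMAGENET1K_V1").eval()
#   feat_net = torchvision.models.vgg16(weights="IMAGENET1K_V1").features[:16].eval()
#   x_adv = perceptual_attack(classifier, feat_net, images, labels)

The design point the sketch illustrates is that the penalty term is computed in a feature space intended to correlate with human perception, so the optimizer is free to make pixel-level changes that a human would not notice, rather than being limited to small $L_p$-norm perturbations.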
