🌙

潜在模型的对抗鲁棒性:重新审视鲁棒性与标准准确率的权衡

Adversarial Robustness for Latent Models: Revisiting the Robust-Standard Accuracies Tradeoff

Operations Research · 2023
被引 0
人大 AFT50UTD24ABS 4*

中文导读

研究证明当数据具有低维结构时,可以训练出在标准准确率和对抗鲁棒性上都接近最优的模型,解决了二者冲突的问题。

Abstract

Low-dimensional structure of data can solve the adversarial robustness-accuracy conflict for machine learning systems. Modern machine learning systems have demonstrated breakthrough performance in a multitude of applications. However, they are known to be highly vulnerable to small perturbations to the input data, known as adversarial attacks. There are many well-documented examples of such behavior, for example small perturbations of an image, which is imperceptible to a human, can significantly degrade performance of modern classifiers. Adversarial training has been put forward as a way to improve robustness of learning algorithms to adversarial attacks. However, this benefit often comes at the cost of decreasing accuracy on natural unperturbed inputs, pointing to a potential conflict between adversarial robustness and standard accuracy. In “Adversarial robustness for latent models: Revisiting the robust-standard accuracies tradeoff,” Adel Javanmard and Mohammad Mehrabi develop a theory to show that when the data enjoys low-dimensional structure, then it is possible to train models that are nearly optimal with respect to both, the standard and robust accuracies.

机器学习对抗攻击鲁棒性数据低维结构