Understanding and Combating Robust Overfitting via Input Loss Landscape Analysis and Regularization

Lin Li; Michael Spratling

doi:10.1016/j.patcog.2022.109229

Understanding and Combating Robust Overfitting via Input Loss Landscape Analysis and Regularization

Lin Li, Michael Spratling

Research output: Contribution to journal › Article › peer-review

21 Citations (Scopus)

69 Downloads (Pure)

Abstract

Adversarial training is widely used to improve the robustness of deep neural networks to adversarial attack. However, adversarial training is prone to overfitting, and the cause is far from clear. This work sheds light on the mechanisms underlying overfitting through analyzing the loss landscape w.r.t. the input. We find that robust overfitting results from standard training, specifically the minimization of the clean loss, and can be mitigated by regularization of the loss gradients. Moreover, we find that robust overfitting turns severer during adversarial training partially because the gradient regularization effect of adversarial training becomes weaker due to the increase in the loss landscapes curvature. To improve robust generalization, we propose a new regularizer to smooth the loss landscape by penalizing the weighted logits variation along the adversarial direction. Our method significantly mitigates robust overfitting and achieves the highest robustness and efficiency compared to similar previous methods. Code is available at https://github.com/TreeLLi/Combating-RO-AdvLC.

Original language	English
Article number	109229
Journal	PATTERN RECOGNITION
Volume	136
Early online date	8 Dec 2022
DOIs	https://doi.org/10.1016/j.patcog.2022.109229
Publication status	Published - Apr 2023

Keywords

cs.LG

Access to Document

10.1016/j.patcog.2022.109229Licence: CC BY-NC-ND

Understanding and combating_LI_Publishedonline8December2022_GOLD VoR (CC BY-NC-ND)Final published version, 1.65 MBLicence: CC BY-NC-ND

Cite this

@article{05fdf35e4ee64c7e9f9357f38952ff63,

title = "Understanding and Combating Robust Overfitting via Input Loss Landscape Analysis and Regularization",

abstract = "Adversarial training is widely used to improve the robustness of deep neural networks to adversarial attack. However, adversarial training is prone to overfitting, and the cause is far from clear. This work sheds light on the mechanisms underlying overfitting through analyzing the loss landscape w.r.t. the input. We find that robust overfitting results from standard training, specifically the minimization of the clean loss, and can be mitigated by regularization of the loss gradients. Moreover, we find that robust overfitting turns severer during adversarial training partially because the gradient regularization effect of adversarial training becomes weaker due to the increase in the loss landscapes curvature. To improve robust generalization, we propose a new regularizer to smooth the loss landscape by penalizing the weighted logits variation along the adversarial direction. Our method significantly mitigates robust overfitting and achieves the highest robustness and efficiency compared to similar previous methods. Code is available at https://github.com/TreeLLi/Combating-RO-AdvLC. ",

keywords = "cs.LG",

author = "Lin Li and Michael Spratling",

note = "Funding Information: The authors acknowledge the use of the research computing facility at King{\textquoteright}s College London, King{\textquoteright}s Computational Research, Engineering and Technology Environment (CREATE), and the Joint Academic Data science Endeavour (JADE) facility. This research was funded by the King{\textquoteright}s - China Scholarship Council (K-CSC). Funding Information: The authors acknowledge the use of the research computing facility at King's College London, King's Computational Research, Engineering and Technology Environment (CREATE), and the Joint Academic Data science Endeavour (JADE) facility. This research was funded by the King's - China Scholarship Council (K-CSC). Publisher Copyright: {\textcopyright} 2022 The Author(s)",

year = "2023",

month = apr,

doi = "10.1016/j.patcog.2022.109229",

language = "English",

volume = "136",

journal = "PATTERN RECOGNITION",

issn = "0031-3203",

publisher = "Elsevier Limited",

}

TY - JOUR

T1 - Understanding and Combating Robust Overfitting via Input Loss Landscape Analysis and Regularization

AU - Li, Lin

AU - Spratling, Michael

N1 - Funding Information: The authors acknowledge the use of the research computing facility at King’s College London, King’s Computational Research, Engineering and Technology Environment (CREATE), and the Joint Academic Data science Endeavour (JADE) facility. This research was funded by the King’s - China Scholarship Council (K-CSC). Funding Information: The authors acknowledge the use of the research computing facility at King's College London, King's Computational Research, Engineering and Technology Environment (CREATE), and the Joint Academic Data science Endeavour (JADE) facility. This research was funded by the King's - China Scholarship Council (K-CSC). Publisher Copyright: © 2022 The Author(s)

PY - 2023/4

Y1 - 2023/4

N2 - Adversarial training is widely used to improve the robustness of deep neural networks to adversarial attack. However, adversarial training is prone to overfitting, and the cause is far from clear. This work sheds light on the mechanisms underlying overfitting through analyzing the loss landscape w.r.t. the input. We find that robust overfitting results from standard training, specifically the minimization of the clean loss, and can be mitigated by regularization of the loss gradients. Moreover, we find that robust overfitting turns severer during adversarial training partially because the gradient regularization effect of adversarial training becomes weaker due to the increase in the loss landscapes curvature. To improve robust generalization, we propose a new regularizer to smooth the loss landscape by penalizing the weighted logits variation along the adversarial direction. Our method significantly mitigates robust overfitting and achieves the highest robustness and efficiency compared to similar previous methods. Code is available at https://github.com/TreeLLi/Combating-RO-AdvLC.

AB - Adversarial training is widely used to improve the robustness of deep neural networks to adversarial attack. However, adversarial training is prone to overfitting, and the cause is far from clear. This work sheds light on the mechanisms underlying overfitting through analyzing the loss landscape w.r.t. the input. We find that robust overfitting results from standard training, specifically the minimization of the clean loss, and can be mitigated by regularization of the loss gradients. Moreover, we find that robust overfitting turns severer during adversarial training partially because the gradient regularization effect of adversarial training becomes weaker due to the increase in the loss landscapes curvature. To improve robust generalization, we propose a new regularizer to smooth the loss landscape by penalizing the weighted logits variation along the adversarial direction. Our method significantly mitigates robust overfitting and achieves the highest robustness and efficiency compared to similar previous methods. Code is available at https://github.com/TreeLLi/Combating-RO-AdvLC.

KW - cs.LG

UR - http://www.scopus.com/inward/record.url?scp=85145582255&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2022.109229

DO - 10.1016/j.patcog.2022.109229

M3 - Article

SN - 0031-3203

VL - 136

JO - PATTERN RECOGNITION

JF - PATTERN RECOGNITION

M1 - 109229

ER -

Understanding and Combating Robust Overfitting via Input Loss Landscape Analysis and Regularization

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this