Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense

Escalating advancements in artificial intelligence (AI) has prompted significant security concerns, especially with its increasing commercialization. This necessitates research on safety measures to securely utilize AI models. Existing AI models are vulnerable to adversarial attacks, which are a spe...

Full description

Saved in:

Bibliographic Details
Main Authors:	Inpyo Hong, Sokjoon Lee
Format:	Article
Language:	English
Published:	MDPI AG 2024-11-01
Series:	Applied Sciences
Subjects:	adversarial attack adversarial robustness knowledge distillation feature denoising
Online Access:	https://www.mdpi.com/2076-3417/14/23/10872
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1846124490258382848
author	Inpyo Hong Sokjoon Lee
author_facet	Inpyo Hong Sokjoon Lee
author_sort	Inpyo Hong
collection	DOAJ
description	Escalating advancements in artificial intelligence (AI) has prompted significant security concerns, especially with its increasing commercialization. This necessitates research on safety measures to securely utilize AI models. Existing AI models are vulnerable to adversarial attacks, which are a specific form of assault methodology. Although various countermeasures have been explored, practical defense models are scarce. Current adversarial defense methods suffer from reduced accuracy, increased training time, and incomplete defense against adversarial attacks, indicating performance limitations and a lack of robustness. To address these limitations, we propose a composite defense model, the knowledge Distillation and deNoising Network (DiNo-Net), which integrates knowledge distillation and feature denoising techniques. Furthermore, we analyzed a correlation between the loss surface of adversarial perturbations and denoising techniques. Using DiNo-Net, we confirmed that increasing the temperature during the knowledge distillation process effectively amplifies the loss surface around the ground truth. Consequently, this enables more efficient denoising of the adversarial perturbations. It achieved a defense success rate of 72.7%, which is a remarkable improvement over the 41.0% success rate of models with only denoising defense mechanisms. Furthermore, DiNo-Net reduced the training time and maintained higher accuracy, confirming its efficient defense performance. We hope that this relationship will spur the development of fundamental defense strategies.
format	Article
id	doaj-art-2a06d51d6b6f41d5adf562408aa5a859
institution	Kabale University
issn	2076-3417
language	English
publishDate	2024-11-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj-art-2a06d51d6b6f41d5adf562408aa5a8592024-12-13T16:22:01ZengMDPI AGApplied Sciences2076-34172024-11-0114231087210.3390/app142310872Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial DefenseInpyo Hong0Sokjoon Lee1Department of Computer Science, Yonsei University, Seoul 03722, Republic of KoreaDepartment of Computer Engineering, Gachon University, Seongnam 13120, Republic of KoreaEscalating advancements in artificial intelligence (AI) has prompted significant security concerns, especially with its increasing commercialization. This necessitates research on safety measures to securely utilize AI models. Existing AI models are vulnerable to adversarial attacks, which are a specific form of assault methodology. Although various countermeasures have been explored, practical defense models are scarce. Current adversarial defense methods suffer from reduced accuracy, increased training time, and incomplete defense against adversarial attacks, indicating performance limitations and a lack of robustness. To address these limitations, we propose a composite defense model, the knowledge Distillation and deNoising Network (DiNo-Net), which integrates knowledge distillation and feature denoising techniques. Furthermore, we analyzed a correlation between the loss surface of adversarial perturbations and denoising techniques. Using DiNo-Net, we confirmed that increasing the temperature during the knowledge distillation process effectively amplifies the loss surface around the ground truth. Consequently, this enables more efficient denoising of the adversarial perturbations. It achieved a defense success rate of 72.7%, which is a remarkable improvement over the 41.0% success rate of models with only denoising defense mechanisms. Furthermore, DiNo-Net reduced the training time and maintained higher accuracy, confirming its efficient defense performance. We hope that this relationship will spur the development of fundamental defense strategies.https://www.mdpi.com/2076-3417/14/23/10872adversarial attackadversarial robustnessknowledge distillationfeature denoising
spellingShingle	Inpyo Hong Sokjoon Lee Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense Applied Sciences adversarial attack adversarial robustness knowledge distillation feature denoising
title	Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_full	Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_fullStr	Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_full_unstemmed	Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_short	Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_sort	exploring synergy of denoising and distillation novel method for efficient adversarial defense
topic	adversarial attack adversarial robustness knowledge distillation feature denoising
url	https://www.mdpi.com/2076-3417/14/23/10872
work_keys_str_mv	AT inpyohong exploringsynergyofdenoisinganddistillationnovelmethodforefficientadversarialdefense AT sokjoonlee exploringsynergyofdenoisinganddistillationnovelmethodforefficientadversarialdefense

Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense

Similar Items