Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense

Escalating advancements in artificial intelligence (AI) has prompted significant security concerns, especially with its increasing commercialization. This necessitates research on safety measures to securely utilize AI models. Existing AI models are vulnerable to adversarial attacks, which are a spe...

Full description

Saved in:
Bibliographic Details
Main Authors: Inpyo Hong, Sokjoon Lee
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/14/23/10872
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846124490258382848
author Inpyo Hong
Sokjoon Lee
author_facet Inpyo Hong
Sokjoon Lee
author_sort Inpyo Hong
collection DOAJ
description Escalating advancements in artificial intelligence (AI) has prompted significant security concerns, especially with its increasing commercialization. This necessitates research on safety measures to securely utilize AI models. Existing AI models are vulnerable to adversarial attacks, which are a specific form of assault methodology. Although various countermeasures have been explored, practical defense models are scarce. Current adversarial defense methods suffer from reduced accuracy, increased training time, and incomplete defense against adversarial attacks, indicating performance limitations and a lack of robustness. To address these limitations, we propose a composite defense model, the knowledge Distillation and deNoising Network (DiNo-Net), which integrates knowledge distillation and feature denoising techniques. Furthermore, we analyzed a correlation between the loss surface of adversarial perturbations and denoising techniques. Using DiNo-Net, we confirmed that increasing the temperature during the knowledge distillation process effectively amplifies the loss surface around the ground truth. Consequently, this enables more efficient denoising of the adversarial perturbations. It achieved a defense success rate of 72.7%, which is a remarkable improvement over the 41.0% success rate of models with only denoising defense mechanisms. Furthermore, DiNo-Net reduced the training time and maintained higher accuracy, confirming its efficient defense performance. We hope that this relationship will spur the development of fundamental defense strategies.
format Article
id doaj-art-2a06d51d6b6f41d5adf562408aa5a859
institution Kabale University
issn 2076-3417
language English
publishDate 2024-11-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-2a06d51d6b6f41d5adf562408aa5a8592024-12-13T16:22:01ZengMDPI AGApplied Sciences2076-34172024-11-0114231087210.3390/app142310872Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial DefenseInpyo Hong0Sokjoon Lee1Department of Computer Science, Yonsei University, Seoul 03722, Republic of KoreaDepartment of Computer Engineering, Gachon University, Seongnam 13120, Republic of KoreaEscalating advancements in artificial intelligence (AI) has prompted significant security concerns, especially with its increasing commercialization. This necessitates research on safety measures to securely utilize AI models. Existing AI models are vulnerable to adversarial attacks, which are a specific form of assault methodology. Although various countermeasures have been explored, practical defense models are scarce. Current adversarial defense methods suffer from reduced accuracy, increased training time, and incomplete defense against adversarial attacks, indicating performance limitations and a lack of robustness. To address these limitations, we propose a composite defense model, the knowledge Distillation and deNoising Network (DiNo-Net), which integrates knowledge distillation and feature denoising techniques. Furthermore, we analyzed a correlation between the loss surface of adversarial perturbations and denoising techniques. Using DiNo-Net, we confirmed that increasing the temperature during the knowledge distillation process effectively amplifies the loss surface around the ground truth. Consequently, this enables more efficient denoising of the adversarial perturbations. It achieved a defense success rate of 72.7%, which is a remarkable improvement over the 41.0% success rate of models with only denoising defense mechanisms. Furthermore, DiNo-Net reduced the training time and maintained higher accuracy, confirming its efficient defense performance. We hope that this relationship will spur the development of fundamental defense strategies.https://www.mdpi.com/2076-3417/14/23/10872adversarial attackadversarial robustnessknowledge distillationfeature denoising
spellingShingle Inpyo Hong
Sokjoon Lee
Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
Applied Sciences
adversarial attack
adversarial robustness
knowledge distillation
feature denoising
title Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_full Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_fullStr Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_full_unstemmed Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_short Exploring Synergy of Denoising and Distillation: Novel Method for Efficient Adversarial Defense
title_sort exploring synergy of denoising and distillation novel method for efficient adversarial defense
topic adversarial attack
adversarial robustness
knowledge distillation
feature denoising
url https://www.mdpi.com/2076-3417/14/23/10872
work_keys_str_mv AT inpyohong exploringsynergyofdenoisinganddistillationnovelmethodforefficientadversarialdefense
AT sokjoonlee exploringsynergyofdenoisinganddistillationnovelmethodforefficientadversarialdefense