Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning

Abstract Malware classification has been successful in utilizing machine learning methods. However, it is limited by the reliance on a large number of high-quality labeled datasets and the issue of overfitting. These limitations hinder the accurate classification of advanced malware with only a few...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yulong Ji, Kunjin Zou, Bin Zou
Format:	Article
Language:	English
Published:	SpringerOpen 2024-11-01
Series:	Cybersecurity
Subjects:	Malware classification Few-shot learning Meta-learning Data augmentation
Online Access:	https://doi.org/10.1186/s42400-024-00314-9
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1846147690636771328
author	Yulong Ji Kunjin Zou Bin Zou
author_facet	Yulong Ji Kunjin Zou Bin Zou
author_sort	Yulong Ji
collection	DOAJ
description	Abstract Malware classification has been successful in utilizing machine learning methods. However, it is limited by the reliance on a large number of high-quality labeled datasets and the issue of overfitting. These limitations hinder the accurate classification of advanced malware with only a few samples available. Meta-learning methods offer a solution by allowing models to quickly adapt to new tasks, even with a small number of samples. However, the effectiveness of meta-learning approaches in malware classification varies due to the diverse nature of malware types. Most meta-learning-based methodologies for malware classification either focus solely on data augmentation or utilize existing neural networks and learning rate schedules to adapt to the meta-learning model. These approaches do not consider the integration of both processes or tailor the neural network and learning rate schedules to the specific task. As a result, the classification performance and generalization capabilities are suboptimal. In this paper, we propose a multi-improved model-agnostic meta-learning (MI-MAML) model that aims to address the challenges encountered in few-shot malware classification. Specifically, we propose two data augmentation techniques to improve the classification performance of few-shot malware. These techniques involve utilizing grayscale images and the Lab color space. Additionally, we customize neural network architectures and learning rate schemes based on the representative few-shot classification method, MAML, to further enhance the model’s classification performance and generalization ability for the task of few-shot malware classification. The results obtained from multiple few-shot malware datasets demonstrate that MI-MAML outperforms other models in terms of categorical accuracy, precision, and f1-score. Furthermore, we have conducted ablation experiments to validate the effectiveness of each stage of our work.
format	Article
id	doaj-art-a734a570de5544f5a5eadd8ca9c010b6
institution	Kabale University
issn	2523-3246
language	English
publishDate	2024-11-01
publisher	SpringerOpen
record_format	Article
series	Cybersecurity
spelling	doaj-art-a734a570de5544f5a5eadd8ca9c010b62024-12-01T12:32:02ZengSpringerOpenCybersecurity2523-32462024-11-017111910.1186/s42400-024-00314-9Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learningYulong Ji0Kunjin Zou1Bin Zou2School of Cyber Science and Technology, Hubei UniversityManchester Metropolitan Joint Institute, Hubei UniversitySchool of Mathematics and Statistic, Hubei Key Laboratory of Applied Mathematics, Hubei UniversityAbstract Malware classification has been successful in utilizing machine learning methods. However, it is limited by the reliance on a large number of high-quality labeled datasets and the issue of overfitting. These limitations hinder the accurate classification of advanced malware with only a few samples available. Meta-learning methods offer a solution by allowing models to quickly adapt to new tasks, even with a small number of samples. However, the effectiveness of meta-learning approaches in malware classification varies due to the diverse nature of malware types. Most meta-learning-based methodologies for malware classification either focus solely on data augmentation or utilize existing neural networks and learning rate schedules to adapt to the meta-learning model. These approaches do not consider the integration of both processes or tailor the neural network and learning rate schedules to the specific task. As a result, the classification performance and generalization capabilities are suboptimal. In this paper, we propose a multi-improved model-agnostic meta-learning (MI-MAML) model that aims to address the challenges encountered in few-shot malware classification. Specifically, we propose two data augmentation techniques to improve the classification performance of few-shot malware. These techniques involve utilizing grayscale images and the Lab color space. Additionally, we customize neural network architectures and learning rate schemes based on the representative few-shot classification method, MAML, to further enhance the model’s classification performance and generalization ability for the task of few-shot malware classification. The results obtained from multiple few-shot malware datasets demonstrate that MI-MAML outperforms other models in terms of categorical accuracy, precision, and f1-score. Furthermore, we have conducted ablation experiments to validate the effectiveness of each stage of our work.https://doi.org/10.1186/s42400-024-00314-9Malware classificationFew-shot learningMeta-learningData augmentation
spellingShingle	Yulong Ji Kunjin Zou Bin Zou Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning Cybersecurity Malware classification Few-shot learning Meta-learning Data augmentation
title	Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_full	Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_fullStr	Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_full_unstemmed	Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_short	Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_sort	mi maml classifying few shot advanced malware using multi improved model agnostic meta learning
topic	Malware classification Few-shot learning Meta-learning Data augmentation
url	https://doi.org/10.1186/s42400-024-00314-9
work_keys_str_mv	AT yulongji mimamlclassifyingfewshotadvancedmalwareusingmultiimprovedmodelagnosticmetalearning AT kunjinzou mimamlclassifyingfewshotadvancedmalwareusingmultiimprovedmodelagnosticmetalearning AT binzou mimamlclassifyingfewshotadvancedmalwareusingmultiimprovedmodelagnosticmetalearning

Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning

Similar Items