Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning

Abstract Malware classification has been successful in utilizing machine learning methods. However, it is limited by the reliance on a large number of high-quality labeled datasets and the issue of overfitting. These limitations hinder the accurate classification of advanced malware with only a few...

Full description

Saved in:
Bibliographic Details
Main Authors: Yulong Ji, Kunjin Zou, Bin Zou
Format: Article
Language:English
Published: SpringerOpen 2024-11-01
Series:Cybersecurity
Subjects:
Online Access:https://doi.org/10.1186/s42400-024-00314-9
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846147690636771328
author Yulong Ji
Kunjin Zou
Bin Zou
author_facet Yulong Ji
Kunjin Zou
Bin Zou
author_sort Yulong Ji
collection DOAJ
description Abstract Malware classification has been successful in utilizing machine learning methods. However, it is limited by the reliance on a large number of high-quality labeled datasets and the issue of overfitting. These limitations hinder the accurate classification of advanced malware with only a few samples available. Meta-learning methods offer a solution by allowing models to quickly adapt to new tasks, even with a small number of samples. However, the effectiveness of meta-learning approaches in malware classification varies due to the diverse nature of malware types. Most meta-learning-based methodologies for malware classification either focus solely on data augmentation or utilize existing neural networks and learning rate schedules to adapt to the meta-learning model. These approaches do not consider the integration of both processes or tailor the neural network and learning rate schedules to the specific task. As a result, the classification performance and generalization capabilities are suboptimal. In this paper, we propose a multi-improved model-agnostic meta-learning (MI-MAML) model that aims to address the challenges encountered in few-shot malware classification. Specifically, we propose two data augmentation techniques to improve the classification performance of few-shot malware. These techniques involve utilizing grayscale images and the Lab color space. Additionally, we customize neural network architectures and learning rate schemes based on the representative few-shot classification method, MAML, to further enhance the model’s classification performance and generalization ability for the task of few-shot malware classification. The results obtained from multiple few-shot malware datasets demonstrate that MI-MAML outperforms other models in terms of categorical accuracy, precision, and f1-score. Furthermore, we have conducted ablation experiments to validate the effectiveness of each stage of our work.
format Article
id doaj-art-a734a570de5544f5a5eadd8ca9c010b6
institution Kabale University
issn 2523-3246
language English
publishDate 2024-11-01
publisher SpringerOpen
record_format Article
series Cybersecurity
spelling doaj-art-a734a570de5544f5a5eadd8ca9c010b62024-12-01T12:32:02ZengSpringerOpenCybersecurity2523-32462024-11-017111910.1186/s42400-024-00314-9Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learningYulong Ji0Kunjin Zou1Bin Zou2School of Cyber Science and Technology, Hubei UniversityManchester Metropolitan Joint Institute, Hubei UniversitySchool of Mathematics and Statistic, Hubei Key Laboratory of Applied Mathematics, Hubei UniversityAbstract Malware classification has been successful in utilizing machine learning methods. However, it is limited by the reliance on a large number of high-quality labeled datasets and the issue of overfitting. These limitations hinder the accurate classification of advanced malware with only a few samples available. Meta-learning methods offer a solution by allowing models to quickly adapt to new tasks, even with a small number of samples. However, the effectiveness of meta-learning approaches in malware classification varies due to the diverse nature of malware types. Most meta-learning-based methodologies for malware classification either focus solely on data augmentation or utilize existing neural networks and learning rate schedules to adapt to the meta-learning model. These approaches do not consider the integration of both processes or tailor the neural network and learning rate schedules to the specific task. As a result, the classification performance and generalization capabilities are suboptimal. In this paper, we propose a multi-improved model-agnostic meta-learning (MI-MAML) model that aims to address the challenges encountered in few-shot malware classification. Specifically, we propose two data augmentation techniques to improve the classification performance of few-shot malware. These techniques involve utilizing grayscale images and the Lab color space. Additionally, we customize neural network architectures and learning rate schemes based on the representative few-shot classification method, MAML, to further enhance the model’s classification performance and generalization ability for the task of few-shot malware classification. The results obtained from multiple few-shot malware datasets demonstrate that MI-MAML outperforms other models in terms of categorical accuracy, precision, and f1-score. Furthermore, we have conducted ablation experiments to validate the effectiveness of each stage of our work.https://doi.org/10.1186/s42400-024-00314-9Malware classificationFew-shot learningMeta-learningData augmentation
spellingShingle Yulong Ji
Kunjin Zou
Bin Zou
Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
Cybersecurity
Malware classification
Few-shot learning
Meta-learning
Data augmentation
title Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_full Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_fullStr Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_full_unstemmed Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_short Mi-maml: classifying few-shot advanced malware using multi-improved model-agnostic meta-learning
title_sort mi maml classifying few shot advanced malware using multi improved model agnostic meta learning
topic Malware classification
Few-shot learning
Meta-learning
Data augmentation
url https://doi.org/10.1186/s42400-024-00314-9
work_keys_str_mv AT yulongji mimamlclassifyingfewshotadvancedmalwareusingmultiimprovedmodelagnosticmetalearning
AT kunjinzou mimamlclassifyingfewshotadvancedmalwareusingmultiimprovedmodelagnosticmetalearning
AT binzou mimamlclassifyingfewshotadvancedmalwareusingmultiimprovedmodelagnosticmetalearning