An Efficient Random Forest Classifier for Detecting Malicious Docker Images in Docker Hub Repository

The number of exploits of Docker images involving the injection of adversarial behaviors into the image’s layers is increasing immensely. Docker images are a fundamental component of Docker. Therefore, developing a machine learning classifier that effectively predicts and classifies wheth...

Full description

Saved in:

Bibliographic Details
Main Authors:	Maram Aldiabat, Qussai M. Yaseen, Qusai Abu Ein
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	Docker Docker images Docker containers machine learning malware detection
Online Access:	https://ieeexplore.ieee.org/document/10768874/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1846123615571935232
author	Maram Aldiabat Qussai M. Yaseen Qusai Abu Ein
author_facet	Maram Aldiabat Qussai M. Yaseen Qusai Abu Ein
author_sort	Maram Aldiabat
collection	DOAJ
description	The number of exploits of Docker images involving the injection of adversarial behaviors into the image’s layers is increasing immensely. Docker images are a fundamental component of Docker. Therefore, developing a machine learning classifier that effectively predicts and classifies whether a Docker image contains injected malicious behaviors is crucial as a proactive approach. This paper proposes a machine learning model to assess the feasibility of employing machine learning algorithms for detecting the security status of Docker images available in the Docker Hub repository. The paper develops a machine learning model for detecting malicious Docker images by using a newly created dataset containing Docker images associated with 14 corresponding features that were specifically chosen as they are critical indicators of potential security risks in Docker images, and the dataset was published for research purposes. Moreover, the paper developed and tested several machine learning algorithms using Docker image features: Naïve Bayes, Decision Tree, Random Forest, Gradient Boosting, Extreme Gradient Boosting, and Neural Network. The results show that the Random Forest classifier demonstrates exceptional accuracy, achieving a 99% F1-score and an AUC of 100%. This performance refers to its capability to accurately classify the images and effectively distinguish between secure and insecure images, in addition to the minimal error rate of less than 1%, outperforming state-of-the-art models to identify malicious Docker images.
format	Article
id	doaj-art-2be9e1ca7011433bb9cb16d1f4ee47c2
institution	Kabale University
issn	2169-3536
language	English
publishDate	2024-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-2be9e1ca7011433bb9cb16d1f4ee47c22024-12-14T00:01:28ZengIEEEIEEE Access2169-35362024-01-011218558618560410.1109/ACCESS.2024.350666310768874An Efficient Random Forest Classifier for Detecting Malicious Docker Images in Docker Hub RepositoryMaram Aldiabat0https://orcid.org/0009-0004-0706-0038Qussai M. Yaseen1https://orcid.org/0000-0003-0777-1871Qusai Abu Ein2Department of Computer Science and Software Engineering, Auburn University, Auburn, AL, USADepartment of Information Technology, AIRC Center, Ajman University, Ajman, United Arab EmiratesDepartment of Computer Information Systems, Jordan University of Science and Technology, Irbid, JordanThe number of exploits of Docker images involving the injection of adversarial behaviors into the image’s layers is increasing immensely. Docker images are a fundamental component of Docker. Therefore, developing a machine learning classifier that effectively predicts and classifies whether a Docker image contains injected malicious behaviors is crucial as a proactive approach. This paper proposes a machine learning model to assess the feasibility of employing machine learning algorithms for detecting the security status of Docker images available in the Docker Hub repository. The paper develops a machine learning model for detecting malicious Docker images by using a newly created dataset containing Docker images associated with 14 corresponding features that were specifically chosen as they are critical indicators of potential security risks in Docker images, and the dataset was published for research purposes. Moreover, the paper developed and tested several machine learning algorithms using Docker image features: Naïve Bayes, Decision Tree, Random Forest, Gradient Boosting, Extreme Gradient Boosting, and Neural Network. The results show that the Random Forest classifier demonstrates exceptional accuracy, achieving a 99% F1-score and an AUC of 100%. This performance refers to its capability to accurately classify the images and effectively distinguish between secure and insecure images, in addition to the minimal error rate of less than 1%, outperforming state-of-the-art models to identify malicious Docker images.https://ieeexplore.ieee.org/document/10768874/DockerDocker imagesDocker containersmachine learningmalware detection
spellingShingle	Maram Aldiabat Qussai M. Yaseen Qusai Abu Ein An Efficient Random Forest Classifier for Detecting Malicious Docker Images in Docker Hub Repository IEEE Access Docker Docker images Docker containers machine learning malware detection
title	An Efficient Random Forest Classifier for Detecting Malicious Docker Images in Docker Hub Repository
title_full	An Efficient Random Forest Classifier for Detecting Malicious Docker Images in Docker Hub Repository
title_fullStr	An Efficient Random Forest Classifier for Detecting Malicious Docker Images in Docker Hub Repository
title_full_unstemmed	An Efficient Random Forest Classifier for Detecting Malicious Docker Images in Docker Hub Repository
title_short	An Efficient Random Forest Classifier for Detecting Malicious Docker Images in Docker Hub Repository
title_sort	efficient random forest classifier for detecting malicious docker images in docker hub repository
topic	Docker Docker images Docker containers machine learning malware detection
url	https://ieeexplore.ieee.org/document/10768874/
work_keys_str_mv	AT maramaldiabat anefficientrandomforestclassifierfordetectingmaliciousdockerimagesindockerhubrepository AT qussaimyaseen anefficientrandomforestclassifierfordetectingmaliciousdockerimagesindockerhubrepository AT qusaiabuein anefficientrandomforestclassifierfordetectingmaliciousdockerimagesindockerhubrepository AT maramaldiabat efficientrandomforestclassifierfordetectingmaliciousdockerimagesindockerhubrepository AT qussaimyaseen efficientrandomforestclassifierfordetectingmaliciousdockerimagesindockerhubrepository AT qusaiabuein efficientrandomforestclassifierfordetectingmaliciousdockerimagesindockerhubrepository

An Efficient Random Forest Classifier for Detecting Malicious Docker Images in Docker Hub Repository

Similar Items