Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays

This study evaluates various machine learning models for classifying prostate cancer using gene expression profiles from DNA microarrays. Due to the high dimensionality of these datasets, effective dimensionality reduction through feature selection is essential to identify and remove redundant genes...

Full description

Saved in:
Bibliographic Details
Main Authors: Haddou Bouazza Sara, Haddou Bouazza Jihad
Format: Article
Language:English
Published: EDP Sciences 2024-01-01
Series:ITM Web of Conferences
Online Access:https://www.itm-conferences.org/articles/itmconf/pdf/2024/12/itmconf_maih2024_02004.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841554742679961600
author Haddou Bouazza Sara
Haddou Bouazza Jihad
author_facet Haddou Bouazza Sara
Haddou Bouazza Jihad
author_sort Haddou Bouazza Sara
collection DOAJ
description This study evaluates various machine learning models for classifying prostate cancer using gene expression profiles from DNA microarrays. Due to the high dimensionality of these datasets, effective dimensionality reduction through feature selection is essential to identify and remove redundant genes. We applied multiple feature selection methods, including Signal to Noise Ratio (SNR), ReliefF, Correlation Coefficient (CC), Mutual Information (MI), and several others. These methods were combined with classifiers such as K Nearest Neighbor (KNN), Support Vector Machine (SVM), Linear Discriminant Analysis (LDA), Decision Tree Classifier (DTC), Naïve Bayes (NB), and Artificial Neural Network (ANN). Our results demonstrated that the best combination was the Signal to Noise Ratio with Linear Discriminant Analysis, achieving a classification accuracy of 95% using only six genes. This study underscores the importance of effective feature selection and classifier combination for precise and efficient prostate cancer diagnosis, paving the way for improved personalized healthcare strategies. Future work will focus on validating these findings with larger datasets and exploring advanced machine learning techniques to enhance classification performance further.
format Article
id doaj-art-e78bff426277490e93b8dca6e3ebb04e
institution Kabale University
issn 2271-2097
language English
publishDate 2024-01-01
publisher EDP Sciences
record_format Article
series ITM Web of Conferences
spelling doaj-art-e78bff426277490e93b8dca6e3ebb04e2025-01-08T10:58:54ZengEDP SciencesITM Web of Conferences2271-20972024-01-01690200410.1051/itmconf/20246902004itmconf_maih2024_02004Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA MicroarraysHaddou Bouazza Sara0Haddou Bouazza Jihad1LAMIGEP, EMSILAMIGEP, EMSIThis study evaluates various machine learning models for classifying prostate cancer using gene expression profiles from DNA microarrays. Due to the high dimensionality of these datasets, effective dimensionality reduction through feature selection is essential to identify and remove redundant genes. We applied multiple feature selection methods, including Signal to Noise Ratio (SNR), ReliefF, Correlation Coefficient (CC), Mutual Information (MI), and several others. These methods were combined with classifiers such as K Nearest Neighbor (KNN), Support Vector Machine (SVM), Linear Discriminant Analysis (LDA), Decision Tree Classifier (DTC), Naïve Bayes (NB), and Artificial Neural Network (ANN). Our results demonstrated that the best combination was the Signal to Noise Ratio with Linear Discriminant Analysis, achieving a classification accuracy of 95% using only six genes. This study underscores the importance of effective feature selection and classifier combination for precise and efficient prostate cancer diagnosis, paving the way for improved personalized healthcare strategies. Future work will focus on validating these findings with larger datasets and exploring advanced machine learning techniques to enhance classification performance further.https://www.itm-conferences.org/articles/itmconf/pdf/2024/12/itmconf_maih2024_02004.pdf
spellingShingle Haddou Bouazza Sara
Haddou Bouazza Jihad
Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays
ITM Web of Conferences
title Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays
title_full Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays
title_fullStr Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays
title_full_unstemmed Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays
title_short Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays
title_sort evaluating machine learning models for prostate cancer classification using gene expression profiles from dna microarrays
url https://www.itm-conferences.org/articles/itmconf/pdf/2024/12/itmconf_maih2024_02004.pdf
work_keys_str_mv AT haddoubouazzasara evaluatingmachinelearningmodelsforprostatecancerclassificationusinggeneexpressionprofilesfromdnamicroarrays
AT haddoubouazzajihad evaluatingmachinelearningmodelsforprostatecancerclassificationusinggeneexpressionprofilesfromdnamicroarrays