Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays
This study evaluates various machine learning models for classifying prostate cancer using gene expression profiles from DNA microarrays. Due to the high dimensionality of these datasets, effective dimensionality reduction through feature selection is essential to identify and remove redundant genes...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
EDP Sciences
2024-01-01
|
Series: | ITM Web of Conferences |
Online Access: | https://www.itm-conferences.org/articles/itmconf/pdf/2024/12/itmconf_maih2024_02004.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841554742679961600 |
---|---|
author | Haddou Bouazza Sara Haddou Bouazza Jihad |
author_facet | Haddou Bouazza Sara Haddou Bouazza Jihad |
author_sort | Haddou Bouazza Sara |
collection | DOAJ |
description | This study evaluates various machine learning models for classifying prostate cancer using gene expression profiles from DNA microarrays. Due to the high dimensionality of these datasets, effective dimensionality reduction through feature selection is essential to identify and remove redundant genes. We applied multiple feature selection methods, including Signal to Noise Ratio (SNR), ReliefF, Correlation Coefficient (CC), Mutual Information (MI), and several others. These methods were combined with classifiers such as K Nearest Neighbor (KNN), Support Vector Machine (SVM), Linear Discriminant Analysis (LDA), Decision Tree Classifier (DTC), Naïve Bayes (NB), and Artificial Neural Network (ANN). Our results demonstrated that the best combination was the Signal to Noise Ratio with Linear Discriminant Analysis, achieving a classification accuracy of 95% using only six genes. This study underscores the importance of effective feature selection and classifier combination for precise and efficient prostate cancer diagnosis, paving the way for improved personalized healthcare strategies. Future work will focus on validating these findings with larger datasets and exploring advanced machine learning techniques to enhance classification performance further. |
format | Article |
id | doaj-art-e78bff426277490e93b8dca6e3ebb04e |
institution | Kabale University |
issn | 2271-2097 |
language | English |
publishDate | 2024-01-01 |
publisher | EDP Sciences |
record_format | Article |
series | ITM Web of Conferences |
spelling | doaj-art-e78bff426277490e93b8dca6e3ebb04e2025-01-08T10:58:54ZengEDP SciencesITM Web of Conferences2271-20972024-01-01690200410.1051/itmconf/20246902004itmconf_maih2024_02004Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA MicroarraysHaddou Bouazza Sara0Haddou Bouazza Jihad1LAMIGEP, EMSILAMIGEP, EMSIThis study evaluates various machine learning models for classifying prostate cancer using gene expression profiles from DNA microarrays. Due to the high dimensionality of these datasets, effective dimensionality reduction through feature selection is essential to identify and remove redundant genes. We applied multiple feature selection methods, including Signal to Noise Ratio (SNR), ReliefF, Correlation Coefficient (CC), Mutual Information (MI), and several others. These methods were combined with classifiers such as K Nearest Neighbor (KNN), Support Vector Machine (SVM), Linear Discriminant Analysis (LDA), Decision Tree Classifier (DTC), Naïve Bayes (NB), and Artificial Neural Network (ANN). Our results demonstrated that the best combination was the Signal to Noise Ratio with Linear Discriminant Analysis, achieving a classification accuracy of 95% using only six genes. This study underscores the importance of effective feature selection and classifier combination for precise and efficient prostate cancer diagnosis, paving the way for improved personalized healthcare strategies. Future work will focus on validating these findings with larger datasets and exploring advanced machine learning techniques to enhance classification performance further.https://www.itm-conferences.org/articles/itmconf/pdf/2024/12/itmconf_maih2024_02004.pdf |
spellingShingle | Haddou Bouazza Sara Haddou Bouazza Jihad Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays ITM Web of Conferences |
title | Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays |
title_full | Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays |
title_fullStr | Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays |
title_full_unstemmed | Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays |
title_short | Evaluating Machine Learning Models for Prostate Cancer Classification Using Gene Expression Profiles from DNA Microarrays |
title_sort | evaluating machine learning models for prostate cancer classification using gene expression profiles from dna microarrays |
url | https://www.itm-conferences.org/articles/itmconf/pdf/2024/12/itmconf_maih2024_02004.pdf |
work_keys_str_mv | AT haddoubouazzasara evaluatingmachinelearningmodelsforprostatecancerclassificationusinggeneexpressionprofilesfromdnamicroarrays AT haddoubouazzajihad evaluatingmachinelearningmodelsforprostatecancerclassificationusinggeneexpressionprofilesfromdnamicroarrays |