Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance
Background: The liver is an internal organ located in the upper right section of the abdomen, just beneath the diaphragm, and near the stomach. It performs numerous functions that are essential for metabolism, digestion, detoxification, and nutrient storage. Several types of liver cancers are known...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Iuliu Hatieganu University of Medicine and Pharmacy, Cluj-Napoca
2024-12-01
|
Series: | Applied Medical Informatics |
Subjects: | |
Online Access: | https://ami.info.umfcluj.ro/index.php/AMI/article/view/1063 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841558891348885504 |
---|---|
author | Pratyush Kumar MAHARANA Tapan Kumar BEHERA Pradeep Kumar NAIK |
author_facet | Pratyush Kumar MAHARANA Tapan Kumar BEHERA Pradeep Kumar NAIK |
author_sort | Pratyush Kumar MAHARANA |
collection | DOAJ |
description |
Background: The liver is an internal organ located in the upper right section of the abdomen, just beneath the diaphragm, and near the stomach. It performs numerous functions that are essential for metabolism, digestion, detoxification, and nutrient storage. Several types of liver cancers are known, with the most common being hepatocellular carcinoma (HCC), which is the main type of liver cell called hepatocytes. Another less common type is cholangiocarcinoma, which originates in the bile ducts the liver. This study aimed to evaluate and compare machine-learning-based models for the early detection of liver cancer to improve diagnostic accuracy. Method: In this study, various models, such as SVM, decision tree, random forest, logistic regression, K-neighbor, Gaussian NB, AdaBoost classifier, MLP classifier, passive aggressive, ridge classifier, extra tree, bagging classifier, extra trees, gradient boosting, SGD classifier, linear SVC, voting classifier, and stacking classifier were used. Five performance metrics (accuracy, precision, recall, F1 score, and Cohen’s kappa) were used to evaluate the performance of the proposed methods. Results: The dataset comprised 12 instances, and across all the models tested, we utilized the extra tree classifier for the early detection of liver cancer, achieving a notable accuracy of 85.8%. The model demonstrated a precision of 75.5%, while it achieved a high recall of 92.2%, and the F1 score of 83.2% underscored its robust performance, suggesting significant potential for enhancing diagnostic accuracy and necessitating further investigation. These performance metrics highlight the potential of the extra tree classifier to improve early detection strategies for liver cancer. Conclusion: After performing the complete process, we conclude that the extra tree classifier out of 17 models is the most suitable machine learning algorithm for liver cancer prediction.
|
format | Article |
id | doaj-art-0ce8bad0e96944c3a1d2d6dc04117973 |
institution | Kabale University |
issn | 2067-7855 |
language | English |
publishDate | 2024-12-01 |
publisher | Iuliu Hatieganu University of Medicine and Pharmacy, Cluj-Napoca |
record_format | Article |
series | Applied Medical Informatics |
spelling | doaj-art-0ce8bad0e96944c3a1d2d6dc041179732025-01-05T20:06:50ZengIuliu Hatieganu University of Medicine and Pharmacy, Cluj-NapocaApplied Medical Informatics2067-78552024-12-01464Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical SignificancePratyush Kumar MAHARANA0Tapan Kumar BEHERA1Pradeep Kumar NAIK2Department of Biotechnology and Bioinformatics, Sambalpur University, Jyoti Vihar, Burla, Sambalpur-768019, Odisha, IndiaDepartment of Biotechnology and Bioinformatics, Sambalpur University, Jyoti Vihar, Burla, Sambalpur-768019, Odisha, IndiaDepartment of Biotechnology and Bioinformatics, Sambalpur University, Jyoti Vihar, Burla, Sambalpur-768019, Odisha, India Background: The liver is an internal organ located in the upper right section of the abdomen, just beneath the diaphragm, and near the stomach. It performs numerous functions that are essential for metabolism, digestion, detoxification, and nutrient storage. Several types of liver cancers are known, with the most common being hepatocellular carcinoma (HCC), which is the main type of liver cell called hepatocytes. Another less common type is cholangiocarcinoma, which originates in the bile ducts the liver. This study aimed to evaluate and compare machine-learning-based models for the early detection of liver cancer to improve diagnostic accuracy. Method: In this study, various models, such as SVM, decision tree, random forest, logistic regression, K-neighbor, Gaussian NB, AdaBoost classifier, MLP classifier, passive aggressive, ridge classifier, extra tree, bagging classifier, extra trees, gradient boosting, SGD classifier, linear SVC, voting classifier, and stacking classifier were used. Five performance metrics (accuracy, precision, recall, F1 score, and Cohen’s kappa) were used to evaluate the performance of the proposed methods. Results: The dataset comprised 12 instances, and across all the models tested, we utilized the extra tree classifier for the early detection of liver cancer, achieving a notable accuracy of 85.8%. The model demonstrated a precision of 75.5%, while it achieved a high recall of 92.2%, and the F1 score of 83.2% underscored its robust performance, suggesting significant potential for enhancing diagnostic accuracy and necessitating further investigation. These performance metrics highlight the potential of the extra tree classifier to improve early detection strategies for liver cancer. Conclusion: After performing the complete process, we conclude that the extra tree classifier out of 17 models is the most suitable machine learning algorithm for liver cancer prediction. https://ami.info.umfcluj.ro/index.php/AMI/article/view/1063Hepatocellular Carcinoma (HCC)Machine LearningMultilayer PerceptronReceiver Operating Characteristic (ROC) curve |
spellingShingle | Pratyush Kumar MAHARANA Tapan Kumar BEHERA Pradeep Kumar NAIK Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance Applied Medical Informatics Hepatocellular Carcinoma (HCC) Machine Learning Multilayer Perceptron Receiver Operating Characteristic (ROC) curve |
title | Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance |
title_full | Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance |
title_fullStr | Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance |
title_full_unstemmed | Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance |
title_short | Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance |
title_sort | machine learning based classification and statistical analysis of liver cancer a comprehensive study of model performance and clinical significance |
topic | Hepatocellular Carcinoma (HCC) Machine Learning Multilayer Perceptron Receiver Operating Characteristic (ROC) curve |
url | https://ami.info.umfcluj.ro/index.php/AMI/article/view/1063 |
work_keys_str_mv | AT pratyushkumarmaharana machinelearningbasedclassificationandstatisticalanalysisoflivercanceracomprehensivestudyofmodelperformanceandclinicalsignificance AT tapankumarbehera machinelearningbasedclassificationandstatisticalanalysisoflivercanceracomprehensivestudyofmodelperformanceandclinicalsignificance AT pradeepkumarnaik machinelearningbasedclassificationandstatisticalanalysisoflivercanceracomprehensivestudyofmodelperformanceandclinicalsignificance |