Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance

Background: The liver is an internal organ located in the upper right section of the abdomen, just beneath the diaphragm, and near the stomach. It performs numerous functions that are essential for metabolism, digestion, detoxification, and nutrient storage. Several types of liver cancers are known...

Full description

Saved in:
Bibliographic Details
Main Authors: Pratyush Kumar MAHARANA, Tapan Kumar BEHERA, Pradeep Kumar NAIK
Format: Article
Language:English
Published: Iuliu Hatieganu University of Medicine and Pharmacy, Cluj-Napoca 2024-12-01
Series:Applied Medical Informatics
Subjects:
Online Access:https://ami.info.umfcluj.ro/index.php/AMI/article/view/1063
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841558891348885504
author Pratyush Kumar MAHARANA
Tapan Kumar BEHERA
Pradeep Kumar NAIK
author_facet Pratyush Kumar MAHARANA
Tapan Kumar BEHERA
Pradeep Kumar NAIK
author_sort Pratyush Kumar MAHARANA
collection DOAJ
description Background: The liver is an internal organ located in the upper right section of the abdomen, just beneath the diaphragm, and near the stomach. It performs numerous functions that are essential for metabolism, digestion, detoxification, and nutrient storage. Several types of liver cancers are known, with the most common being hepatocellular carcinoma (HCC), which is the main type of liver cell called hepatocytes. Another less common type is cholangiocarcinoma, which originates in the bile ducts the liver. This study aimed to evaluate and compare machine-learning-based models for the early detection of liver cancer to improve diagnostic accuracy. Method: In this study, various models, such as SVM, decision tree, random forest, logistic regression, K-neighbor, Gaussian NB, AdaBoost classifier, MLP classifier, passive aggressive, ridge classifier, extra tree, bagging classifier, extra trees, gradient boosting, SGD classifier, linear SVC, voting classifier, and stacking classifier were used. Five performance metrics (accuracy, precision, recall, F1 score, and Cohen’s kappa) were used to evaluate the performance of the proposed methods. Results: The dataset comprised 12 instances, and across all the models tested, we utilized the extra tree classifier for the early detection of liver cancer, achieving a notable accuracy of 85.8%. The model demonstrated a precision of 75.5%, while it achieved a high recall of 92.2%, and the F1 score of 83.2% underscored its robust performance, suggesting significant potential for enhancing diagnostic accuracy and necessitating further investigation. These performance metrics highlight the potential of the extra tree classifier to improve early detection strategies for liver cancer. Conclusion: After performing the complete process, we conclude that the extra tree classifier out of 17 models is the most suitable machine learning algorithm for liver cancer prediction.
format Article
id doaj-art-0ce8bad0e96944c3a1d2d6dc04117973
institution Kabale University
issn 2067-7855
language English
publishDate 2024-12-01
publisher Iuliu Hatieganu University of Medicine and Pharmacy, Cluj-Napoca
record_format Article
series Applied Medical Informatics
spelling doaj-art-0ce8bad0e96944c3a1d2d6dc041179732025-01-05T20:06:50ZengIuliu Hatieganu University of Medicine and Pharmacy, Cluj-NapocaApplied Medical Informatics2067-78552024-12-01464Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical SignificancePratyush Kumar MAHARANA0Tapan Kumar BEHERA1Pradeep Kumar NAIK2Department of Biotechnology and Bioinformatics, Sambalpur University, Jyoti Vihar, Burla, Sambalpur-768019, Odisha, IndiaDepartment of Biotechnology and Bioinformatics, Sambalpur University, Jyoti Vihar, Burla, Sambalpur-768019, Odisha, IndiaDepartment of Biotechnology and Bioinformatics, Sambalpur University, Jyoti Vihar, Burla, Sambalpur-768019, Odisha, India Background: The liver is an internal organ located in the upper right section of the abdomen, just beneath the diaphragm, and near the stomach. It performs numerous functions that are essential for metabolism, digestion, detoxification, and nutrient storage. Several types of liver cancers are known, with the most common being hepatocellular carcinoma (HCC), which is the main type of liver cell called hepatocytes. Another less common type is cholangiocarcinoma, which originates in the bile ducts the liver. This study aimed to evaluate and compare machine-learning-based models for the early detection of liver cancer to improve diagnostic accuracy. Method: In this study, various models, such as SVM, decision tree, random forest, logistic regression, K-neighbor, Gaussian NB, AdaBoost classifier, MLP classifier, passive aggressive, ridge classifier, extra tree, bagging classifier, extra trees, gradient boosting, SGD classifier, linear SVC, voting classifier, and stacking classifier were used. Five performance metrics (accuracy, precision, recall, F1 score, and Cohen’s kappa) were used to evaluate the performance of the proposed methods. Results: The dataset comprised 12 instances, and across all the models tested, we utilized the extra tree classifier for the early detection of liver cancer, achieving a notable accuracy of 85.8%. The model demonstrated a precision of 75.5%, while it achieved a high recall of 92.2%, and the F1 score of 83.2% underscored its robust performance, suggesting significant potential for enhancing diagnostic accuracy and necessitating further investigation. These performance metrics highlight the potential of the extra tree classifier to improve early detection strategies for liver cancer. Conclusion: After performing the complete process, we conclude that the extra tree classifier out of 17 models is the most suitable machine learning algorithm for liver cancer prediction. https://ami.info.umfcluj.ro/index.php/AMI/article/view/1063Hepatocellular Carcinoma (HCC)Machine LearningMultilayer PerceptronReceiver Operating Characteristic (ROC) curve
spellingShingle Pratyush Kumar MAHARANA
Tapan Kumar BEHERA
Pradeep Kumar NAIK
Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance
Applied Medical Informatics
Hepatocellular Carcinoma (HCC)
Machine Learning
Multilayer Perceptron
Receiver Operating Characteristic (ROC) curve
title Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance
title_full Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance
title_fullStr Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance
title_full_unstemmed Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance
title_short Machine Learning-Based Classification and Statistical Analysis of Liver Cancer: A Comprehensive Study of Model Performance and Clinical Significance
title_sort machine learning based classification and statistical analysis of liver cancer a comprehensive study of model performance and clinical significance
topic Hepatocellular Carcinoma (HCC)
Machine Learning
Multilayer Perceptron
Receiver Operating Characteristic (ROC) curve
url https://ami.info.umfcluj.ro/index.php/AMI/article/view/1063
work_keys_str_mv AT pratyushkumarmaharana machinelearningbasedclassificationandstatisticalanalysisoflivercanceracomprehensivestudyofmodelperformanceandclinicalsignificance
AT tapankumarbehera machinelearningbasedclassificationandstatisticalanalysisoflivercanceracomprehensivestudyofmodelperformanceandclinicalsignificance
AT pradeepkumarnaik machinelearningbasedclassificationandstatisticalanalysisoflivercanceracomprehensivestudyofmodelperformanceandclinicalsignificance