Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models

Abstract Background Suicidal behaviors, which may lead to death (suicide) or survival (suicide attempt), are influenced by various factors. Identifying the specific risk factors for suicidal behavior mortality is critical for improving prevention strategies and clinical interventions. Predicting the...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohammad Bazrafshan, Kourosh Sayehmiri
Format: Article
Language:English
Published: BMC 2024-11-01
Series:BMC Psychiatry
Subjects:
Online Access:https://doi.org/10.1186/s12888-024-06273-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849221020029812736
author Mohammad Bazrafshan
Kourosh Sayehmiri
author_facet Mohammad Bazrafshan
Kourosh Sayehmiri
author_sort Mohammad Bazrafshan
collection DOAJ
description Abstract Background Suicidal behaviors, which may lead to death (suicide) or survival (suicide attempt), are influenced by various factors. Identifying the specific risk factors for suicidal behavior mortality is critical for improving prevention strategies and clinical interventions. Predicting the outcomes of suicidal behaviors can help identify individuals at higher risk of death, enabling timely and targeted interventions. This study aimed to determine the critical risk factors associated with suicidal behavior mortality and identify an effective classification model for predicting suicidal behavior outcomes. Materials and methods This study utilized data recorded in the suicidal behavior registry system of hospitals in Ilam Province. In the first phase, duplicate records were removed, and the data was numerically encoded via Python version 3.11; then, the data was analyzed using chi-square and Fisher’s exact tests in SPSS version 22 software to identify the factors influencing suicidal behavior mortality. In the second phase, missing data were removed, and the dataset was standardized. Five binary classification algorithms were utilized, including Random Forest, Logistic Regression, and Decision Trees, with hyperparameters optimized using the area under the receiver operating characteristic curve (AUC) and F1 score metrics. These models were compared based on accuracy, recall, precision, F1 score, and AUC. Results Among 3833 cases of suicidal behavior in various hospitals in Ilam Province, the results indicated that the method of suicidal behavior (P < 0.001), reason for suicidal behavior (P < 0.001), age group (P < 0.001), education level (P < 0.001), marital status (P = 0.004), and employment status (P = 0.042) were significantly associated with suicide. Variables such as the season of suicidal behavior, gender, father’s education, and mother’s education were not significantly related to suicidal behavior mortality. Furthermore, the random forest model demonstrated the highest area under the ROC curve (0.79) and the highest classification accuracy and F1 score on both the training data (0.85 and 0.2, respectively) and test data (0.86 and 0.31, respectively) for predicting suicidal behaviors outcomes among the models tested. Conclusion This study identified key factors such as older age, lower education, divorce or widowhood, employment, physical methods, and socioeconomic issues as significant predictors of suicidal behavior outcomes. A combination of statistical models for feature selection and machine learning algorithms for prediction was used, with Random Forest showing the best performance. This approach highlights the potential of integrating statistical methods with machine learning to improve suicide risk prediction and intervention strategies.
format Article
id doaj-art-a58b3f4cbf534bad90f90c1f9f9881f7
institution Kabale University
issn 1471-244X
language English
publishDate 2024-11-01
publisher BMC
record_format Article
series BMC Psychiatry
spelling doaj-art-a58b3f4cbf534bad90f90c1f9f9881f72024-11-24T12:37:51ZengBMCBMC Psychiatry1471-244X2024-11-0124111210.1186/s12888-024-06273-2Predicting suicidal behavior outcomes: an analysis of key factors and machine learning modelsMohammad Bazrafshan0Kourosh Sayehmiri1Medical Doctor, Faculty of Medicine, Ilam University of Medical SciencesDepartment of Biostatistics, Faculty of Health, Ilam University of Medical SciencesAbstract Background Suicidal behaviors, which may lead to death (suicide) or survival (suicide attempt), are influenced by various factors. Identifying the specific risk factors for suicidal behavior mortality is critical for improving prevention strategies and clinical interventions. Predicting the outcomes of suicidal behaviors can help identify individuals at higher risk of death, enabling timely and targeted interventions. This study aimed to determine the critical risk factors associated with suicidal behavior mortality and identify an effective classification model for predicting suicidal behavior outcomes. Materials and methods This study utilized data recorded in the suicidal behavior registry system of hospitals in Ilam Province. In the first phase, duplicate records were removed, and the data was numerically encoded via Python version 3.11; then, the data was analyzed using chi-square and Fisher’s exact tests in SPSS version 22 software to identify the factors influencing suicidal behavior mortality. In the second phase, missing data were removed, and the dataset was standardized. Five binary classification algorithms were utilized, including Random Forest, Logistic Regression, and Decision Trees, with hyperparameters optimized using the area under the receiver operating characteristic curve (AUC) and F1 score metrics. These models were compared based on accuracy, recall, precision, F1 score, and AUC. Results Among 3833 cases of suicidal behavior in various hospitals in Ilam Province, the results indicated that the method of suicidal behavior (P < 0.001), reason for suicidal behavior (P < 0.001), age group (P < 0.001), education level (P < 0.001), marital status (P = 0.004), and employment status (P = 0.042) were significantly associated with suicide. Variables such as the season of suicidal behavior, gender, father’s education, and mother’s education were not significantly related to suicidal behavior mortality. Furthermore, the random forest model demonstrated the highest area under the ROC curve (0.79) and the highest classification accuracy and F1 score on both the training data (0.85 and 0.2, respectively) and test data (0.86 and 0.31, respectively) for predicting suicidal behaviors outcomes among the models tested. Conclusion This study identified key factors such as older age, lower education, divorce or widowhood, employment, physical methods, and socioeconomic issues as significant predictors of suicidal behavior outcomes. A combination of statistical models for feature selection and machine learning algorithms for prediction was used, with Random Forest showing the best performance. This approach highlights the potential of integrating statistical methods with machine learning to improve suicide risk prediction and intervention strategies.https://doi.org/10.1186/s12888-024-06273-2SuicideSuicide attemptSuicidal behaviorSuicide risk factorsMachine learningClassification algorithms
spellingShingle Mohammad Bazrafshan
Kourosh Sayehmiri
Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models
BMC Psychiatry
Suicide
Suicide attempt
Suicidal behavior
Suicide risk factors
Machine learning
Classification algorithms
title Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models
title_full Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models
title_fullStr Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models
title_full_unstemmed Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models
title_short Predicting suicidal behavior outcomes: an analysis of key factors and machine learning models
title_sort predicting suicidal behavior outcomes an analysis of key factors and machine learning models
topic Suicide
Suicide attempt
Suicidal behavior
Suicide risk factors
Machine learning
Classification algorithms
url https://doi.org/10.1186/s12888-024-06273-2
work_keys_str_mv AT mohammadbazrafshan predictingsuicidalbehavioroutcomesananalysisofkeyfactorsandmachinelearningmodels
AT kouroshsayehmiri predictingsuicidalbehavioroutcomesananalysisofkeyfactorsandmachinelearningmodels