Urban Built Environment as a Predictor for Coronary Heart Disease—A Cross-Sectional Study Based on Machine Learning

The relationship between coronary heart disease (CHD) and complex urban built environments remains a subject of considerable uncertainty. The development of predictive models via machine learning to explore the underlying mechanisms of this association, as well as the formulation of intervention pol...

Full description

Saved in:
Bibliographic Details
Main Authors: Dan Jiang, Fei Guo, Ziteng Zhang, Xiaoqing Yu, Jing Dong, Hongchi Zhang, Zhen Zhang
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Buildings
Subjects:
Online Access:https://www.mdpi.com/2075-5309/14/12/4024
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846105406807474176
author Dan Jiang
Fei Guo
Ziteng Zhang
Xiaoqing Yu
Jing Dong
Hongchi Zhang
Zhen Zhang
author_facet Dan Jiang
Fei Guo
Ziteng Zhang
Xiaoqing Yu
Jing Dong
Hongchi Zhang
Zhen Zhang
author_sort Dan Jiang
collection DOAJ
description The relationship between coronary heart disease (CHD) and complex urban built environments remains a subject of considerable uncertainty. The development of predictive models via machine learning to explore the underlying mechanisms of this association, as well as the formulation of intervention policies and planning strategies, has emerged as a pivotal area of research. A cross-sectional dataset of hospital admissions for CHD over the course of a year from a hospital in Dalian City, China, was assembled and matched with multi-source built environment data via residential addresses. This study evaluates five machine learning models, including decision tree (DT), random forest (RF), eXtreme gradient boosting (XGBoost), multi-layer perceptron (MLP), and support vector machine (SVM), and compares them with multiple linear regression models. The results show that DT, RF, and XGBoost exhibit superior predictive capabilities, with all R<sup>2</sup> values exceeding 0.70. The DT model performed the best, with an R<sup>2</sup> value of 0.818, and the best performance was based on metrics such as MAE and MSE. Additionally, using explainable AI techniques, this study reveals the contribution of different built environment factors to CHD and identifies the significant factors influencing CHD in cold regions, ranked as age, Digital Elevation Model (DEM), house price (HP), sky view factor (SVF), and interaction factors. Stratified analyses by age and gender show variations in the influencing factors for different groups: for those under 60 years old, Road Density is the most influential factor; for the 61–70 age group, house price is the top factor; for the 71–80 age group, age is the most significant factor; for those over 81 years old, building height is the leading factor; in males, GDP is the most influential factor; and in females, age is the most influential factor. This study explores the feasibility and performance of machine learning in predicting CHD risk in the built environment of cold regions and provides a comprehensive methodology and workflow for predicting cardiovascular disease risk based on refined neighborhood-level built environment factors, offering scientific support for the construction of sustainable healthy cities.
format Article
id doaj-art-2eebf718cc9c438eba38897f9c3e42d0
institution Kabale University
issn 2075-5309
language English
publishDate 2024-12-01
publisher MDPI AG
record_format Article
series Buildings
spelling doaj-art-2eebf718cc9c438eba38897f9c3e42d02024-12-27T14:16:04ZengMDPI AGBuildings2075-53092024-12-011412402410.3390/buildings14124024Urban Built Environment as a Predictor for Coronary Heart Disease—A Cross-Sectional Study Based on Machine LearningDan Jiang0Fei Guo1Ziteng Zhang2Xiaoqing Yu3Jing Dong4Hongchi Zhang5Zhen Zhang6Cardiology Department, The Second Hospital of Dalian Medical University, Dalian 116023, ChinaSchool of Architecture and Fine Art, Dalian University of Technology, Dalian 116024, ChinaSchool of Architecture and Fine Art, Dalian University of Technology, Dalian 116024, ChinaCardiology Department, The Second Hospital of Dalian Medical University, Dalian 116023, ChinaSchool of Architecture and Fine Art, Dalian University of Technology, Dalian 116024, ChinaSchool of Architecture and Fine Art, Dalian University of Technology, Dalian 116024, ChinaCardiology Department, The Second Hospital of Dalian Medical University, Dalian 116023, ChinaThe relationship between coronary heart disease (CHD) and complex urban built environments remains a subject of considerable uncertainty. The development of predictive models via machine learning to explore the underlying mechanisms of this association, as well as the formulation of intervention policies and planning strategies, has emerged as a pivotal area of research. A cross-sectional dataset of hospital admissions for CHD over the course of a year from a hospital in Dalian City, China, was assembled and matched with multi-source built environment data via residential addresses. This study evaluates five machine learning models, including decision tree (DT), random forest (RF), eXtreme gradient boosting (XGBoost), multi-layer perceptron (MLP), and support vector machine (SVM), and compares them with multiple linear regression models. The results show that DT, RF, and XGBoost exhibit superior predictive capabilities, with all R<sup>2</sup> values exceeding 0.70. The DT model performed the best, with an R<sup>2</sup> value of 0.818, and the best performance was based on metrics such as MAE and MSE. Additionally, using explainable AI techniques, this study reveals the contribution of different built environment factors to CHD and identifies the significant factors influencing CHD in cold regions, ranked as age, Digital Elevation Model (DEM), house price (HP), sky view factor (SVF), and interaction factors. Stratified analyses by age and gender show variations in the influencing factors for different groups: for those under 60 years old, Road Density is the most influential factor; for the 61–70 age group, house price is the top factor; for the 71–80 age group, age is the most significant factor; for those over 81 years old, building height is the leading factor; in males, GDP is the most influential factor; and in females, age is the most influential factor. This study explores the feasibility and performance of machine learning in predicting CHD risk in the built environment of cold regions and provides a comprehensive methodology and workflow for predicting cardiovascular disease risk based on refined neighborhood-level built environment factors, offering scientific support for the construction of sustainable healthy cities.https://www.mdpi.com/2075-5309/14/12/4024built environmentcoronary heart diseasemachine learninghealthy cities
spellingShingle Dan Jiang
Fei Guo
Ziteng Zhang
Xiaoqing Yu
Jing Dong
Hongchi Zhang
Zhen Zhang
Urban Built Environment as a Predictor for Coronary Heart Disease—A Cross-Sectional Study Based on Machine Learning
Buildings
built environment
coronary heart disease
machine learning
healthy cities
title Urban Built Environment as a Predictor for Coronary Heart Disease—A Cross-Sectional Study Based on Machine Learning
title_full Urban Built Environment as a Predictor for Coronary Heart Disease—A Cross-Sectional Study Based on Machine Learning
title_fullStr Urban Built Environment as a Predictor for Coronary Heart Disease—A Cross-Sectional Study Based on Machine Learning
title_full_unstemmed Urban Built Environment as a Predictor for Coronary Heart Disease—A Cross-Sectional Study Based on Machine Learning
title_short Urban Built Environment as a Predictor for Coronary Heart Disease—A Cross-Sectional Study Based on Machine Learning
title_sort urban built environment as a predictor for coronary heart disease a cross sectional study based on machine learning
topic built environment
coronary heart disease
machine learning
healthy cities
url https://www.mdpi.com/2075-5309/14/12/4024
work_keys_str_mv AT danjiang urbanbuiltenvironmentasapredictorforcoronaryheartdiseaseacrosssectionalstudybasedonmachinelearning
AT feiguo urbanbuiltenvironmentasapredictorforcoronaryheartdiseaseacrosssectionalstudybasedonmachinelearning
AT zitengzhang urbanbuiltenvironmentasapredictorforcoronaryheartdiseaseacrosssectionalstudybasedonmachinelearning
AT xiaoqingyu urbanbuiltenvironmentasapredictorforcoronaryheartdiseaseacrosssectionalstudybasedonmachinelearning
AT jingdong urbanbuiltenvironmentasapredictorforcoronaryheartdiseaseacrosssectionalstudybasedonmachinelearning
AT hongchizhang urbanbuiltenvironmentasapredictorforcoronaryheartdiseaseacrosssectionalstudybasedonmachinelearning
AT zhenzhang urbanbuiltenvironmentasapredictorforcoronaryheartdiseaseacrosssectionalstudybasedonmachinelearning