From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.

<h4>Background</h4>Progression free survival (PFS) is a critical clinical outcome endpoint during cancer management and treatment evaluation. Yet, PFS is often missing from publicly available datasets due to the current subjective, expert, and time-intensive nature of generating PFS metr...

Full description

Saved in:

Bibliographic Details
Main Authors:	Shreya Chappidi, Mason J Belue, Stephanie A Harmon, Sarisha Jagasia, Ying Zhuge, Erdal Tasci, Baris Turkbey, Jatinder Singh, Kevin Camphausen, Andra V Krauze
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2025-05-01
Series:	PLOS Digital Health
Online Access:	https://doi.org/10.1371/journal.pdig.0000755
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849327795622117376
author	Shreya Chappidi Mason J Belue Stephanie A Harmon Sarisha Jagasia Ying Zhuge Erdal Tasci Baris Turkbey Jatinder Singh Kevin Camphausen Andra V Krauze
author_facet	Shreya Chappidi Mason J Belue Stephanie A Harmon Sarisha Jagasia Ying Zhuge Erdal Tasci Baris Turkbey Jatinder Singh Kevin Camphausen Andra V Krauze
author_sort	Shreya Chappidi
collection	DOAJ
description	<h4>Background</h4>Progression free survival (PFS) is a critical clinical outcome endpoint during cancer management and treatment evaluation. Yet, PFS is often missing from publicly available datasets due to the current subjective, expert, and time-intensive nature of generating PFS metrics. Given emerging research in multi-modal machine learning (ML), we explored the benefits and challenges associated with mining different electronic health record (EHR) data modalities and automating extraction of PFS metrics via ML algorithms.<h4>Methods</h4>We analyzed EHR data from 92 pathology-proven GBM patients, obtaining 233 corticosteroid prescriptions, 2080 radiology reports, and 743 brain MRI scans. Three methods were developed to derive clinical PFS: 1) frequency analysis of corticosteroid prescriptions, 2) natural language processing (NLP) of reports, and 3) computer vision (CV) volumetric analysis of imaging. Outputs from these methods were compared to manually annotated clinical guideline PFS metrics.<h4>Results</h4>Employing data-driven methods, standalone progression rates were 63% (prescription), 78% (NLP), and 54% (CV), compared to the 99% progression rate from manually applied clinical guidelines using integrated data sources. The prescription method identified progression an average of 5.2 months later than the clinical standard, while the CV and NLP algorithms identified progression earlier by 2.6 and 6.9 months, respectively. While lesion growth is a clinical guideline progression indicator, only half of patients exhibited increasing contrast-enhancing tumor volumes during scan-based CV analysis.<h4>Conclusion</h4>Our results indicate that data-driven algorithms can extract tumor progression outcomes from existing EHR data. However, ML methods are subject to varying availability bias, supporting contextual information, and pre-processing resource burdens that influence the extracted PFS endpoint distributions. Our scan-based CV results also suggest that the automation of clinical criteria may not align with human intuition. Our findings indicate a need for improved data source integration, validation, and revisiting of clinical criteria in parallel to multi-modal ML algorithm development.
format	Article
id	doaj-art-fdd969b51a204107a87ff70ebe750b4c
institution	Kabale University
issn	2767-3170
language	English
publishDate	2025-05-01
publisher	Public Library of Science (PLoS)
record_format	Article
series	PLOS Digital Health
spelling	doaj-art-fdd969b51a204107a87ff70ebe750b4c2025-08-20T03:47:45ZengPublic Library of Science (PLoS)PLOS Digital Health2767-31702025-05-0145e000075510.1371/journal.pdig.0000755From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.Shreya ChappidiMason J BelueStephanie A HarmonSarisha JagasiaYing ZhugeErdal TasciBaris TurkbeyJatinder SinghKevin CamphausenAndra V Krauze<h4>Background</h4>Progression free survival (PFS) is a critical clinical outcome endpoint during cancer management and treatment evaluation. Yet, PFS is often missing from publicly available datasets due to the current subjective, expert, and time-intensive nature of generating PFS metrics. Given emerging research in multi-modal machine learning (ML), we explored the benefits and challenges associated with mining different electronic health record (EHR) data modalities and automating extraction of PFS metrics via ML algorithms.<h4>Methods</h4>We analyzed EHR data from 92 pathology-proven GBM patients, obtaining 233 corticosteroid prescriptions, 2080 radiology reports, and 743 brain MRI scans. Three methods were developed to derive clinical PFS: 1) frequency analysis of corticosteroid prescriptions, 2) natural language processing (NLP) of reports, and 3) computer vision (CV) volumetric analysis of imaging. Outputs from these methods were compared to manually annotated clinical guideline PFS metrics.<h4>Results</h4>Employing data-driven methods, standalone progression rates were 63% (prescription), 78% (NLP), and 54% (CV), compared to the 99% progression rate from manually applied clinical guidelines using integrated data sources. The prescription method identified progression an average of 5.2 months later than the clinical standard, while the CV and NLP algorithms identified progression earlier by 2.6 and 6.9 months, respectively. While lesion growth is a clinical guideline progression indicator, only half of patients exhibited increasing contrast-enhancing tumor volumes during scan-based CV analysis.<h4>Conclusion</h4>Our results indicate that data-driven algorithms can extract tumor progression outcomes from existing EHR data. However, ML methods are subject to varying availability bias, supporting contextual information, and pre-processing resource burdens that influence the extracted PFS endpoint distributions. Our scan-based CV results also suggest that the automation of clinical criteria may not align with human intuition. Our findings indicate a need for improved data source integration, validation, and revisiting of clinical criteria in parallel to multi-modal ML algorithm development.https://doi.org/10.1371/journal.pdig.0000755
spellingShingle	Shreya Chappidi Mason J Belue Stephanie A Harmon Sarisha Jagasia Ying Zhuge Erdal Tasci Baris Turkbey Jatinder Singh Kevin Camphausen Andra V Krauze From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities. PLOS Digital Health
title	From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.
title_full	From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.
title_fullStr	From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.
title_full_unstemmed	From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.
title_short	From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.
title_sort	from manual clinical criteria to machine learning algorithms comparing outcome endpoints derived from diverse electronic health record data modalities
url	https://doi.org/10.1371/journal.pdig.0000755
work_keys_str_mv	AT shreyachappidi frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities AT masonjbelue frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities AT stephanieaharmon frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities AT sarishajagasia frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities AT yingzhuge frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities AT erdaltasci frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities AT baristurkbey frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities AT jatindersingh frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities AT kevincamphausen frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities AT andravkrauze frommanualclinicalcriteriatomachinelearningalgorithmscomparingoutcomeendpointsderivedfromdiverseelectronichealthrecorddatamodalities

From manual clinical criteria to machine learning algorithms: Comparing outcome endpoints derived from diverse electronic health record data modalities.

Similar Items