Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health Records

Background: In predictive modelling, particularly in fields such as healthcare, the importance of understanding the model’s behaviour rivals, if not surpasses, that of discriminability. To this end, attention mechanisms have been included in deep learning models for years. However, when comparing di...

Full description

Saved in:

Bibliographic Details
Main Authors:	Lucía A. Carrasco-Ribelles, Margarita Cabrera-Bean, Jose Llanes-Jurado, Concepción Violán
Format:	Article
Language:	English
Published:	MDPI AG 2024-12-01
Series:	Applied Sciences
Subjects:	attention mechanism clinical plausibility discriminability electronic health record recurrent neural network longitudinal data
Online Access:	https://www.mdpi.com/2076-3417/15/1/146
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841549429632401408
author	Lucía A. Carrasco-Ribelles Margarita Cabrera-Bean Jose Llanes-Jurado Concepción Violán
author_facet	Lucía A. Carrasco-Ribelles Margarita Cabrera-Bean Jose Llanes-Jurado Concepción Violán
author_sort	Lucía A. Carrasco-Ribelles
collection	DOAJ
description	Background: In predictive modelling, particularly in fields such as healthcare, the importance of understanding the model’s behaviour rivals, if not surpasses, that of discriminability. To this end, attention mechanisms have been included in deep learning models for years. However, when comparing different models, the one with the best discriminability is usually chosen without considering the clinical plausibility of their predictions. Objective: In this work several attention-based deep learning architectures with increasing degrees of complexity were designed and compared aiming to study the balance between discriminability and plausibility with architecture complexity when working with longitudinal data from Electronic Health Records (EHRs). Methods: We developed four deep learning-based architectures with attention mechanisms that were progressively more complex to handle longitudinal data from EHRs. We evaluated their discriminability and resulting attention maps and compared them amongst architectures and different input processing approaches. We trained them on 10 years of data from EHRs from Catalonia (Spain) and evaluated them using a 5-fold cross-validation to predict 1-year all-cause mortality in a subsample of 500,000 people over 65 years of age. Results: Generally, the simplest architectures led to the best overall discriminability, slightly decreasing with complexity by up to 8.7%. However, the attention maps resulting from the simpler architectures were less informative and less clinically plausible compared to those from more complex architectures. Moreover, the latter could give attention weights both in the time and feature domains. Conclusions: Our results suggest that discriminability and more informative and clinically plausible attention maps do not always go together. Given the preferences within the healthcare field for enhanced explainability, establishing a balance with discriminability is imperative.
format	Article
id	doaj-art-2478f54b37c847068992b1a89d2755e3
institution	Kabale University
issn	2076-3417
language	English
publishDate	2024-12-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj-art-2478f54b37c847068992b1a89d2755e32025-01-10T13:14:36ZengMDPI AGApplied Sciences2076-34172024-12-0115114610.3390/app15010146Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health RecordsLucía A. Carrasco-Ribelles0Margarita Cabrera-Bean1Jose Llanes-Jurado2Concepción Violán3Institut Universitari d’Investigació en Atenció Primària Jordi Gol (IDIAP Jordi Gol), 08007 Barcelona, SpainDepartment of Signal Theory and Communications, Universitat Politècnica de Catalunya (UPC), 08034 Barcelona, SpainInstituto Universitario de Investigación en Tecnología Centrada en el Ser Humano, Universitat Politècnica de València, 46022 Valencia, SpainUnitat de Suport a la Recerca Metropolitana Nord, Institut Universitari d’Investigació en Atenció Primària Jordi Gol (IDIAP Jordi Gol), 08303 Mataró, SpainBackground: In predictive modelling, particularly in fields such as healthcare, the importance of understanding the model’s behaviour rivals, if not surpasses, that of discriminability. To this end, attention mechanisms have been included in deep learning models for years. However, when comparing different models, the one with the best discriminability is usually chosen without considering the clinical plausibility of their predictions. Objective: In this work several attention-based deep learning architectures with increasing degrees of complexity were designed and compared aiming to study the balance between discriminability and plausibility with architecture complexity when working with longitudinal data from Electronic Health Records (EHRs). Methods: We developed four deep learning-based architectures with attention mechanisms that were progressively more complex to handle longitudinal data from EHRs. We evaluated their discriminability and resulting attention maps and compared them amongst architectures and different input processing approaches. We trained them on 10 years of data from EHRs from Catalonia (Spain) and evaluated them using a 5-fold cross-validation to predict 1-year all-cause mortality in a subsample of 500,000 people over 65 years of age. Results: Generally, the simplest architectures led to the best overall discriminability, slightly decreasing with complexity by up to 8.7%. However, the attention maps resulting from the simpler architectures were less informative and less clinically plausible compared to those from more complex architectures. Moreover, the latter could give attention weights both in the time and feature domains. Conclusions: Our results suggest that discriminability and more informative and clinically plausible attention maps do not always go together. Given the preferences within the healthcare field for enhanced explainability, establishing a balance with discriminability is imperative.https://www.mdpi.com/2076-3417/15/1/146attention mechanismclinical plausibilitydiscriminabilityelectronic health recordrecurrent neural networklongitudinal data
spellingShingle	Lucía A. Carrasco-Ribelles Margarita Cabrera-Bean Jose Llanes-Jurado Concepción Violán Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health Records Applied Sciences attention mechanism clinical plausibility discriminability electronic health record recurrent neural network longitudinal data
title	Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health Records
title_full	Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health Records
title_fullStr	Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health Records
title_full_unstemmed	Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health Records
title_short	Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health Records
title_sort	use of attention maps to enrich discriminability in deep learning prediction models using longitudinal data from electronic health records
topic	attention mechanism clinical plausibility discriminability electronic health record recurrent neural network longitudinal data
url	https://www.mdpi.com/2076-3417/15/1/146
work_keys_str_mv	AT luciaacarrascoribelles useofattentionmapstoenrichdiscriminabilityindeeplearningpredictionmodelsusinglongitudinaldatafromelectronichealthrecords AT margaritacabrerabean useofattentionmapstoenrichdiscriminabilityindeeplearningpredictionmodelsusinglongitudinaldatafromelectronichealthrecords AT josellanesjurado useofattentionmapstoenrichdiscriminabilityindeeplearningpredictionmodelsusinglongitudinaldatafromelectronichealthrecords AT concepcionviolan useofattentionmapstoenrichdiscriminabilityindeeplearningpredictionmodelsusinglongitudinaldatafromelectronichealthrecords

Use of Attention Maps to Enrich Discriminability in Deep Learning Prediction Models Using Longitudinal Data from Electronic Health Records

Similar Items