Predicting asthma attacks in primary care: protocol for developing a machine learning-based prediction model

Introduction Asthma is a long-term condition with rapid onset worsening of symptoms (‘attacks’) which can be unpredictable and may prove fatal. Models predicting asthma attacks require high sensitivity to minimise mortality risk, and high specificity to avoid unnecessary prescribing of preventative...

Full description

Saved in:
Bibliographic Details
Main Authors: Aziz Sheikh, Holly Tibble, Athanasios Tsanas, Elsie Horne, Robert Horne, Mehrdad Mizani, Colin R Simpson
Format: Article
Language:English
Published: BMJ Publishing Group 2019-07-01
Series:BMJ Open
Online Access:https://bmjopen.bmj.com/content/9/7/e028375.full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846149148787605504
author Aziz Sheikh
Holly Tibble
Athanasios Tsanas
Elsie Horne
Robert Horne
Mehrdad Mizani
Colin R Simpson
author_facet Aziz Sheikh
Holly Tibble
Athanasios Tsanas
Elsie Horne
Robert Horne
Mehrdad Mizani
Colin R Simpson
author_sort Aziz Sheikh
collection DOAJ
description Introduction Asthma is a long-term condition with rapid onset worsening of symptoms (‘attacks’) which can be unpredictable and may prove fatal. Models predicting asthma attacks require high sensitivity to minimise mortality risk, and high specificity to avoid unnecessary prescribing of preventative medications that carry an associated risk of adverse events. We aim to create a risk score to predict asthma attacks in primary care using a statistical learning approach trained on routinely collected electronic health record data.Methods and analysis We will employ machine-learning classifiers (naïve Bayes, support vector machines, and random forests) to create an asthma attack risk prediction model, using the Asthma Learning Health System (ALHS) study patient registry comprising 500 000 individuals across 75 Scottish general practices, with linked longitudinal primary care prescribing records, primary care Read codes, accident and emergency records, hospital admissions and deaths. Models will be compared on a partition of the dataset reserved for validation, and the final model will be tested in both an unseen partition of the derivation dataset and an external dataset from the Seasonal Influenza Vaccination Effectiveness II (SIVE II) study.Ethics and dissemination Permissions for the ALHS project were obtained from the South East Scotland Research Ethics Committee 02 [16/SS/0130] and the Public Benefit and Privacy Panel for Health and Social Care (1516–0489). Permissions for the SIVE II project were obtained from the Privacy Advisory Committee (National Services NHS Scotland) [68/14] and the National Research Ethics Committee West Midlands–Edgbaston [15/WM/0035]. The subsequent research paper will be submitted for publication to a peer-reviewed journal and code scripts used for all components of the data cleaning, compiling, and analysis will be made available in the open source GitHub website (https://github.com/hollytibble).
format Article
id doaj-art-13646442a1b94e14b1a6b6a726264297
institution Kabale University
issn 2044-6055
language English
publishDate 2019-07-01
publisher BMJ Publishing Group
record_format Article
series BMJ Open
spelling doaj-art-13646442a1b94e14b1a6b6a7262642972024-11-29T21:35:13ZengBMJ Publishing GroupBMJ Open2044-60552019-07-019710.1136/bmjopen-2018-028375Predicting asthma attacks in primary care: protocol for developing a machine learning-based prediction modelAziz Sheikh0Holly Tibble1Athanasios Tsanas2Elsie Horne3Robert Horne4Mehrdad Mizani5Colin R Simpson6Usher Institute, University of Edinburgh, Edinburgh, UK3 Asthma UK Centre for Applied Research, Centre for Medical Informatics, Usher Institute, The University of Edinburgh, Edinburgh, UKUsher Institute of Population Health Sciences and Informatics, Edinburgh Medical School, College of Medicine and Veterinary Medicine, University of Edinburgh, Edinburgh, UKUsher Institute of Population Health Sciences and Informatics, Edinburgh Medical School, College of Medicine and Veterinary Medicine, University of Edinburgh, Edinburgh, UKUCL School of Pharmacy, University College London, London, UKUsher Institute of Population Health Sciences and Informatics, Edinburgh Medical School, College of Medicine and Veterinary Medicine, University of Edinburgh, Edinburgh, UKWellington Faculty of Health, Victoria University of Wellington, Wellington, New ZealandIntroduction Asthma is a long-term condition with rapid onset worsening of symptoms (‘attacks’) which can be unpredictable and may prove fatal. Models predicting asthma attacks require high sensitivity to minimise mortality risk, and high specificity to avoid unnecessary prescribing of preventative medications that carry an associated risk of adverse events. We aim to create a risk score to predict asthma attacks in primary care using a statistical learning approach trained on routinely collected electronic health record data.Methods and analysis We will employ machine-learning classifiers (naïve Bayes, support vector machines, and random forests) to create an asthma attack risk prediction model, using the Asthma Learning Health System (ALHS) study patient registry comprising 500 000 individuals across 75 Scottish general practices, with linked longitudinal primary care prescribing records, primary care Read codes, accident and emergency records, hospital admissions and deaths. Models will be compared on a partition of the dataset reserved for validation, and the final model will be tested in both an unseen partition of the derivation dataset and an external dataset from the Seasonal Influenza Vaccination Effectiveness II (SIVE II) study.Ethics and dissemination Permissions for the ALHS project were obtained from the South East Scotland Research Ethics Committee 02 [16/SS/0130] and the Public Benefit and Privacy Panel for Health and Social Care (1516–0489). Permissions for the SIVE II project were obtained from the Privacy Advisory Committee (National Services NHS Scotland) [68/14] and the National Research Ethics Committee West Midlands–Edgbaston [15/WM/0035]. The subsequent research paper will be submitted for publication to a peer-reviewed journal and code scripts used for all components of the data cleaning, compiling, and analysis will be made available in the open source GitHub website (https://github.com/hollytibble).https://bmjopen.bmj.com/content/9/7/e028375.full
spellingShingle Aziz Sheikh
Holly Tibble
Athanasios Tsanas
Elsie Horne
Robert Horne
Mehrdad Mizani
Colin R Simpson
Predicting asthma attacks in primary care: protocol for developing a machine learning-based prediction model
BMJ Open
title Predicting asthma attacks in primary care: protocol for developing a machine learning-based prediction model
title_full Predicting asthma attacks in primary care: protocol for developing a machine learning-based prediction model
title_fullStr Predicting asthma attacks in primary care: protocol for developing a machine learning-based prediction model
title_full_unstemmed Predicting asthma attacks in primary care: protocol for developing a machine learning-based prediction model
title_short Predicting asthma attacks in primary care: protocol for developing a machine learning-based prediction model
title_sort predicting asthma attacks in primary care protocol for developing a machine learning based prediction model
url https://bmjopen.bmj.com/content/9/7/e028375.full
work_keys_str_mv AT azizsheikh predictingasthmaattacksinprimarycareprotocolfordevelopingamachinelearningbasedpredictionmodel
AT hollytibble predictingasthmaattacksinprimarycareprotocolfordevelopingamachinelearningbasedpredictionmodel
AT athanasiostsanas predictingasthmaattacksinprimarycareprotocolfordevelopingamachinelearningbasedpredictionmodel
AT elsiehorne predictingasthmaattacksinprimarycareprotocolfordevelopingamachinelearningbasedpredictionmodel
AT roberthorne predictingasthmaattacksinprimarycareprotocolfordevelopingamachinelearningbasedpredictionmodel
AT mehrdadmizani predictingasthmaattacksinprimarycareprotocolfordevelopingamachinelearningbasedpredictionmodel
AT colinrsimpson predictingasthmaattacksinprimarycareprotocolfordevelopingamachinelearningbasedpredictionmodel