Risk prediction of 30-day mortality after stroke using machine learning: a nationwide registry-based cohort study

Wenjuan Wang; Anthony Rudd; Yanzhong Wang; Vasa Curcin; Charles Wolfe; Niels Peek; Benjamin Bray

doi:10.1186/s12883-022-02722-1

Risk prediction of 30-day mortality after stroke using machine learning: a nationwide registry-based cohort study

Wenjuan Wang, Anthony Rudd, Yanzhong Wang, Vasa Curcin, Charles Wolfe, Niels Peek, Benjamin Bray

Research output: Contribution to journal › Article › peer-review

9 Citations (Scopus)

94 Downloads (Pure)

Abstract

Backgrounds
We aimed to develop and validate machine learning (ML) models for 30-day stroke mortality for mortality risk stratification and as benchmarking models for quality improvement in stroke care.

Methods
Data from the UK Sentinel Stroke National Audit Program between 2013 to 2019 were used. Models were developed using XGBoost, Logistic Regression (LR), LR with elastic net with/without interaction terms using 80% randomly selected admissions from 2013 to 2018, validated on the 20% remaining admissions, and temporally validated on 2019 admissions. The models were developed with 30 variables. A reference model was developed using LR and 4 variables. Performances of all models was evaluated in terms of discrimination, calibration, reclassification, Brier scores and Decision-curves.

Results
In total, 488,497 stroke patients with a 12.3% 30-day mortality rate were included in the analysis. In 2019 temporal validation set, XGBoost model obtained the lowest Brier score (0.069 (95% CI: 0.068–0.071)) and the highest area under the ROC curve (AUC) (0.895 (95% CI: 0.891–0.900)) which outperformed LR reference model by 0.04 AUC (p < 0.001) and LR with elastic net and interaction term model by 0.003 AUC (p < 0.001). All models were perfectly calibrated for low (< 5%) and moderate risk groups (5–15%) and ≈1% underestimation for high-risk groups (> 15%). The XGBoost model reclassified 1648 (8.1%) low-risk cases by the LR reference model as being moderate or high-risk and gained the most net benefit in decision curve analysis.

Conclusions
All models with 30 variables are potentially useful as benchmarking models in stroke-care quality improvement with ML slightly outperforming others.

Original language	English
Article number	195
Journal	BMC Neurology
Volume	22
Issue number	1
DOIs	https://doi.org/10.1186/s12883-022-02722-1
Publication status	Published - 27 May 2022

Access to Document

10.1186/s12883-022-02722-1Licence: CC BY

Risk prediction of 30-day mortality after strokeFinal published version, 1.29 MBLicence: CC BY

Cite this

@article{cf3360d40f694ad2a3b35c99978c70f3,

title = "Risk prediction of 30-day mortality after stroke using machine learning: a nationwide registry-based cohort study",

abstract = "BackgroundsWe aimed to develop and validate machine learning (ML) models for 30-day stroke mortality for mortality risk stratification and as benchmarking models for quality improvement in stroke care.MethodsData from the UK Sentinel Stroke National Audit Program between 2013 to 2019 were used. Models were developed using XGBoost, Logistic Regression (LR), LR with elastic net with/without interaction terms using 80% randomly selected admissions from 2013 to 2018, validated on the 20% remaining admissions, and temporally validated on 2019 admissions. The models were developed with 30 variables. A reference model was developed using LR and 4 variables. Performances of all models was evaluated in terms of discrimination, calibration, reclassification, Brier scores and Decision-curves.ResultsIn total, 488,497 stroke patients with a 12.3% 30-day mortality rate were included in the analysis. In 2019 temporal validation set, XGBoost model obtained the lowest Brier score (0.069 (95% CI: 0.068–0.071)) and the highest area under the ROC curve (AUC) (0.895 (95% CI: 0.891–0.900)) which outperformed LR reference model by 0.04 AUC (p < 0.001) and LR with elastic net and interaction term model by 0.003 AUC (p < 0.001). All models were perfectly calibrated for low (< 5%) and moderate risk groups (5–15%) and ≈1% underestimation for high-risk groups (> 15%). The XGBoost model reclassified 1648 (8.1%) low-risk cases by the LR reference model as being moderate or high-risk and gained the most net benefit in decision curve analysis.ConclusionsAll models with 30 variables are potentially useful as benchmarking models in stroke-care quality improvement with ML slightly outperforming others.",

author = "Wenjuan Wang and Anthony Rudd and Yanzhong Wang and Vasa Curcin and Charles Wolfe and Niels Peek and Benjamin Bray",

note = "Funding Information: CDW, NP, VC, AGR, and WW acknowledge the financial support from the Health Foundation. CDW, VC, and YW acknowledge support from the National Institute for Health Research (NIHR) Biomedical Research Centre (BRC) based at Guy{\textquoteright}s and St Thomas{\textquoteright} National Health Service (NHS) Foundation Trust and King{\textquoteright}s College London, and the NIHR Collaboration for Leadership in Applied Health Research and Care (ARC) South London at King{\textquoteright}s College Hospital NHS Foundation Trust. VC is supported by the Public Health and Multi- morbidity Theme of the National Institute for Health Research{\textquoteright}s Applied Research Collaboration (ARC) South London. VC is also supported by the EPSRC CONSULT grant (EP/P010105/1). NP acknowledges support from the NIHR Manchester BRC. The views expressed are those of the authors and not necessarily those of the NHS, the BRC or ARC. Publisher Copyright: {\textcopyright} 2022, The Author(s).",

year = "2022",

month = may,

day = "27",

doi = "10.1186/s12883-022-02722-1",

language = "English",

volume = "22",

journal = "BMC Neurology",

issn = "1471-2377",

publisher = "BioMed Central",

number = "1",

}

TY - JOUR

T1 - Risk prediction of 30-day mortality after stroke using machine learning

T2 - a nationwide registry-based cohort study

AU - Wang, Wenjuan

AU - Rudd, Anthony

AU - Wang, Yanzhong

AU - Curcin, Vasa

AU - Wolfe, Charles

AU - Peek, Niels

AU - Bray, Benjamin

N1 - Funding Information: CDW, NP, VC, AGR, and WW acknowledge the financial support from the Health Foundation. CDW, VC, and YW acknowledge support from the National Institute for Health Research (NIHR) Biomedical Research Centre (BRC) based at Guy’s and St Thomas’ National Health Service (NHS) Foundation Trust and King’s College London, and the NIHR Collaboration for Leadership in Applied Health Research and Care (ARC) South London at King’s College Hospital NHS Foundation Trust. VC is supported by the Public Health and Multi- morbidity Theme of the National Institute for Health Research’s Applied Research Collaboration (ARC) South London. VC is also supported by the EPSRC CONSULT grant (EP/P010105/1). NP acknowledges support from the NIHR Manchester BRC. The views expressed are those of the authors and not necessarily those of the NHS, the BRC or ARC. Publisher Copyright: © 2022, The Author(s).

PY - 2022/5/27

Y1 - 2022/5/27

N2 - BackgroundsWe aimed to develop and validate machine learning (ML) models for 30-day stroke mortality for mortality risk stratification and as benchmarking models for quality improvement in stroke care.MethodsData from the UK Sentinel Stroke National Audit Program between 2013 to 2019 were used. Models were developed using XGBoost, Logistic Regression (LR), LR with elastic net with/without interaction terms using 80% randomly selected admissions from 2013 to 2018, validated on the 20% remaining admissions, and temporally validated on 2019 admissions. The models were developed with 30 variables. A reference model was developed using LR and 4 variables. Performances of all models was evaluated in terms of discrimination, calibration, reclassification, Brier scores and Decision-curves.ResultsIn total, 488,497 stroke patients with a 12.3% 30-day mortality rate were included in the analysis. In 2019 temporal validation set, XGBoost model obtained the lowest Brier score (0.069 (95% CI: 0.068–0.071)) and the highest area under the ROC curve (AUC) (0.895 (95% CI: 0.891–0.900)) which outperformed LR reference model by 0.04 AUC (p < 0.001) and LR with elastic net and interaction term model by 0.003 AUC (p < 0.001). All models were perfectly calibrated for low (< 5%) and moderate risk groups (5–15%) and ≈1% underestimation for high-risk groups (> 15%). The XGBoost model reclassified 1648 (8.1%) low-risk cases by the LR reference model as being moderate or high-risk and gained the most net benefit in decision curve analysis.ConclusionsAll models with 30 variables are potentially useful as benchmarking models in stroke-care quality improvement with ML slightly outperforming others.

AB - BackgroundsWe aimed to develop and validate machine learning (ML) models for 30-day stroke mortality for mortality risk stratification and as benchmarking models for quality improvement in stroke care.MethodsData from the UK Sentinel Stroke National Audit Program between 2013 to 2019 were used. Models were developed using XGBoost, Logistic Regression (LR), LR with elastic net with/without interaction terms using 80% randomly selected admissions from 2013 to 2018, validated on the 20% remaining admissions, and temporally validated on 2019 admissions. The models were developed with 30 variables. A reference model was developed using LR and 4 variables. Performances of all models was evaluated in terms of discrimination, calibration, reclassification, Brier scores and Decision-curves.ResultsIn total, 488,497 stroke patients with a 12.3% 30-day mortality rate were included in the analysis. In 2019 temporal validation set, XGBoost model obtained the lowest Brier score (0.069 (95% CI: 0.068–0.071)) and the highest area under the ROC curve (AUC) (0.895 (95% CI: 0.891–0.900)) which outperformed LR reference model by 0.04 AUC (p < 0.001) and LR with elastic net and interaction term model by 0.003 AUC (p < 0.001). All models were perfectly calibrated for low (< 5%) and moderate risk groups (5–15%) and ≈1% underestimation for high-risk groups (> 15%). The XGBoost model reclassified 1648 (8.1%) low-risk cases by the LR reference model as being moderate or high-risk and gained the most net benefit in decision curve analysis.ConclusionsAll models with 30 variables are potentially useful as benchmarking models in stroke-care quality improvement with ML slightly outperforming others.

UR - http://www.scopus.com/inward/record.url?scp=85130680026&partnerID=8YFLogxK

U2 - 10.1186/s12883-022-02722-1

DO - 10.1186/s12883-022-02722-1

M3 - Article

SN - 1471-2377

VL - 22

JO - BMC Neurology

JF - BMC Neurology

IS - 1

M1 - 195

ER -

Risk prediction of 30-day mortality after stroke using machine learning: a nationwide registry-based cohort study

Abstract

Access to Document

Other files and links

Fingerprint

Cite this