Projects per year
Abstract
Background
The density of information in digital health records offers new potential opportunities for automated prediction of cost-relevant outcomes.
Aims
We investigated the extent to which routinely recorded data held in the electronic health record (EHR) predict priority service outcomes and whether natural language processing tools enhance the predictions. We evaluated three high priority outcomes: in-patient duration, readmission following in-patient care and high service cost after first presentation.
Method
We used data obtained from a clinical database derived from the EHR of a large mental healthcare provider within the UK. We combined structured data with text-derived data relating to diagnosis statements, medication and psychiatric symptomatology. Predictors of the three different clinical outcomes were modelled using logistic regression with performance evaluated against a validation set to derive areas under receiver operating characteristic curves.
Results
In validation samples, the full models (using all available data) achieved areas under receiver operating characteristic curves between 0.59 and 0.85 (in-patient duration 0.63, readmission 0.59, high service use 0.85). Adding natural language processing-derived data to the models increased the variance explained across all clinical scenarios (observed increase in r2 = 12–46%).
Conclusions
EHR data offer the potential to improve routine clinical predictions by utilising previously inaccessible data. Of our scenarios, prediction of high service use after initial presentation achieved the highest performance.
The density of information in digital health records offers new potential opportunities for automated prediction of cost-relevant outcomes.
Aims
We investigated the extent to which routinely recorded data held in the electronic health record (EHR) predict priority service outcomes and whether natural language processing tools enhance the predictions. We evaluated three high priority outcomes: in-patient duration, readmission following in-patient care and high service cost after first presentation.
Method
We used data obtained from a clinical database derived from the EHR of a large mental healthcare provider within the UK. We combined structured data with text-derived data relating to diagnosis statements, medication and psychiatric symptomatology. Predictors of the three different clinical outcomes were modelled using logistic regression with performance evaluated against a validation set to derive areas under receiver operating characteristic curves.
Results
In validation samples, the full models (using all available data) achieved areas under receiver operating characteristic curves between 0.59 and 0.85 (in-patient duration 0.63, readmission 0.59, high service use 0.85). Adding natural language processing-derived data to the models increased the variance explained across all clinical scenarios (observed increase in r2 = 12–46%).
Conclusions
EHR data offer the potential to improve routine clinical predictions by utilising previously inaccessible data. Of our scenarios, prediction of high service use after initial presentation achieved the highest performance.
Original language | English |
---|---|
Article number | e10 |
Journal | BJPsych Open |
Volume | 6 |
Issue number | 1 |
DOIs | |
Publication status | Published - 17 Jan 2020 |
Keywords
- Digital health records
- Mental health service
- Natural language processing
- Prediction
Fingerprint
Dive into the research topics of 'Predicting high-cost care in a mental health setting'. Together they form a unique fingerprint.Projects
- 2 Finished
-
Linking electronic health records with passive smartphone activity data to predict outcomes in psychotic disorders
Patel, R. (Primary Investigator), McGuire, P. (Primary Investigator) & Curcin, V. (Primary Investigator)
14/02/2018 → 13/02/2021
Project: Research
-
Symptom dimensions in first episode psychosis: predicting clinical outcomes using natural language processing
Patel, R. (Primary Investigator)
3/10/2016 → 2/10/2018
Project: Research