TY - JOUR
T1 - Creating a next-generation phenotype library
T2 - the health data research UK Phenotype Library
AU - Thayer, Daniel S.
AU - Mumtaz, Shahzad
AU - Elmessary, Muhammad A.
AU - Scanlon, Ieuan
AU - Zinnurov, Artur
AU - Coldea, Alex Ioan
AU - Scanlon, Jack
AU - Chapman, Martin
AU - Curcin, Vasa
AU - John, Ann
AU - Delpozo-Banos, Marcos
AU - Davies, Hannah
AU - Karwath, Andreas
AU - Gkoutos, Georgios V.
AU - Fitzpatrick, Natalie K.
AU - Quint, Jennifer K.
AU - Varma, Susheel
AU - Milner, Chris
AU - Oliveira, Carla
AU - Parkinson, Helen
AU - Denaxas, Spiros
AU - Hemingway, Harry
AU - Jefferson, Emily
N1 - Publisher Copyright:
© 2024 The Author(s). Published by Oxford University Press on behalf of the American Medical Informatics Association.
PY - 2024/7/1
Y1 - 2024/7/1
N2 - Objective: To enable reproducible research at scale by creating a platform that enables health data users to find, access, curate, and re-use electronic health record phenotyping algorithms. Materials and Methods: We undertook a structured approach to identifying requirements for a phenotype algorithm platform by engaging with key stakeholders. User experience analysis was used to inform the design, which we implemented as a web application featuring a novel metadata standard for defining phenotyping algorithms, access via Application Programming Interface (API), support for computable data flows, and version control. The application has creation and editing functionality, enabling researchers to submit phenotypes directly. Results: We created and launched the Phenotype Library in October 2021. The platform currently hosts 1049 phenotype definitions defined against 40 health data sources and >200K terms across 16 medical ontologies. We present several case studies demonstrating its utility for supporting and enabling research: the library hosts curated phenotype collections for the BREATHE respiratory health research hub and the Adolescent Mental Health Data Platform, and it is supporting the development of an informatics tool to generate clinical evidence for clinical guideline development groups. Discussion: This platform makes an impact by being open to all health data users and accepting all appropriate content, as well as implementing key features that have not been widely available, including managing structured metadata, access via an API, and support for computable phenotypes. Conclusions: We have created the first openly available, programmatically accessible resource enabling the global health research community to store and manage phenotyping algorithms. Removing barriers to describing, sharing, and computing phenotypes will help unleash the potential benefit of health data for patients and the public.
AB - Objective: To enable reproducible research at scale by creating a platform that enables health data users to find, access, curate, and re-use electronic health record phenotyping algorithms. Materials and Methods: We undertook a structured approach to identifying requirements for a phenotype algorithm platform by engaging with key stakeholders. User experience analysis was used to inform the design, which we implemented as a web application featuring a novel metadata standard for defining phenotyping algorithms, access via Application Programming Interface (API), support for computable data flows, and version control. The application has creation and editing functionality, enabling researchers to submit phenotypes directly. Results: We created and launched the Phenotype Library in October 2021. The platform currently hosts 1049 phenotype definitions defined against 40 health data sources and >200K terms across 16 medical ontologies. We present several case studies demonstrating its utility for supporting and enabling research: the library hosts curated phenotype collections for the BREATHE respiratory health research hub and the Adolescent Mental Health Data Platform, and it is supporting the development of an informatics tool to generate clinical evidence for clinical guideline development groups. Discussion: This platform makes an impact by being open to all health data users and accepting all appropriate content, as well as implementing key features that have not been widely available, including managing structured metadata, access via an API, and support for computable phenotypes. Conclusions: We have created the first openly available, programmatically accessible resource enabling the global health research community to store and manage phenotyping algorithms. Removing barriers to describing, sharing, and computing phenotypes will help unleash the potential benefit of health data for patients and the public.
KW - algorithms
KW - application programming interface
KW - electronic health records
KW - medical informatics
KW - phenotyping
KW - public health informatics
UR - http://www.scopus.com/inward/record.url?scp=85196218093&partnerID=8YFLogxK
U2 - 10.1093/jamiaopen/ooae049
DO - 10.1093/jamiaopen/ooae049
M3 - Article
AN - SCOPUS:85196218093
SN - 2574-2531
VL - 7
JO - JAMIA Open
JF - JAMIA Open
IS - 2
M1 - ooae049
ER -