Creating a next-generation phenotype library: the health data research UK Phenotype Library

Daniel S. Thayer, Shahzad Mumtaz*, Muhammad A. Elmessary, Ieuan Scanlon, Artur Zinnurov, Alex Ioan Coldea, Jack Scanlon, Martin Chapman, Vasa Curcin, Ann John, Marcos Delpozo-Banos, Hannah Davies, Andreas Karwath, Georgios V. Gkoutos, Natalie K. Fitzpatrick, Jennifer K. Quint, Susheel Varma, Chris Milner, Carla Oliveira, Helen ParkinsonSpiros Denaxas, Harry Hemingway, Emily Jefferson

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Objective: To enable reproducible research at scale by creating a platform that enables health data users to find, access, curate, and re-use electronic health record phenotyping algorithms. Materials and Methods: We undertook a structured approach to identifying requirements for a phenotype algorithm platform by engaging with key stakeholders. User experience analysis was used to inform the design, which we implemented as a web application featuring a novel metadata standard for defining phenotyping algorithms, access via Application Programming Interface (API), support for computable data flows, and version control. The application has creation and editing functionality, enabling researchers to submit phenotypes directly. Results: We created and launched the Phenotype Library in October 2021. The platform currently hosts 1049 phenotype definitions defined against 40 health data sources and >200K terms across 16 medical ontologies. We present several case studies demonstrating its utility for supporting and enabling research: the library hosts curated phenotype collections for the BREATHE respiratory health research hub and the Adolescent Mental Health Data Platform, and it is supporting the development of an informatics tool to generate clinical evidence for clinical guideline development groups. Discussion: This platform makes an impact by being open to all health data users and accepting all appropriate content, as well as implementing key features that have not been widely available, including managing structured metadata, access via an API, and support for computable phenotypes. Conclusions: We have created the first openly available, programmatically accessible resource enabling the global health research community to store and manage phenotyping algorithms. Removing barriers to describing, sharing, and computing phenotypes will help unleash the potential benefit of health data for patients and the public.

Original languageEnglish
Article numberooae049
JournalJAMIA Open
Volume7
Issue number2
DOIs
Publication statusPublished - 1 Jul 2024

Keywords

  • algorithms
  • application programming interface
  • electronic health records
  • medical informatics
  • phenotyping
  • public health informatics

Fingerprint

Dive into the research topics of 'Creating a next-generation phenotype library: the health data research UK Phenotype Library'. Together they form a unique fingerprint.

Cite this