TY - JOUR
T1 - Hypothesis-free phenotype prediction within a genetics-first framework
AU - Danovi, Davide
N1 - Funding Information:
We extend our deepest gratitude to all of the DTC participants from numerous countries who made this work possible, generously donating their time and sharing their personal and genetic data for no reward other than the satisfaction of contributing to medical research for the global good. The DDD study presents independent research commissioned by the Health Innovation Challenge Fund [grant number HICF-1009-003], a parallel funding partnership between Wellcome and the Department of Health, and the Wellcome Sanger Institute [grant number WT098051]. The views expressed in this publication are those of the author(s) and not necessarily those of Wellcome or the Department of Health. The study has UK Research Ethics Committee approval (10/H0305/83, granted by the Cambridge South REC, and GEN/284/12 granted by the Republic of Ireland REC). The research team acknowledges the support of the National Institute for Health Research, through the Comprehensive Clinical Research Network. This study makes use of data generated by the HipSci Consortium, funded by the Wellcome Trust and the MRC. We thank Sano Genetics for helping to recruit 557 participants. We thank the LMB visual aids department for help with the figures. We thank Jake Grimmett and Toby Darling for their help with high-performance computing. This work was supported by the Medical Research Council, as part of United Kingdom Research and Innovation (also known as UK Research and Innovation) [MC_UP_1201/14] and a grant from the Biotechnology and Biological Sciences Research Council [BB/N019431/1]. Kings College investigators are supported by the Wellcome Trust and MRC through the Human Induced Pluripotent Stem Cell Initiative [WT098503]. This work was partly supported by the Center for Research and Interdisciplinarity (CRI) Research Fellowship to Bastian Greshake Tzovaras, who also thanks the Bettencourt Schueller Foundation long term partnership.
Funding Information:
We extend our deepest gratitude to all of the DTC participants from numerous countries who made this work possible, generously donating their time and sharing their personal and genetic data for no reward other than the satisfaction of contributing to medical research for the global good. The DDD study presents independent research commissioned by the Health Innovation Challenge Fund [grant number HICF-1009-003], a parallel funding partnership between Wellcome and the Department of Health, and the Wellcome Sanger Institute [grant number WT098051]. The views expressed in this publication are those of the author(s) and not necessarily those of Wellcome or the Department of Health. The study has UK Research Ethics Committee approval (10/H0305/83, granted by the Cambridge South REC, and GEN/284/12 granted by the Republic of Ireland REC). The research team acknowledges the support of the National Institute for Health Research, through the Comprehensive Clinical Research Network. This study makes use of data generated by the HipSci Consortium, funded by the Wellcome Trust and the MRC. We thank Sano Genetics for helping to recruit 557 participants. We thank the LMB visual aids department for help with the figures. We thank Jake Grimmett and Toby Darling for their help with high-performance computing. This work was supported by the Medical Research Council, as part of United Kingdom Research and Innovation (also known as UK Research and Innovation) [MC_UP_1201/14] and a grant from the Biotechnology and Biological Sciences Research Council [BB/N019431/1]. Kings College investigators are supported by the Wellcome Trust and MRC through the Human Induced Pluripotent Stem Cell Initiative [WT098503]. This work was partly supported by the Center for Research and Interdisciplinarity (CRI) Research Fellowship to Bastian Greshake Tzovaras, who also thanks the Bettencourt Schueller Foundation long term partnership.
Publisher Copyright:
© 2023, The Author(s).
PY - 2023/2/17
Y1 - 2023/2/17
N2 - Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed 'rare', even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants influence disease and organism-level phenotypes. But here we show that additional discoveries can be made through a knowledge-based approach using protein domains and ontologies (function and phenotype) that considers all coding variants regardless of allele frequency. We describe an ab initio, genetics-first method making molecular knowledge-based interpretations for exome-wide non-synonymous variants for phenotypes at the organism and cellular level. By using this reverse approach, we identify plausible genetic causes for developmental disorders that have eluded other established methods and present molecular hypotheses for the causal genetics of 40 phenotypes generated from a direct-to-consumer genotype cohort. This system offers a chance to extract further discovery from genetic data after standard tools have been applied.
AB - Cohort-wide sequencing studies have revealed that the largest category of variants is those deemed 'rare', even for the subset located in coding regions (99% of known coding variants are seen in less than 1% of the population. Associative methods give some understanding how rare genetic variants influence disease and organism-level phenotypes. But here we show that additional discoveries can be made through a knowledge-based approach using protein domains and ontologies (function and phenotype) that considers all coding variants regardless of allele frequency. We describe an ab initio, genetics-first method making molecular knowledge-based interpretations for exome-wide non-synonymous variants for phenotypes at the organism and cellular level. By using this reverse approach, we identify plausible genetic causes for developmental disorders that have eluded other established methods and present molecular hypotheses for the causal genetics of 40 phenotypes generated from a direct-to-consumer genotype cohort. This system offers a chance to extract further discovery from genetic data after standard tools have been applied.
UR - http://www.scopus.com/inward/record.url?scp=85148348753&partnerID=8YFLogxK
U2 - 10.1038/s41467-023-36634-6
DO - 10.1038/s41467-023-36634-6
M3 - Article
SN - 2041-1723
VL - 14
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 919
ER -