GWAS Study

Multi-domain rule-based phenotyping algorithms enable improved GWAS signal.

Newbury A, Elhussein A, Gürsoy G

PubMed ID: 40753274
Study Type: GWAS
Participants: 405,811
Chapter I

Publication Details

Comprehensive information about this research publication

Authors

Newbury A
Elhussein A
Gürsoy G
Chapter II

Abstract

Summary of the research findings

Biobanks are a rich source of data for genome-wide association studies (GWAS). They store clinical data from electronic health records, with data domains such as laboratory measurements, conditions, and self-reported diagnoses. Traditionally, biobank GWAS utilize case-control cohorts built exclusively from conditions. However, because reported conditions are primarily collected for billing purposes, they face data quality issues. Consequently, incorporating additional data domains in cohort construction can improve cohort accuracy and GWAS results. Here, we assess the impact of various rule-based phenotyping algorithms on GWAS outcomes, examining factors such as power, heritability, replicability, functional annotations, and polygenic risk score prediction accuracy across seven diseases in the UK Biobank. We find that high complexity phenotyping algorithms generally improve GWAS outcomes, including increased power, hits within coding and functional genomic regions, and co-localization with expression quantitative trait loci. Our findings suggest that biobank-scale GWAS can benefit from phenotyping algorithms that integrate multiple data domains.

Sample: 405,811 individuals of European or unknown ancestry
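
The abstract contrasts cohorts built from conditions alone with cohorts built by rules that combine several data domains. The sketch below illustrates that contrast in Python and is not the authors' implementation. The `Participant` record, the example disease, the `E11` code prefix, the HbA1c threshold, and the "at least two domains agree" rule are all assumptions chosen for illustration; none of them come from the publication.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Participant:
    """Hypothetical record holding three data domains for one person."""
    condition_codes: set[str] = field(default_factory=set)      # coded conditions
    self_reported: set[str] = field(default_factory=set)        # self-reported diagnoses
    lab_values: dict[str, float] = field(default_factory=dict)  # measurement name -> value


# Illustrative rule parameters (assumptions, not taken from the publication).
CONDITION_PREFIX = "E11"            # ICD-10 prefix used here as an example
SELF_REPORT_TERM = "type 2 diabetes"
LAB_NAME = "hba1c_mmol_mol"
LAB_THRESHOLD = 48.0


def domain_evidence(p: Participant) -> dict[str, bool]:
    """Report, per data domain, whether that domain supports case status."""
    return {
        "conditions": any(c.startswith(CONDITION_PREFIX) for c in p.condition_codes),
        "self_report": SELF_REPORT_TERM in p.self_reported,
        "labs": p.lab_values.get(LAB_NAME, float("-inf")) >= LAB_THRESHOLD,
    }


def is_case_conditions_only(p: Participant) -> bool:
    """Low-complexity rule: case status from coded conditions alone."""
    return domain_evidence(p)["conditions"]


def is_case_multi_domain(p: Participant, min_domains: int = 2) -> bool:
    """Higher-complexity rule: require agreement from several domains."""
    return sum(domain_evidence(p).values()) >= min_domains


# Usage: a coded condition with no corroboration, versus corroborated evidence.
billing_only = Participant(condition_codes={"E11.9"})
corroborated = Participant(
    self_reported={"type 2 diabetes"},
    lab_values={"hba1c_mmol_mol": 52.0},
)
print(is_case_conditions_only(billing_only), is_case_multi_domain(billing_only))
print(is_case_conditions_only(corroborated), is_case_multi_domain(corroborated))
```

Under these assumed rules the two definitions disagree on both example participants, which is the kind of cohort difference whose downstream effect on GWAS outcomes the study evaluates.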

Chapter III

Study Statistics

Key metrics and study information

Total Participants: 405,811
Study Type: GWAS
Replicated: No
Ancestry: European, NR
Recruitment Country: U.K.