Menu
GWAS Study

A Fast and Accurate Method for Genome-Wide Time-to-Event Data Analysis and Its Application to UK Biobank.

Bi W, Fritsche LG, Mukherjee B et al.

32589924 PubMed ID
GWAS Study Type
282871 Participants
Scroll to explore
Chapter I

Publication Details

Comprehensive information about this research publication

Authors

BW
Bi W
FL
Fritsche LG
MB
Mukherjee B
KS
Kim S
LS
Lee S
Chapter II

Abstract

Summary of the research findings

With increasing biobanking efforts connecting electronic health records and national registries to germline genetics, the time-to-event data analysis has attracted increasing attention in the genetics studies of human diseases. In time-to-event data analysis, the Cox proportional hazards (PH) regression model is one of the most used approaches. However, existing methods and tools are not scalable when analyzing a large biobank with hundreds of thousands of samples and endpoints, and they are not accurate when testing low-frequency and rare variants. Here, we propose a scalable and accurate method, SPACox (a saddlepoint approximation implementation based on the Cox PH regression model), that is applicable for genome-wide scale time-to-event data analysis. SPACox requires fitting a Cox PH regression model only once across the genome-wide analysis and then uses a saddlepoint approximation (SPA) to calibrate the test statistics. Simulation studies show that SPACox is 76-252 times faster than other existing alternatives, such as gwasurvivr, 185-511 times faster than the standard Wald test, and more than 6,000 times faster than the Firth correction and can control type I error rates at the genome-wide significance level regardless of minor allele frequencies. Through the analysis of UK Biobank inpatient data of 282,871 white British European ancestry samples, we show that SPACox can efficiently analyze large sample sizes and accurately control type I error rates. We identified 611 loci associated with time-to-event phenotypes of 12 common diseases, of which 38 loci would be missed within a logistic regression framework with a binary phenotype defined as event occurrence status during the follow-up period.

76,566 British ancestry cases, 206,305 British ancestry controls

Chapter III

Study Statistics

Key metrics and study information

282871
Total Participants
GWAS
Study Type
No
Replicated
European
Ancestry
U.K.
Recruitment Country
Chapter IV

Analysis

Comprehensive review of health and genetic findings

Important Disclaimer: This review has been performed semi-automatically and is provided for informational purposes only. While we strive for accuracy, this analysis may contain errors, omissions, or misinterpretations of the original research. DNA Genics disclaims all liability for any inaccuracies, errors, or consequences arising from the use of this information. Users should independently verify all information and consult original research publications before making any decisions based on this content. This analysis is not intended as a substitute for professional scientific review or medical advice.

Analysis In Progress

Our analysis of this publication is currently being prepared. Please check back soon for comprehensive insights into the health and genetic findings discussed in this research.