Evaluating multi-ancestry genome-wide association methods: Statistical power, population structure, and practical implications.
Dias JA, Chen T, Xing H et al.
Publication Details
Comprehensive information about this research publication
Abstract
Summary of the research findings
The increasing availability of diverse biobanks has enabled multi-ancestry genome-wide association studies (GWASs) to enhance the discovery of genetic variants across traits and diseases. However, the choice of an optimal method remains debated, due to challenges in statistical power differences across ancestral groups and approaches to account for population structure. Two primary strategies exist: (1) pooled analysis, which combines individuals from all genetic backgrounds into a single dataset while adjusting for population stratification using principal components, increasing the sample size and statistical power but requiring careful control of population stratification; and (2) meta-analysis, which performs ancestry-group-specific GWASs and subsequently combines summary statistics, potentially capturing fine-scale population structure but facing limitations in handling admixed individuals. Using large-scale simulations with varying sample sizes and ancestry compositions, we compare these methods alongside real data analyses of eight continuous and five binary traits from the UK Biobank (N ≈ 324,000) and the All of Us Research Program (N ≈ 207,000). Our results demonstrate that pooled analysis generally exhibits better statistical power while effectively adjusting for population stratification. We further present a theoretical framework linking power differences to allele-frequency variations across populations. These findings, validated across both biobanks, highlight pooled analysis as a powerful and scalable strategy for multi-ancestry GWASs, improving genetic discovery while maintaining rigorous population structure control.
22,348 European ancestry cases, 288,704 European ancestry controls, 37 admixed American ancestry cases, 552 admixed American ancestry controls, 539 African ancestry cases, 6,324 African ancestry controls, 30 East Asian ancestry cases, 555 East Asian ancestry controls, 575 South Asian ancestry cases, 5,158 South Asian ancestry controls
Study Statistics
Key metrics and study information
AI-Generated Summary
AI-generated by DNAGENICSIndependent AI summary of health and genetic findings from the published study
Important: This summary is AI-generated by DNAGENICS for informational purposes only. It was not created by, affiliated with, or endorsed by the researchers behind the original publication, and is based solely on that published research. It may contain errors or omissions. DNAGENICS disclaims all liability for any inaccuracies or consequences arising from use of this information. Verify all information against the original publication. This is not professional scientific review or medical advice.
AI Summary In Progress
Our AI-generated summary of this publication is being prepared. Please check back soon.