Menu
Research Publication

Old vs. new local ancestry inference in HCHS/SOL: a comparative study.

Chen Xueying, X Wang, Hao H et al.

40485222 PubMed ID
18 Authors
2025-08-16 Published
0 Views
Scroll to explore
Chapter I

Publication Details

Comprehensive information about this research publication

Authors

CX
Chen Xueying
XW
X Wang
HH
Hao H
BI
Broce Iris
ID
I Dale
AA
Anders A
YB
Yu Bing
BZ
B Zhou
LY
Laura Y LY
LX
Li Xihao
XA
X Argos
MM
Maria M
DM
Daviglus Martha L
MC
ML Cai
JJ
Jianwen J
FN
Franceschini Nora
NS
N Sofer
TT
Tamar T
Chapter II

Abstract

Summary of the research findings

Hispanic/Latino populations are admixed, with genetic contributions from multiple ancestral populations. To uncover genetic associations in these populations, researchers often turn to admixture mapping, which relies on inferred counts of "local" ancestry, i.e. the source ancestral population at a locus. Local ancestries are inferred using external reference panels that represent ancestral populations, making the choice of inference method and reference panel critical. This study used a dataset of Hispanic/Latino individuals from the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) to evaluate how updates in local ancestry inference (LAI) affect results, specifically, the 'old' LAI performed using a popular inference method RFMix alongside 'new' inferences performed using Fast Local Ancestry Estimation (FLARE) with an updated reference panel. We compared their performance in terms of global and local ancestry correlations, as well as admixture mapping-based associations. Overall, the old and new inferences produced highly similar global and local ancestry estimates, with FLARE-based results closely matching those from RFMix in admixture mapping analyses. However, in some genomic regions, the old and new local ancestries showed relatively lower correlations (Pearson R < 0.9). Most of these regions (86.42%) were mapped to either ENCODE blacklist regions or gene clusters, compared to 7.67% of randomly-matched regions with high correlations (Pearson R > 0.97). These findings show that old and new inferences largely agree and suggest that regions of lower agreement are mostly due to genomic sequence contexts that lead to less stable inference, rather than due to the LAI software or genotyping technology used.

Chapter III

Analysis

Comprehensive review of ancestry and genetic findings

Important Disclaimer: This review has been performed semi-automatically and is provided for informational purposes only. While we strive for accuracy, this analysis may contain errors, omissions, or misinterpretations of the original research. DNA Genics disclaims all liability for any inaccuracies, errors, or consequences arising from the use of this information. Users should independently verify all information and consult original research publications before making any decisions based on this content. This analysis is not intended as a substitute for professional scientific review or medical advice.

Analysis In Progress

Our analysis of this publication is currently being prepared. Please check back soon for comprehensive insights into the ancestry and genetic findings discussed in this research.