The Story
The journey of mtDNA haplogroup L2*
Origins and Evolution
mtDNA haplogroup L2 is a major maternal lineage that arose during the Late Pleistocene in West/Central Africa, with a time to most recent common ancestor (TMRCA) on the order of ~70 kya for the broader L2 clade. The notation L2* (L2 star) refers to mtDNA sequences that belong to the L2 branch but cannot be assigned confidently to one of the defined downstream subclades (for example L2a, L2b, L2c, L2d) either because they are basal lineages or because the available sequence data lacks diagnostic mutations. As a basal component of the L2 phylogeny, L2* preserves signal of early diversification within the clade prior to the spread and differentiation that produced the more commonly sampled subclades.
Molecular-clock estimates and phylogeographic analyses indicate that populations carrying L2 lineages experienced both deep local diversification in West/Central Africa and later range expansions during the Holocene. Migration, gene flow, and demographic events such as the spread of agriculture and pastoralism affected the distribution and relative frequencies of L2 subclades across Africa.
Subclades (if applicable)
Although this entry focuses on L2* (undifferentiated/basal L2), the main recognized subclades of L2 include L2a, L2b, L2c, and L2d, with L2a typically the most frequent and geographically widespread. L2* sequences can represent:
- Truly basal lineages that predate the split into named subclades.
- Partially characterized samples lacking the full mitochondrial sequence necessary to place them in a subclade.
Identification of L2* therefore highlights either deep, uncommon lineages that escaped later radiations or limitations in sequence resolution. When whole-mitogenome data are available, many L2* samples can be resolved into known subclades, which refines phylogeographic inferences.
Geographical Distribution
L2 and L2* are concentrated in sub-Saharan Africa with the highest frequencies in West and Central Africa. Key distributional notes:
- West and Central Africa: High frequency and diversity, reflecting the likely origin and long-term population continuity in these regions.
- East Africa and the Horn: Moderate frequencies, often reflecting ancient gene flow, localized expansions, and later admixture; some L2 lineages (including L2a) are common among Horn populations.
- Southern Africa: Present at moderate to low levels, including detection in some Khoe-San and Bantu-speaking groups, consistent with Holocene migrations and localized contact.
- North Africa and the Middle East: Low frequencies, typically interpreted as the result of historical trans-Saharan and Mediterranean gene flow.
- The Americas and Caribbean: L2 lineages, including basal L2*, are present in African-descended populations due to the trans-Atlantic slave trade; these appear at variable frequencies depending on source populations of enslaved Africans.
L2* has been identified in multiple modern populations and in a small number of ancient DNA samples (noted here as 5 instances in the referenced database), providing direct temporal evidence of its historical presence.
Historical and Cultural Significance
Although mtDNA lineages alone cannot identify cultural groups, patterns of L2 diversity correlate with major demographic events in African prehistory and history:
- Bantu-associated expansions (Holocene/Iron Age): The demographic spread of Bantu-speaking, agriculturalist populations redistributed many maternal lineages across much of sub-Saharan Africa; some L2 subclades were carried along and increased in frequency in new regions.
- Pastoral and agropastoral expansions in East Africa: L2 lineages appear in pastoralist and farming communities, reflecting complex admixture between incoming and resident populations.
- Trans-Atlantic slave trade: L2 entered the Americas primarily during the last 500 years through forced migrations; its presence in Afro-descended populations today is a direct genetic legacy of those historical events.
The presence of L2* in ancient contexts, although currently limited in sample count, helps anchor the antiquity of L2 lineages in archaeological time and supports continuity and movement of maternal lineages in Africa over tens of thousands of years.
Conclusion
L2* signals either basal branches of the important African L2 maternal clade or unresolved lineages due to limited sequence data. Its highest diversity and frequency in West/Central Africa support an origin in that region during the Late Pleistocene, with later spread through Holocene demographic processes (including the Bantu expansion and historical admixture). Continued whole-mitogenome sampling and ancient DNA studies will improve resolution between L2* basal lineages and described L2 subclades, refining our understanding of maternal population history in Africa and the African diaspora.
Key Points
- Origins and Evolution
- Subclades (if applicable)
- Geographical Distribution
- Historical and Cultural Significance
- Conclusion