The Story
The journey of mtDNA haplogroup M5C
Origins and Evolution
mtDNA haplogroup M5C is a daughter lineage of the broader South Asian clade M5, itself a deep branch of macro-haplogroup M. While M5 has been estimated to originate around ~35 kya in the Indian subcontinent, M5C appears to have arisen later, plausibly during the Late PleistoceneāEarly Holocene transition (roughly 10ā15 kya). This timing is consistent with post-glacial demographic expansions and the increasing regional population structure in South Asia. As a subclade of M5, M5C preserves part of the ancient maternal heritage of the subcontinent and shows a pattern of localized diversity indicative of long-term presence and limited geographic spread compared with more widely distributed mtDNA lineages.
Subclades
M5C is an intermediate-level clade within the M5 phylogeny. Compared with some other M5 sublineages, M5C is relatively uncommon and shows limited, often regionally restricted substructure in published population surveys. A small number of deeper sublineages have been reported in high-resolution mtDNA sequencing studies, but overall M5C is characterized more by focal, localized occurrence than by extensive derived diversity across multiple regions.
Geographical Distribution
M5C is concentrated in South Asia, with highest observations in eastern and central parts of the Indian subcontinent and adjacent lowland Himalayan foothills. Reported occurrences are most frequent (relative to its rarity overall) among various caste and tribal groups in eastern India (including Bengal, Odisha, Jharkhand), among Bengali-speaking populations in Bangladesh and West Bengal, and in some Nepalese groups along the Terai and foothill regions. Low-frequency occurrences are documented in Pakistan (especially near eastern border regions), Sri Lanka, and among South Asian diaspora communities in the Middle East, Europe and North America. The distribution pattern ā localized but persistent ā is consistent with a lineage that originated locally and persisted within both sedentary and mobile populations over millennia.
Historical and Cultural Significance
Because M5C likely arose in the early Holocene, its carriers would have been part of the post-Last Glacial Maximum human populations of South Asia, contributing to the genetic background of later Mesolithic hunterāgatherers. During the Neolithic and subsequent Chalcolithic and Bronze Age periods, populations bearing M5C would have mixed with incoming or locally expanding agricultural groups, becoming integrated into the genetic make-up of diverse caste and tribal communities. Although no single archaeological culture can be uniquely tied to M5C, the lineage's persistence through time means it likely participated indirectly in major cultural transitions in South Asia ā from Mesolithic subsistence systems through the spread of agriculture and into the urbanized societies of the Bronze Age (e.g., the Indus cultural sphere) and later historical periods.
Conclusion
M5C is a regionally focused maternal lineage that represents a component of the deep South Asian mtDNA pool derived from the ancient M5 radiation. Its restricted geographic footprint and limited substructure make it particularly informative for studies of local population history, post-glacial demographic processes in South Asia, and the formation of modern regional maternal diversity. Continued high-resolution sequencing and dense regional sampling will help refine the internal branching of M5C and clarify its microgeographic history across the Indian subcontinent.
Key Points
- Origins and Evolution
- Subclades
- Geographical Distribution
- Historical and Cultural Significance
- Conclusion