523-individual ancient DNA study maps South Asian population formation
Narasimhan et al. 2019 in Science. Largest ancient DNA study of South/Central Asia. Maps formation of ANI and ASI populations. Agriculture in India developed independently.
Detailed Analysis
The Narasimhan et al. 2019 study, published in Science, analyzed ancient DNA from 523 individuals across South and Central Asia — the largest such study to date. Its findings reshaped the population genetics of the subcontinent. Key findings: 1. **Ancestral North Indians (ANI)**: Formed from a mixture of Indus Valley Civilization (IVC) people and Steppe pastoralists who migrated southward between 2,300 and 1,500 BCE. ANI ancestry is highest in upper-caste North Indians. 2. **Ancestral South Indians (ASI)**: Formed from a mixture of IVC people and indigenous peninsular hunter-gatherers (Ancient Ancestral South Indians, or AASI). ASI ancestry is highest in tribal populations and South Indians. 3. **IVC genetic profile**: A single individual from the IVC site of Rakhigarhi (Shinde et al. 2019, published simultaneously in Cell) showed no Steppe pastoralist ancestry and no Iranian farmer ancestry — her genetic profile is 'the primary source of ancestry in South Asia today.' 4. **Independent agriculture**: The data shows that South Asian agriculture developed independently, without requiring gene flow from Iranian farmers (previously hypothesized). The IVC people were genetically distinct from Iranian Neolithic populations. 5. **Steppe ancestry tracks Indo-Iranian language spread**: The Steppe-related ancestry that spread into South Asia after 2,300 BCE carries a genetic profile matching Bronze Age Eastern Europeans — the same profile associated with the spread of Indo-Iranian languages. David Reich described this as 'checkmate for the Anatolian hypothesis' (which proposed Indo-European languages spread from Turkey via farming). The study does not resolve whether language change preceded, accompanied, or followed genetic mixing. DNA proves population mixture timing but cannot determine what language the migrants spoke. This remains the central open question.
Methodology
Ancient DNA extraction and whole-genome sequencing from 523 individuals across South and Central Asia, spanning 8,000 years. ADMIXTURE analysis, qpAdm modeling, principal component analysis. Published in Science (2019). Complemented by Shinde et al. Cell (2019) for the Rakhigarhi individual.
Counter-Arguments & Responses
Vasant Shinde (co-author) interprets the Rakhigarhi data as disproving the Aryan Migration Theory entirely.
Nick Patterson (also co-author) disagrees, noting the larger dataset shows Steppe ancestry arrived after IVC decline. The data supports later migration, even if the single Rakhigarhi individual had no Steppe DNA. Both interpretations use the same data — the disagreement is about what 'migration' means for language and culture.
Source: Shinde et al. Cell (2019); Narasimhan et al. Science (2019)
Falsifiability Criteria
If future ancient DNA from IVC sites consistently shows Steppe ancestry at pre-2,000 BCE dates, the migration timeline would need revision. If IVC-era individuals are found with Iranian farmer ancestry, the independent-agriculture finding would weaken.