On the Evolution of the Southern Arc Hypothesis
There’s something rare and beautiful in science, and you don’t see it happen this publicly very often. In August 2022, a landmark trio of papers drops in Science under the banner of the Southern Arc. The lead author is Iosif Lazaridis, working with David Reich’s ancient DNA lab at Harvard. The big, splashy claim: the homeland of Indo-European languages - the entire family, including the Anatolian branch that gave us Hittite - lay south of the Caucasus. The evidence pointed to a genetic continuum running from eastern Anatolia through northern Mesopotamia into western Iran, a southern arc where the first Indo-Anatolian speech supposedly emerged. The steppe, in this model, was just a secondary staging ground for the later branches. News outlets run with it. Books get rewritten. The southern homeland looks settled.
Then, barely a year later, a preprint surfaces on bioRxiv in late 2023 titled "The Genetic Origins of the Indo-Europeans." Same lead author, Lazaridis. Same lab. But the conclusion has shifted dramatically. By the time the peer-reviewed version lands in Nature in early 2024, the southern arc as an ultimate origin is gone. The new root is placed further north, in something they call the Caucasus-Lower Volga cline, or CLV. This is a genetic and cultural gradient stretching from the northern foothills of the Caucasus into the lower Volga steppe, dated to roughly 4400-4000 BC. The deepest ancestor of all Indo-European languages, including Anatolian, was spoken right there, north of the mountains. The south didn't birth the family; it received an early offshoot.
The shift is not a small tweak. It’s a fundamental reorientation. In the 2022 model, Anatolian languages like Hittite were essentially stay-at-home southerners, remnants of a deep population that never left the original homeland. In the 2024 model, Anatolian is instead an early emigrant. The scenario now goes like this: the Proto-Indo-Anatolian community lived in the CLV cline. Sometime before 4000 BC, a group breaks south, crosses the Caucasus, and settles in eastern Anatolia. They carry the ancestor of Hittite, Luwian, and Palaic with them. Genetically, these people were mostly Caucasus-derived, without the full steppe ancestry package that would later define the Yamnaya. That explains why Bronze Age Hittites show zero steppe ancestry, a fact the 2022 papers had also established. The new model just flips the direction of movement and redefines the homeland.
What I love about this is how clearly it shows the self-correcting machinery of ancient DNA research. The data that forced the revision wasn't some ideological squabble; it was new samples and better models. The team kept finding that South Caucasus populations alone couldn't serve as the root for all the later Indo-European branches. The steppe groups didn't just spring from a fully southern source. They were an integral part of a cline that stretched north-south, and the linguistic root sat smack in the middle of that cline, not at its southern tip. The 2024 paper explicitly states that the South Caucasus was too genetically isolated and too diverged to be the fount of the entire language family. So the homeland walked north.
This has a pleasing symmetry. The Anatolian branch remains the first to split, the most archaic, the one that preserves sounds like the laryngeals that Saussure theorized decades before anyone dug up a Hittite tablet. But now its origin story mirrors its linguistic character: early, isolated, moving into Anatolia before the wagons and the kurgans and the massive steppe expansions, while the rest of the family stayed behind in the CLV cline and eventually gave rise to Yamnaya and all its thunderous consequences. The southern arc as a concept isn’t dead - it’s still the bridge - but it’s no longer the cradle.
For anyone following this stuff outside academia, the speed of the reversal is dizzying. Two years. That’s all it took for the same team to publish a blockbuster, then revisit their own evidence and publish a correction that reshuffles the map of the world’s largest language family. That’s not a sign of weakness. It’s what happens when the bones keep talking and you’re actually listening. The prehistory of language is being written and rewritten in real time, and the CLV cline is now the name you need to know, a corridor of grass and mountains north of the Caucasus where someone first said the words that would become "water," "night," and "daughter" in a hundred tongues.
An updated overview of aDNA relevant to Proto-Indo-European origins