For what they were... we are: Ancient genomes from Neolithic West Asia

June 26, 2016

Ancient genomes from Neolithic West Asia

This week we got to know a lot more about the genetics of ancient West Asians, from the Mesolithic, Neolithic and later times. All in a single major study:

Iosif Lazaridis et al., The genetic structure of the world's first farmers. BioRxiv 2016. Freely accessible (pre-pub) → LINK [doi: http://dx.doi.org/10.1101/059311]

Abstract

We report genome-wide ancient DNA from 44 ancient Near Easterners ranging in time between ~12,000-1,400 BCE, from Natufian hunter-gatherers to Bronze Age farmers. We show that the earliest populations of the Near East derived around half their ancestry from a 'Basal Eurasian' lineage that had little if any Neanderthal admixture and that separated from other non-African lineages prior to their separation from each other. The first farmers of the southern Levant (Israel and Jordan) and Zagros Mountains (Iran) were strongly genetically differentiated, and each descended from local hunter-gatherers. By the time of the Bronze Age, these two populations and Anatolian-related farmers had mixed with each other and with the hunter-gatherers of Europe to drastically reduce genetic differentiation. The impact of the Near Eastern farmers extended beyond the Near East: farmers related to those of Anatolia spread westward into Europe; farmers related to those of the Levant spread southward into East Africa; farmers related to those from Iran spread northward into the Eurasian steppe; and people related to both the early farmers of Iran and to the pastoralists of the Eurasian steppe spread eastward into South Asia.

Highlights:

There were (at least) two clearly distinct populations in West Asia in the Mesolithic and Early Neolithic times.
Both populations contributed to the West Anatolian farmers that are precursors of the settlers of Neolithic Europe.
It is not yet clear whether the so-called "Basal Eurasian" component is something local, admixture with Africans, or both. However, it is clearly associated with reduced Neanderthal admixture.
West Eurasian genetic composition can now be understood quite well as a mixture of four sources: two West Asian ones, favored by the Neolithic revolution, and two Paleo-European ones.

This graphic shows pretty well how the ancient populations of West Eurasia are expressed as a mixture of those four founder populations:

That is if you can get through the nomenclature, which is inherited in many cases from a long array of recent studies. In many cases I'm not even sure myself exactly which samples, and from where, are thrown into each category. But the most important part is that Iran_N and Levant_N are the two Neolithic-specific founder populations of the Fertile Crescent (yeah, N stands for "Neolithic", not "North") and that the other two founder populations from pre-Neolithic Europe are WHG (Epi-Magdalenian peoples from Western and Central Europe) and EHG (Eastern European hunter-gatherers, of Epigravettian culture and maybe even proto-Uralic in one case).

Then we see in the case of Europe how:

1. Anatolia_N (precursors of the mainline European Neolithic) are a mix of both West Asian farmer groups, plus an already sizable fraction of Western Paleoeuropean ancestry.

2. This fraction of Western Paleoeuropean ancestry increases as the farmers expanded into Europe (EN), and again later, probably through some backflow of Western origin related to Megalithism and Bell Beaker (MNChL). But in general the basic genetic composition remains the same, and in no known case does it yet incorporate any Eastern Paleoeuropean component.

3. It is only with the Indoeuropean ("Kurgan") invasions, reflected in the LNBA category, that the EHG component becomes very important in Europe. If I'm correct, all those samples are from Germany and other areas of Central and North Europe, with the Iberian and Italian ones of similar chronology placed under the MNChL tag instead. The LNBA/MNChL contrast is not a strictly chronological analysis but an analysis by categories of ancestry that overlap in time.

4. In Armenia instead, we see a decrease of the minor EHG component but then an increase in the MLBA ("middle and late Bronze Age") ~~when Armenians arrive from the Balcans and Phrygia~~, ~~conquering the pre-existing Hurro-Urartean peoples (whose language was probably related to Chechen and other NE Caucasian languages)~~, which should correspond to the formation of Urartu and more specifically to the Hayasa-Azzi and Shupria stages, both considered Urartean (Hurrian). The WHG and Levant-N components we see since the Chalcolithic are similar to what we see in West Anatolia and probably reflect interactions corresponding to Central-Eastern Anatolia, Kurdistan and Syria, for which we have no direct ancient data yet.

Ancient samples (colored and labeled) projected on a PCA of modern West Eurasian populations (in gray):

To identify the modern populations shown in gray, a good reference is this older but fully labeled PCA by Olalde.

Briefly: Natufians fall on top of modern Palestinians, their slightly admixed Neolithic descendants fall between Palestinians and Jews, Middle Neolithic European Farmers fall on top of Sardinians, the so-called Europe-Steppe continuum (early Western Indoeuropeans) falls between Central Europe, France and the Balkans, most Western Europeans do not overlap with ancient samples but appear to have even greater Paleoeuropean admixture instead, etc.

Y-DNA Haplogroups

Iranian Mesolithic and Neolithic samples carried the following patrilineages:

Mesolithic: J(xJ2a1b3,J2b2a1a1)
Ganj Dareh Neolithic: P1(xQ,R1b1a2,R1a1a1b1a1b,R1a1a1b1a3a,R1a1a1b2a2a) and an undefined CT
Late Neolithic: G2a1(xG2a1a)

Meanwhile Palestinian Mesolithic and Neolithic samples carried:

Natufian (Mesolithic): E1b1b1b2(xE1b1b1b2a,E1b1b1b2b), E1b1(xE1b1a1,E1b1b1b1), E1b1b1b2(xE1b1b1b2a,E1b1b1b2b), plus two undefined CT.
Pre-Pottery Neolithic B/C: H2, E(xE2,E1a,E1b1a1a1c2c3b1,E1b1b1b1a1,E1b1b1b2b), E1b1b1, T(xT1a1,T1a2a), E1b1b1(xE1b1b1b1a1,E1b1b1a1b1,E1b1b1a1b2,E1b1b1b2a1c), plus three ill-defined CT.

CT is the main pan-Eurasian macro-haplogroup and is not informative, except in Palestine because it implies exclusion of E.

Otherwise we see an important presence of E (mostly E1b1b), a lineage we know was carried by early farmers into Europe and that has ultimately African origins. It probably indicates migration of NE Africans into Palestine in the Mesolithic, something also supported by Archaeology. However these NE Africans were surely already mixed with Eurasian ancestry, which probably arrived in the Nile Basin in the early LSA, some 50-40 Ka ago. So it's a complex story of multiple admixture events in the continental crossroads that is Egypt and also Palestine and other nearby areas.

We also see G2a1 in Late Neolithic Iran, and this one is the main lineage brought to Europe by the early farmers if we are to judge from known ancient sequences (today it is not more important than E1b but it is maybe more evenly distributed). However we only see it in the Late Neolithic, so it may have originated further west.

We see too little J: only J(xJ1a,J2a1,J2b) in Chalcolithic Iran and, in Bronze Age Jordan, J(xJ1,J2a,J2b2a) again and J1(xJ1a). I guess that a lot remains to be researched on this issue because J is nowadays by far the most common haplogroup of West Asia, and also impacted Europe and South Asia (J2) and North and NE Africa (J1).

On the issue of "Basal Eurasian": African or West Asian?

The question remains unanswered, as I said before, but there are two clues. On one side, the presence of E1b in Mesolithic and Neolithic Palestine clearly supports a direct NE African influence, also backed by archaeological evidence. But there is some nuance in the issue of FST distances that I want to highlight.

The distances are available in a very extensive supplementary table, so I took just a few to get a better understanding, not only of this issue but in general of the genetic distances of the four founder populations:

Quite ironically it is not the Natufians who are the closest to the African reference population (Yoruba) but the CHG, Iran-N and Levant-N groups. In fact the Natufians are the most distant ones after the WHG population. However this is tricky because the affinity to Yoruba may also be caused by the "ghost" Basal Eurasian population, claimed first of all by Lazaridis 2014, which would be a remnant of the Out of Africa Migration (not strictly African but close enough and impossible to discern from true African admixture in most analyses).

So we may imagine that the "Highlander" (CHG and Iran-N) populations were somehow influenced by that Basal Eurasian ghostly population, which might have survived in the Persian Gulf oasis, for example. Or whatever else.

The presence of the same or similar element in Levant-N reflects possibly admixture with Iran-N or a similar population, something that is implicit in the table above but I'll address below more explicitly.

If there is (and there must be, because of Y-DNA E1b) some African admixture in the Natufian population, it was very diluted already in the autosomal (general DNA) aspect before farming began.

Update (Jul 2): all the four paragraphs above are possibly misleading to some extent because, as several commenters have rightfully pointed out, genetic drift alone increases the distance to general reference populations like Yoruba and Han. This genetic drift is caused by relative isolation, so it seems that Magdalenian Europeans (WHG) and Natufian Palestinians (Natufian) were both more isolated populations in general terms than the Iran-Caucasus-Eastern Europe ones, whose sheer numbers apparently kept them less endogamous and more similar to the generic root of Humankind.

However, per archaeology, such "sheer numbers" are not to be expected in that area, rather the opposite (Western Europe and Palestine are much richer areas in archaeological terms, suggesting denser populations). So the question remains open as far as I can tell, but it should be discerned with more precise tools than mere FST.
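To make the commenters' point about drift concrete, here is a minimal sketch of a toy Wright-Fisher simulation. It is not anything from the study: the population sizes, number of loci, number of generations and the simplified ratio-of-sums FST (without sample-size correction) are all assumptions chosen for illustration. Three populations start from the same ancestral allele frequencies and never exchange genes, yet the small one ends up with a clearly larger FST to the reference than the large one does.

```python
import random

def drift(freq, n_chrom, generations, rng):
    """Toy Wright-Fisher drift: resample n_chrom chromosomes every generation."""
    for _ in range(generations):
        copies = sum(rng.random() < freq for _ in range(n_chrom))
        freq = copies / n_chrom
    return freq

def fst(p, q):
    """Simplified ratio-of-sums FST from allele frequencies (no sample-size correction)."""
    num = sum((a - b) ** 2 for a, b in zip(p, q))
    den = sum(a * (1 - b) + b * (1 - a) for a, b in zip(p, q))
    return num / den

rng = random.Random(1)   # fixed seed so the run is repeatable
GENERATIONS = 20         # hypothetical value
ancestral = [rng.uniform(0.1, 0.9) for _ in range(100)]  # 100 hypothetical loci

# All three populations descend independently from the same ancestral pool.
reference = [drift(f, 500, GENERATIONS, rng) for f in ancestral]  # large reference
large_pop = [drift(f, 500, GENERATIONS, rng) for f in ancestral]  # large, little drift
small_pop = [drift(f, 50, GENERATIONS, rng) for f in ancestral]   # small, isolated

fst_large = fst(large_pop, reference)
fst_small = fst(small_pop, reference)
print(f"FST large vs reference: {fst_large:.3f}")
print(f"FST small vs reference: {fst_small:.3f}")
```

The small population is no more "related" to anything than the large one; it just drifted more. That is why raw FST to Yoruba or Han cannot by itself separate isolation from admixture.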

A visual of the smallest genetic distances between populations (each "-" represents 0.01 of FST in the table above):

a) Ancient West Asians:

CHG-----IrN-------LeN----Nat

Neolithic peoples of West Asia, even if different, are closer among them than their pre-Neolithic precursors.

b) Pre-Neolithic West Eurasians:

WHG--------EHG----------CHG--------------Nat

The distances between Natufians and everyone else are comparable to those with Han Chinese, however only in the case of the populations that appear to have extra affinity to East Asia (Iran, Caucasus and Eastern Europe), otherwise it is smaller.

All four populations were distant enough from each other to be considered clearly distinctive. Even EHG and WHG were quite dissimilar.

c) The four West Eurasian founders considered above:

WHG--------EHG----------IrN-------LeN

There is much greater similitude between Iran and Levant Neolithic peoples than between their Mesolithic precursors. This implies some sort of intense admixture as agriculture and herding developed. Not enough to erase the differences but enough to blur them significantly.
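For anyone wanting to redraw the dash diagrams above, this is a small sketch of how they can be generated. The function name is mine, and the distances are simply read back by counting dashes in the diagrams (so they are rounded to 0.01 and my count could be off by one); they are not copied from the supplementary table.

```python
def dash_chain(labels, distances, unit=0.01):
    """Join population labels with one '-' per `unit` of distance between neighbours."""
    parts = [labels[0]]
    for label, dist in zip(labels[1:], distances):
        parts.append("-" * round(dist / unit) + label)
    return "".join(parts)

# Approximate distances recovered from the diagrams, not from the original table.
print(dash_chain(["CHG", "IrN", "LeN", "Nat"], [0.05, 0.07, 0.04]))
print(dash_chain(["WHG", "EHG", "CHG", "Nat"], [0.08, 0.10, 0.14]))
print(dash_chain(["WHG", "EHG", "IrN", "LeN"], [0.08, 0.10, 0.07]))
```

Note that such a chain only shows the distance between neighbours; it says nothing about, say, WHG to Nat directly, because FST distances do not simply add up along the line.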

Genetic influence from East Asia or a related population is also apparent in all Northeastern populations but even more so in Iran Neolithic. Why?

There is much more in the study and supp. materials but I can only review so much.

85 comments:

Kristiina, June 26, 2016 at 9:19 AM
What do you think about EBA R1b? I find it very intriguing that this sample is less EHG than the Chalcolithic Armenians and MLBA Armenians. Moreover, he has some extra WHG.

According to Genetiker MLBA Armenian yDNAs are: R1b1a2, E1b1b1b2a1a-L788, R1b1a2a2, J2b2a-Z590, E1b1b1b2a1a-FGC18319/Y5413.

So you argue that “In Armenia instead, we see a decrease of the minor EHG component but then an increase in the MLBA ("middle and late Bronze Age") when Armenians arrive from the Balcans and Phrygia, conquering the pre-existing Hurro-Urartean peoples (whose language was probably related to Chechen and other NE Caucasian languages).”

If you take a look at the admixture chart on p. 16 in The Caucasus as an asymmetric semipermeable barrier to ancient human migrations, you see that Armenians are very much Neolithic Near East. They have less North European ancestry than Adyge, Chechens, Lezgins, Balkars, Kumyks and Nogays. Also Georgians and Abkhazians have little North European ancestry but compared to them, Armenians have light blue “Bedouin ancestry”. On the basis of their genetic makeup, Armenians could indeed have come from Anatolia and the Balkans.

In "Neolithic patrilineal signals indicate that the Armenian plateau was repopulated by agriculturalists", it is found that:
“Of the lineages within haplogroup R, the largely Near Eastern R1b1b1*-L23 predominates in Ararat Valley, Gardman and Lake Van (33%, 31% and 32%, respectively). Furthermore, in Ararat Valley we find five individuals belonging to the paraphyletic haplogroup R1b1b*-M269. The Sasun collection, meanwhile, contains comparable distributions of haplogroups R1b1b1*-L23 (15%) and R2-M124 (17%).”
(http://www.nature.com/ejhg/journal/v20/n3/full/ejhg2011192a.html)

Time estimate for Armenian R1b is c. 6 kya, so it would be in the Chalcolithic time frame. (http://www.nature.com/ejhg/journal/v20/n3/fig_tab/ejhg2011192t3.html#figure-title)

However, R1b-L23 is typical of Yamna, so Armenian R1b-L23 may well have originated north of Caucasus but the situation is still puzzling: Anatolia ChL is very low on EHG and if Armenians arrived from Anatolia during the Chalcolithic or Early Bronze Age, they probably were very low on EHG and I doubt that R1b-L23 existed there at that time. If it is true that Armenians arrived in the MLBA and were more EHG, I would argue that they came from the North and not from Anatolia.
Shaikorth, June 26, 2016 at 12:05 PM
By the way, relative to Yoruba Iran N is no more shifted towards Han than European Neolithics are.

Fst to Han / Fst to Yoruba:
Anatolia_N 0.821
Europe_EN 0.806
Iran_N 0.804
BedouinA 0.881
Natufian 0.946
Something is causing European, Anatolian and Iranian neolithics to be Han-shifted relative to BedouinA and Natufians or something is causing BedouinA and Natufians to be more African-shifted in comparison to Han and the Neolithics. With BedouinA the reason should be as f3 tests say.
Shaikorth, June 26, 2016 at 2:56 PM
Karitiana are very drifted, yes, and they serve as an example of how much drift alone can elevate absolute fst.

In the case of BedouinA, f3 mixture tests and so on show SSA. For Natufians it doesn't need to be Yoruba per se, but could be something that has more affinity to Africans than to Eurasians. If Natufians' fst was elevated by something with no affinity to Han and Yoruba relative to European Neolithic and Iran Neolithic (which are very different from each other but have the same relative position re: Han and Yoruba) their relative position should not shift towards Yoruba.
Gioiello, June 27, 2016 at 8:49 AM
@ Maju
“Or why not to be ancestral to Volga R1b? After all East European ancient samples (Steppe-gibberish ones) all have strong "Iran-N" component. What is clear is that R1b expanded from West Asia in several directions and that we haven't found it was in Europe before Neolithic. However we haven't found it in West Asia either, not yet, so all options are open”.

Let's speak again after that Tyrrhenian Italy is tested. No one, except me, thought that R1b1a would have been found in Italy (Villabruna 14000 years ago, WHG, very likely the tribe ancestress of all the R1b1-L389+ in Europe, not counting R-V88 oldest in Sardinia/Italy and not found elsewhere).
I am waiting that also R1a-M420* is found near Villabruna.
Olympus Mons, June 27, 2016 at 2:50 PM
@Gioiello,
Isn't R1b1 too close to the root for its being found in Villabruna, Italy, to mean anything? R1b1 is probably close to 18,000 years old, so it can be found anywhere, right? It's not like R1b was hiding under a rock for so many thousands upon thousands of years.

Wasn't it the one found in Kura-araxes P25? so a close call to P297 therefore a step away from M269?
Olympus Mons, June 27, 2016 at 3:05 PM
@Kristiina,
Regarding your last paragraph, and the fluctuations of EHG here is my alternate story....

By 7,000 BC a population arrive in southern Caucasus with EHG and mixed with CHG.
By 5000 BC the Ubaid changes admix all of them, giving the levant and EHG component to Iran_C, to Anatolia_C , Armenia_C; etc
By 4000/3500 BC they (south Caucasus) had crossed (already with L23) to north Caucasus and giving the Levant component to Steppe and reducing the EHG in the steppe.

See it’s a matter of time. And also It was south Caucasus moving into steppe (mostly) and not the other way around..

Just how I see it.

Ryan, June 27, 2016 at 7:26 PM
"Genetic influence from East Asia or a related population is also apparent in all Northeastern populations but even more so in Iran Neolithic. Why?"

There are sundadont remains in Neolithic Mehrgarh. So perhaps that is the vector.
Samuel Andrews, June 28, 2016 at 4:12 AM
Anatolia Neolithic can't be modeled well as a mixture of older genomes. It is not a mixture of Natufian, Iran Neolithic, and "WHG". It might have mixture from people similar to those three but it can't be fully explained as a mixture of them.

Instead, Levant Neolithic actually looks like a mixture of Natufian and Anatolia Neolithic. Also, Iran_Chalcolithic and Armenia Chalcolithic look like they have Anatolia Neolithic ancestry. So Anatolia_Neolithic definitely has ancestry from, or is fully descended from, an unsampled Paleo population that affected all those regions.
Gioiello, June 28, 2016 at 11:43 AM
Olympus Mons, what you say could be right in general, but...
1) When I said that, I was refused because for the levantinists it wasn't true: R1b would have come from Middle East, thus not present in Europe before Neolithic, i.e. before the "colonization" of Europe from the Middle Easterner agriculturalists, and, as also Maju said, this line has been demonstrated false.
2) I gave also a political-economical explanation about that, indicating who funded and funds those presumed scientific researches and the known firm at the center of that. For that I have a general theory and not sporadic observations. Of course Maju is looking at that from his left side which isn't mine.
3) But for speaking only from a scientific point of view, one thing is having found an R1b1a (pre-P297) 14000 years ago in Villabruna, in what I am saying from at least ten years was the "Italian Refugium", and another thing is having found an R1b1, perhaps also L389+, in Kura Araxes of pretty 10000 years later, when I have explained from so long that Caucasus has only one cluster of R1b1-L389+, that with YCAII=21-23, whereas Italy has at least three (18-22, 18-23, 23-23), and the subclades have YCAII=19-23, derived from the Italian/Western European samples and not from the Caucasian ones.
4) The rest is in the more than 10000 letters I wrote.
Kristiina, June 28, 2016 at 7:33 PM
Olympus, I do not have much of a theory here. I am mostly observing.

I am curious to know if your idea is that R1b-L278 (mostly EHG) came from the Steppe to Southern Caucasus 7000 BC and gave rise to R1b-269 and the proto-IE language? Maybe you are not making any presumptions on language, but I would however be interested to know your linguistic suggestions for Ubaid. Personally, I am not convinced that proto-IE originated in southern Caucasus, but the origin of R1b-269 there is not impossible, if we presume that R1b-V88, Villabruna and M73 are separate developments.

However, after Villabruna, as long as we do not have any contrary evidence, an Eastern Mediterranean origin is even more plausible for R1b. Of course, everything depends on what will be discovered in the future. Maybe your idea will be proven right, maybe not.

Among other things, the origin of yDNA J, in particular J2b, and mtDNA H is still puzzling me.
Gioiello, June 30, 2016 at 4:30 PM
Genetiker says that the R1b from Kura-Araxes is an R1b1a1b positive for V1956 (Y: 8430895 G>A)
Sample ID HG 8430895
REFSEQ G

Also the sample from 1KGP from Puerto Rico is positive for V1956 and all the subclades from R-P297 are negative
HG00640 R-L389* A
HG00640 20120522 PUR R-L389*
L388/PF6468 • L389/PF6531 B2b

The sample from Puerto Rico is very likely of the Iberian R-L389 haplotypes and all the other known R-L389 which are most diffused and with the highest variance in Italy, thus the Caucasian sample from Kura-Araxes, as I am saying from so long, belongs very likely to the Caucasian R-L389 with YCAII=23-23 which is presupposed derived through a RecLOH from the basal 18-22 and 18-23 diffused in Italy, which are a sister clade of the R-L389 ancestor of R-P297. But the Villabruna sample of 14000 years ago has two certain mutations at the P297 level, thus it is more likely of the family which brought to the R-P297* and subclades. Anyhow no doubt that my theory of an “Italian Refugium” adds another proof. Let's go on.

Aram, July 29, 2016 at 10:34 AM
Maju

What You know about Y dna of Subartu Hayasa and Urartu? I suppose not much otherwise You wouldn't lump three different things into the same basket.
As for Armenians and Phrygian. You can't cite any recent genetic paper suggesting such a thing. Please read Brittanica and find any mention of Balkans. So please update Your theories about Armenian ethnogenesis.

https://www.britannica.com/topic/Armenian-language
Unknown, August 21, 2016 at 3:20 AM
I know you hate talking about this subject, but what are the implications of this study for the origins of Ashkenazi Jews and modern-day Palestinians relative to the original Levant population?
Joshua Lipson, August 24, 2016 at 1:38 AM
The greater discontinuity might actually be Early Bronze <-> Middle/Late Bronze, rather than Middle/Late Bronze <-> Iron. I think archaeologists usually refrain from calling the population of the Early Bronze Age "Canaanite".

re: your last comment — any particular insights you have into Palestinian regional (or religious) differences, even across poorly-labeled datasets? I've never seen any good treatment of this topic.
LaMar McNeil, August 27, 2016 at 8:36 AM
So where exactly do Arabians like Saudis and Yemenis fit into all of this? Not to mention North Africans like Egyptians and Berbers (especially Berbers)? Are Arabians descendants of the Levant farmers or something else?
blogmaster, November 14, 2016 at 4:09 AM
What this study labels as Neolithic Anatolian is, in fact, predominantly Iranian. So the study is essentially suggesting, though not explicitly, that Iranians moved not just northwards into the Steppe and into India, but also into Europe, through Anatolia. Still I'm a bit confused: another study, with remains of a single Neolithic woman from Ganj Dareh, in NW Zagros, shows her to be quite distinct from Anatolian Neolithic samples from the far West of Turkey, and from early farmers of Europe. That was likely an illusion which resulted from using only a single Iranian sample, and excluding Neolithic samples from more Eastern parts of Turkey.
Tminus, November 14, 2016 at 3:56 PM
In the body of your post, in the image titled "This graphic shows pretty well how the ancient populations of West Eurasia are expressed as a mixture of those four founder populations", the Anatolian_N carries a sizable Iranian_N component, in red. Am I not interpreting that correctly? It would make sense, because high resolution global PCAs do not suggest modern Iranians and Turks are highly distinct populations. In fact, their position is usually adjacent (though Iranians are more heterogeneous), along with southern Caucasus populations. I am suggesting that what they refer to as "European/Anatolian farmers" is essentially a mix of Iran_N + Levant_N + WHG (through backflow). Also, the Steppe had a significant Iranic component, and through Bronze Age waves, contributed to the later peopling of Europe.