Email updates

Keep up to date with the latest news and content from Investigative Genetics and BioMed Central.

This article is part of the series Human evolutionary genomics.

Open Access Highly Accessed Research

Modeling the contrasting Neolithic male lineage expansions in Europe and Africa

Michael J Sikora1, Vincenza Colonna12, Yali Xue1 and Chris Tyler-Smith1*

Author Affiliations

1 The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK

2 Institute of Genetics and Biophysics, National Research Council (CNR), 80125, Naples, Italy

For all author emails, please log on.

Investigative Genetics 2013, 4:25  doi:10.1186/2041-2223-4-25


The electronic version of this article is the complete one and can be found online at: http://www.investigativegenetics.com/content/4/1/25


Received:16 August 2013
Accepted:21 October 2013
Published:21 November 2013

© 2013 Sikora et al.; licensee BioMed Central Ltd.

This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Background

Patterns of genetic variation in a population carry information about the prehistory of the population, and for the human Y chromosome an especially informative phylogenetic tree has previously been constructed from fully-sequenced chromosomes. This revealed contrasting bifurcating and starlike phylogenies for the major lineages associated with the Neolithic expansions in sub-Saharan Africa and Western Europe, respectively.

Results

We used coalescent simulations to investigate the range of demographic models most likely to produce the phylogenetic structures observed in Africa and Europe, assessing the starting and ending genetic effective population sizes, duration of the expansion, and time when expansion ended. The best-fitting models in Africa and Europe are very different. In Africa, the expansion took about 12 thousand years, ending very recently; it started from approximately 40 men and numbers expanded approximately 50-fold. In Europe, the expansion was much more rapid, taking only a few generations and occurring as soon as the major R1b lineage entered Europe; it started from just one to three men, whose numbers expanded more than a thousandfold.

Conclusions

Although highly simplified, the demographic model we have used captures key elements of the differences between the male Neolithic expansions in Africa and Europe, and is consistent with archaeological findings.

Keywords:
Human Y chromosome; Neolithic transition; Population expansion; Demographic modeling; Coalescent simulations; Haplogroup; R1b; E1b1a

Background

Around 50 to 70 thousand years ago (approximately 60 KYA), modern humans expanded out of Africa, and by approximately 15 KYA had colonized all inhabitable continents [1]. During most of this period, the climate was both cold and unstable, but after approximately 10 KYA (the beginning of the Holocene period) it warmed and stabilized to produce the climate we know today. Early humans subsisted by hunting and gathering, but in the Holocene additional lifestyles became possible, including agriculture and pastoralism. This ‘Neolithic transition’ occurred independently at different times during the Holocene in different geographical regions. One Neolithic transition began in the Fertile Crescent in the Near East approximately 10 KYA and spread outwards in several directions, including into Europe over the course of several thousand years [2]. In sub-Saharan Africa, a comparable transition began later, approximately 3 KYA in West Africa, and spread south and east, reaching the extreme south only within historical times [3]. This differed from the transition in Europe in a number of respects: for example, there was no change in stone tool technology or use of copper or bronze, but instead a direct transition from the Later Stone Age to iron use, and some archaeologists therefore consider it inappropriate to use the term ‘Neolithic’ , but we retain it here because it is simple and widely understood. Both transitions were associated with large increases in population size.

Genetic evidence has contributed to our understanding of these events. There has been debate about the extent to which the genomes of present-day inhabitants of these areas have been derived from Neolithic farmers or from Paleolithic hunter-gatherers. The first large-scale molecular-genetic analyses in Europe were based on mitochondrial DNA (mtDNA) from present-day Europeans and were interpreted as favoring a Paleolithic entry for the majority of European mtDNAs [4]. More direct tests of this question, however, using ancient DNA (aDNA), have revealed a discontinuity between hunter-gatherer and early farmer mtDNAs, suggesting a Neolithic or later entry for the lineages that are most common today [5-8]. Similarly, low-coverage whole-genome sequencing supported the idea of a southern origin for early farmers from northern Europe [9,10], and thus migration and expansion of incoming Neolithic populations to replace the previous occupants.

The Y chromosome has several properties that make it potentially very informative about historical events, including the Neolithic transition. Its lack of recombination over most of its length means that it provides the most detailed and informative phylogenetic tree for any locus in the genome, while as a consequence of its strict father-to-son transmission it carries information specifically about male events [11]. Y-chromosomal lineages differ substantially between geographical regions and in each of the two areas considered here a single lineage predominates: R1b (especially the sublineage defined by the SNP M269, rs9786153) in Western Europe [12,13] and E1b1a (defined by the SNP known variously as M2, sY81, DYS271 or rs9785941) in sub-Saharan Africa [14]. While these observed geographical distributions are uncontested, and E1b1a has been widely associated with the Neolithic expansion in Africa [15,16], the time depth of R1b in Europe has been disputed, with opinions ranging from a Paleolithic date [13] to a Neolithic one [17]. aDNA has not yet been very informative for the Y chromosome, although the limited data available show no evidence of pre-Neolithic R1b lineages [5]. Full sequences from the Y chromosomes of present-day individuals, however, have recently become available, and these support a Neolithic spread of R1b [18]. In addition, the tree structure resulting from these sequences, based on the unbiased ascertainment of variants, is informative in other ways. There is a striking difference in the structure of the E1b1a and R1b phylogenies: R1b has a starlike structure indicative of an expansion so rapid that few mutations occurred during the expansion, while E1b1a has a more regular bifurcating structure.

In the current study, we accept R1b and E1b1a as lineages that expanded during the Neolithic, and set out to explore, using coalescent simulations, the demographic conditions under which their different phylogenetic structures might be expected to arise. We found that these differ between the two continents, and link our conclusions to the available archaeological evidence.

Methods

Data

The samples consisted of 21 high-coverage Y-chromosomal sequences downloaded from the Complete Genomics website [19], eight from the E1b1a haplogroup and 13 from the R1b haplogroup. Filtering of the data and generation of a phylogenetic tree from them have been described previously [18]. Eight individuals within the R1b haplogroup were from a three-generation pedigree, so in the current work where the simulations assume individuals are unrelated, this pedigree was combined to make a single branch by averaging the number of distinct SNPs in each family member and adding this value to the number of SNPs shared by all of the individuals.

Coalescent simulations

Simulations were performed using MaCS [20], a coalescent simulator, using six and eight haplotypes for the R1b and E1b1a data, respectively, with a sequence length of 8.8 × 106 nucleotides, assuming a generation time of 30 years [21], a mutation rate of 3 × 10-8 per nucleotide per generation [22] and zero recombination. The simulations explored the parameters of a single population expansion using four variables: the starting and final population sizes, the time when the expansion ended, and the length of the expansion. Examples of the command lines used are provided in Additional file 1: Table S2.

Additional file 1: Table S2. Examples of commands for MaCS. An example of a command for each of the R1b and E1b1a simulations is shown, along with the parameter set from which the command was derived.

Format: DOC Size: 29KB Download file

This file can be viewed with: Microsoft Word ViewerOpen Data

Since we needed to compare the output from the simulations with the trees from the real data, as described below, we constructed statistics related to ones used previously [23] to compare the output, as follows. The phylogenetic tree from each simulation was normalized to a total branch length of 1.0 and analyzed using three measures: the ratio of singletons to shared SNPs, and the mean and standard deviation of the TMRCA (Time to the Most Recent Common Ancestor) of all the individual haplotypes. The singleton/shared SNP ratio (r) was calculated by summing the terminal branch lengths and dividing by the sum of the internal branch lengths multiplied by one plus the sum of each internal branch length beneath its node:

<a onClick="popup('http://www.investigativegenetics.com/content/4/1/25/mathml/M1','MathML',630,470);return false;" target="_blank" href="http://www.investigativegenetics.com/content/4/1/25/mathml/M1">View MathML</a>

where b is a tree branch of length lb, which has nBEN branches of length lbi beneath its node, nTER is the number of terminal branches and nINT is the number of internal branches.

The other two statistics were calculated by determining the branch length of the TMRCA of each combination of the individual haplotypes and computing the mean and standard deviation. The three statistics thus reflect both the time depth of the tree and how starlike its structure is.

Comparison of data and coalescent simulations

To identify the range of simulation parameter values that best fit the empirical trees, we created heat maps of a summary value of the three statistics, designated the average normalized delta (AND) value. The AND value was computed by dividing the difference of the simulated statistic and the empirical statistic by the empirical statistic and averaging these three distances:

<a onClick="popup('http://www.investigativegenetics.com/content/4/1/25/mathml/M2','MathML',630,470);return false;" target="_blank" href="http://www.investigativegenetics.com/content/4/1/25/mathml/M2">View MathML</a>

where the subscript s indicates a simulated value, o an observed value, r a singleton/shared ratio statistic, m a mean TMRCA statistic and d a standard deviation of a TMRCA statistic.

A low AND value thus indicates a good fit to the empirical data. We completed 1,000 simulations for each demographic scenario and averaged each statistic to use as the simulated value.

The ranges for the parameters on the first set of simulations and corresponding heat map were each chosen to be very wide, including all reasonable estimates for their values (Additional file 2: Table S1). The parameter ranges for the time the expansion ended and the length of the expansion were each extended past the empirical TMRCA for each respective haplogroup. For each successive heat map, a conservative selection of the lowest AND values was noted and the ranges for the following set of simulations chosen to include these, unless their TMRCAs were not compatible with the maximum TMRCA of the haplogroup. Thus we sequentially removed parameter values that resulted in large AND values, progressively narrowing the range until it encompassed only AND values of 0.05 and below. Although these do not provide an absolute measure of how well the model fits the data, they show that among the wide ranges of parameters explored, these are the best fits. Then, a histogram for each parameter was created using the frequency of sub-0.05 AND values, to provide an indication of our conclusions regarding this parameter value.

Additional file 2: Table S1. Starting parameter values for the simulations.

Format: DOC Size: 28KB Download file

This file can be viewed with: Microsoft Word ViewerOpen Data

Results

The phylogenetic trees of the R1b and E1b1a branches of the Y-chromosomal phylogeny show strongly contrasting structures (Figure 1), as previously noted [18]. R1b has a markedly starlike structure (Figure 1a), with only a single variant uniting three of the six chromosomes creating a departure from a perfect star, while E1b1a shows a largely bifurcating structure with greater time depth and just one trifurcation (Figure 1b).

thumbnailFigure 1. Phylogenies based on high-coverage whole-genome sequences. (a) Six R1b and (b) eight E1b1a Y chromosomes. Branch lengths are proportional to the number of SNPs, which are given on each branch, and thus approximately proportional to time.

To explore demographic scenarios that could lead to these different structures, we performed coalescent simulations that included four parameters: starting and ending population sizes, and length and end time of the expansion (Figure 2). We used a strategy of sequential rounds of simulations, starting with a broad range of parameter values, assessing which combinations of these led to the best fit with the observed data, and then repeating the simulations with a narrower range of values centered around those that led to the best fit. These results are presented visually as heat maps illustrating the AND values, which measure the simulation-observed match (Figure 3 and Additional file 3: Figures S1-S14). In these heat maps, the color of the small rectangles indicates the AND value: red is for a good fit, yellow and green are for intermediate fits and blue is for a poor fit, as in the scale on the right of the maps. These small rectangles are assembled into sets with differing values of the starting population size (StartN, bottom) and ending population size (EndN, left) to form a grid of intermediate-sized rectangles separated by grey/white borders. These grids have different times for when the expansion ended (top) and different expansion lengths (right). The best-fitting small rectangles in Figure 3 (AND < 0.05) are marked with black dots. After 9 and 11 rounds of simulations for R1b and E1b1a, respectively, we obtained simulation sets in which a substantial proportion of the parameter combinations showed a good fit between the simulations and the observed data, indicated by an AND value of <0.05. We summarize the distribution of individual parameter values from these well-fitting simulations in Figure 4.

thumbnailFigure 2. Demographic model used in coalescent simulations. A single exponential expansion was modeled, with four variable parameters as shown.

thumbnailFigure 3. Fit between model and observed data. The color of the small rectangles indicates the AND value, which measures the fit between the model and the observed tree. Red: good fit, yellow and green: intermediate fits, blue: poor fit, as indicated by the scale. Each rectangle is based on 1,000 simulations. The best-fitting rectangles (AND < 0.05) are marked with black dots. AND, average normalized delta.

Additional file 3: Figures S1 to S14. Heat maps illustrating the AND values from sequential simulation runs.

Format: PPTX Size: 1.8MB Download fileOpen Data

thumbnailFigure 4. Best-fitting parameter values. Distributions of values for the four parameters from the simulations that fitted the empirical data best (AND < 0.05).

The simulations suggest that very different demographic histories are needed to generate the R1b and E1b1a trees. In Europe, the expansion in size was extreme, from a starting size of just two men (range one to three; numbers are given as the median and 95% interval from the data in Figure 4, rounded appropriately) to an ending size of approximately 9,500 (5,000 to 12,500), while in Africa it was extensive but less extreme, from a starting size of approximately 40 (1 to 80) to an ending size of approximately 2,000 (500 to 5,500). In Europe, the expansion was very rapid, taking only approximately 325 (50 to 600) years and ending approximately 12 (6 to 14) KYA, while in Africa it was considerably less rapid, taking approximately 12 (2 to 24) KY and ending more recently, approximately 2 (0 to 12) KYA. The resulting most favored scenarios are illustrated in Figure 5.

thumbnailFigure 5. Favored demographic models for the European and African Neolithic expansions.

Discussion

The model we have explored, involving a single exponential expansion, is grossly simplified. In addition, we have analyzed within each population a single lineage (R1b or E1b1a) of a single locus (the Y chromosome), and this may not be representative of the population. Nevertheless, there are several reasons to believe that our results should capture features of interest. First, the male history represented by the Y chromosome is of interest whether or not it corresponds to the history of other regions of the genome. Second, the single Y lineages we examined are the most frequent in their respective geographical regions, being found in >75% and >80% of males from many Western European and sub-Saharan African populations, respectively, so form a major constituent of the Y-chromosomal gene pool. Furthermore, the chromosomes sampled within each of the two lineages have diverse geographical origins: the R1b chromosomes come from the CEU (Northwestern Europe [24]), TSI (Italy), PUR and MXL (probably Iberia) populations, while the E1b1a chromosomes come from the YRI (Nigeria), LWK (Kenya) and ASW (probably West Africa) populations. Thus their origins are not confined to any one country or small geographical area, and are likely to be broadly representative of these lineages. Third, the Y phylogenies, based on resequencing approximately 9 Mb of Y-chromosomal DNA, are very robust, especially in this high-coverage dataset where singletons will be called reliably. Consequently the R1b chromosomes in this set, for example, must have radiated in an interval so short that there was only enough time for a single mutation to occur, no matter how complex the migrations, integrations or replacements and other cultural changes going on in the society carrying these chromosomes. Fourth, although only a portion of the parameter space has been explored within the model, and it remains possible (indeed, it is an inevitable feature of this approach), that an undiscovered global optimum with very narrow parameter values may exist, our sequential approach (Additional files 3: Figures S1 to S14) minimizes the chance of this, and we discuss below the good correspondence with other sources of information.

With these caveats, we can consider how the Y-chromosome-based genetic findings fit with other genetic and archaeological evidence. The Neolithic transition in Europe has been studied extensively by archaeologists. It appeared in Greece approximately 9 KYA and reached the extreme west by approximately 4 KYA [1,2]. The demographic model suggests that the R1b expansion most likely ended before this time, at approximately 12 KYA (Figures 4 and 5), which appears inconsistent with a Neolithic expansion of this lineage, although the lower limit does extend to approximately 6 KYA. We interpret the discrepancy, however, as a limitation of the model. We constrained the parameter values so that R1b could not expand before the estimated TMRCA of the sampled R1b chromosomes [18], and the model favored an immediate expansion of the lineage, hence the expansion at approximately 12 KYA. If we had used the more likely 4 to 5 KYA estimate of the R1b TMRCA from the rho statistic [18], the expansion in the current model would have been placed close to this time, well within the Neolithic and, interestingly, also close to the time of establishment of the major European mtDNA haplogroup, H, approximately 6 KYA [7,8]. The rapidity of the R1b expansion and the large increase in population size are most consistent with migration and population replacement, issues debated by archaeologists but favored by the aDNA data [5-9]. The later and more gradual E1b1a expansion in Africa is as expected from the spread of cattle-herders from the north between 2.5 and 8 KYA, followed by the Bantu expansion to the southern tip of the continent beginning approximately 2.5 KYA and ending within the last few hundred years, incorporating the package of Bantu languages, cattle and iron-working [1,3]. The population sizes used by the model are genetic effective population sizes, which, for a population that has expanded recently, are much smaller than the census population size [1].

Studies of this kind can be improved by considering more complex demographic models and larger Y-chromosomal datasets. While it may seem obvious that more complex and thus more realistic models should be preferable, models are only useful if the different scenarios they encompass can be discriminated between using the data available, so the simplest model that captures a relevant aspect of the data may still be the most appropriate one. Thus while future models in this context could incorporate spatial structure and phenomena such as surfing [25], a single rapid expansion should still be permitted. We have modeled only a single Y haplogroup, because in each expansion a single haplogroup predominates. Low-coverage sequencing of larger population samples by the 1000 Genomes Project [26,27] and two recent studies focusing on Africa [28] and Sardinia [29] confirm both the high frequencies of haplogroups R1b and E1b1a in the relevant populations and the structures of the phylogenetic trees associated with them. These projects thus provide much larger datasets, which could be used in future modeling studies, although the low coverage and substantial false negative rates of rare variants would need to be taken into account. With such data, the additional rare Y haplogroups present in the populations could also be considered. Different studies have come to different conclusions about the Y-chromosomal mutation rate [22,28,29]; in the current study, the mutation rate is used simply to scale the results, and a mutation rate about half [29] of that used here [22], for example, would double the times. Finally, we note that such analyses of single lineages, which may have deep coalescences, contrast with the universal sharing of recent genealogical ancestors by all people within the last few thousand years [30].

Conclusions

We have identified demographic scenarios that can lead to the contrasting phylogenies observed for the major Y-chromosomal lineages that expanded during the distinct Neolithic transitions in Europe and Africa. These suggest that in Europe, the R1b lineage experienced an extremely rapid and extensive increase as soon as it entered the continent, expanding more than a thousandfold in a few generations. The expansion in Africa began from a larger population size, took thousands of years and ended only recently. While these conclusions are based on a simplified demographic model, they capture major differences between the continents and fit many aspects of the archaeological findings.

Abbreviations

aDNA: Ancient DNA; AND: Average normalized delta; KYA: Thousand years ago; mtDNA: Mitochondrial DNA; SNP: Single nucleotide polymorphism; TMRCA: Time to the most recent common ancestor.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

CTS and YX conceived the study. MJS, VC, YX and CTS designed the study. MJS performed the simulations. MJS, VC, YX and CTS interpreted the results. MJS and CTS drafted the manuscript. All authors read and approved the final manuscript.

Acknowledgement

This work was supported by the Wellcome Trust (WT098051); MJS was supported by an International Thesis Research Grant from the Schreyer Honors College at The Pennsylvania State University.

References

  1. Jobling M, Hollox E, Hurles M, Kivisild T, Tyler-Smith C: Human Evolutionary Genetics. 2nd edition. Garland Science: Abingdon, UK; 2013. OpenURL

  2. Pinhasi R, Fort J, Ammerman AJ: Tracing the origin and spread of agriculture in Europe.

    PLoS Biol 2005, 3:e410. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  3. Phillipson DW: African Archaeology. 2nd edition. Cambridge, UK: Cambridge University Press; 2002. OpenURL

  4. Richards M, Macaulay V, Hickey E, Vega E, Sykes B, Guida V, Rengo C, Sellitto D, Cruciani F, Kivisild T, Villems R, Thomas M, Rychkov S, Rychkov O, Rychkov Y, Golge M, Dimitrov D, Hill E, Bradley D, Romano V, Cali F, Vona G, Demaine A, Papiha S, Triantaphyllidis C, Stefanescu G, Hatina J, Belledi M, Di Rienzo A, Novelletto A, et al.: Tracing European founder lineages in the Near Eastern mtDNA pool.

    Am J Hum Genet 2000, 67:1251-1276. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  5. Haak W, Balanovsky O, Sanchez JJ, Koshel S, Zaporozhchenko V, Adler CJ, Der Sarkissian CS, Brandt G, Schwarz C, Nicklisch N, Dresely V, Fritsch B, Balanovska E, Villems R, Meller H, Alt KW, Cooper A: Ancient DNA from European early Neolithic farmers reveals their Near Eastern affinities.

    PLoS Biol 2010, 8:e1000536. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  6. Bramanti B, Thomas MG, Haak W, Unterlaender M, Jores P, Tambets K, Antanaitis-Jacobs I, Haidle MN, Jankauskas R, Kind CJ, Lueth F, Terberger T, Hiller J, Matsumura S, Forster P, Burger J: Genetic discontinuity between local hunter-gatherers and central Europe's first farmers.

    Science 2009, 326:137-140. PubMed Abstract | Publisher Full Text OpenURL

  7. Brotherton P, Haak W, Templeton J, Brandt G, Soubrier J, Adler CJ, Richards SM, Sarkissian CD, Ganslmeier R, Friederich S, Dresely V, van Oven M, Kenyon R, Van der Hoek MB, Korlach J, Luong K, Ho SY, Quintana-Murci L, Behar DM, Meller H, Alt KW, Cooper A, The Genographic Consortium: Neolithic mitochondrial haplogroup H genomes and the genetic origins of Europeans.

    Nat Commun 2013, 4:1764. PubMed Abstract | Publisher Full Text OpenURL

  8. Brandt G, Haak W, Adler CJ, Roth C, Szecsenyi-Nagy A, Karimnia S, Moller-Rieker S, Meller H, Ganslmeier R, Friederich S, Dresely V, Nicklisch N, Pickrell JK, Sirocko F, Reich D, Cooper A, Alt KW: Ancient DNA reveals key stages in the formation of Central European mitochondrial genetic diversity.

    Science 2013, 342:257-261. PubMed Abstract | Publisher Full Text OpenURL

  9. Skoglund P, Malmstrom H, Raghavan M, Stora J, Hall P, Willerslev E, Gilbert MT, Gotherstrom A, Jakobsson M: Origins and genetic legacy of Neolithic farmers and hunter-gatherers in Europe.

    Science 2012, 336:466-469. PubMed Abstract | Publisher Full Text OpenURL

  10. Barbujani G: Human genetics: message from the Mesolithic.

    Curr Biol 2012, 22:R631-633. PubMed Abstract | Publisher Full Text OpenURL

  11. Jobling MA, Tyler-Smith C: The human Y chromosome: an evolutionary marker comes of age.

    Nat Rev Genet 2003, 4:598-612. PubMed Abstract | Publisher Full Text OpenURL

  12. Rosser ZH, Zerjal T, Hurles ME, Adojaan M, Alavantic D, Amorim A, Amos W, Armenteros M, Arroyo E, Barbujani G, Beckman G, Beckman L, Bertranpetit J, Bosch E, Bradley DG, Brede G, Cooper G, Corte-Real HB, de Knijff P, Decorte R, Dubrova YE, Evgrafov O, Gilissen A, Glisic S, Golge M, Hill EW, Jeziorowska A, Kalaydjieva L, Kayser M, Kivisild T, et al.: Y-chromosomal diversity in Europe is clinal and influenced primarily by geography, rather than by language.

    Am J Hum Genet 2000, 67:1526-1543. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  13. Semino O, Passarino G, Oefner PJ, Lin AA, Arbuzova S, Beckman LE, De Benedictis G, Francalacci P, Kouvatsi A, Limborska S, Marcikiae M, Mika A, Mika B, Primorac D, Santachiara-Benerecetti AS, Cavalli-Sforza LL, Underhill PA: The genetic legacy of Paleolithic Homo sapiens sapiens in extant Europeans: a Y chromosome perspective.

    Science 2000, 290:1155-1159. PubMed Abstract | Publisher Full Text OpenURL

  14. Cruciani F, Santolamazza P, Shen P, Macaulay V, Moral P, Olckers A, Modiano D, Holmes S, Destro-Bisol G, Coia V, Wallace DC, Oefner PJ, Torroni A, Cavalli-Sforza LL, Scozzari R, Underhill PA: A back migration from Asia to sub-Saharan Africa is supported by high-resolution analysis of human Y-chromosome haplotypes.

    Am J Hum Genet 2002, 70:1197-1214. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  15. de Filippo C, Bostoen K, Stoneking M, Pakendorf B: Bringing together linguistic and genetic evidence to test the Bantu expansion.

    Proc Biol Sci 2012, 279:3256-3263. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  16. Montano V, Ferri G, Marcari V, Batini C, Anyaele O, Destro-Bisol G, Comas D: The Bantu expansion revisited: a new analysis of Y chromosome variation in Central Western Africa.

    Mol Ecol 2011, 20:2693-2708. PubMed Abstract | Publisher Full Text OpenURL

  17. Balaresque P, Bowden GR, Adams SM, Leung HY, King TE, Rosser ZH, Goodwin J, Moisan JP, Richard C, Millward A, Demaine AG, Barbujani G, Previdere C, Wilson IJ, Tyler-Smith C, Jobling MA: A predominantly Neolithic origin for European paternal lineages.

    PLoS Biol 2010, 8:e1000285. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  18. Wei W, Ayub Q, Chen Y, McCarthy S, Hou Y, Carbone I, Xue Y, Tyler-Smith C: A calibrated human Y-chromosomal phylogeny based on resequencing.

    Genome Res 2013, 23:388-395. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  19. Complete Genomics Accessed October 2011 [http://www.completegenomics.com/public-data/69-Genomes webcite]

  20. Chen GK, Marjoram P, Wall JD: Fast and flexible simulation of DNA sequence data.

    Genome Res 2009, 19:136-142. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  21. Fenner JN: Cross-cultural estimation of the human generation interval for use in genetics-based population divergence studies.

    Am J Phys Anthropol 2005, 128:415-423. PubMed Abstract | Publisher Full Text OpenURL

  22. Xue Y, Wang Q, Long Q, Ng BL, Swerdlow H, Burton J, Skuce C, Taylor R, Abdellah Z, Zhao Y, MacArthur DG, Quail MA, Carter NP, Yang H, Tyler-Smith C: Human Y chromosome base-substitution mutation rate measured by direct sequencing in a deep-rooting pedigree.

    Curr Biol 2009, 19:1453-1457. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  23. Rosenberg NA, Hirsh AE: On the use of star-shaped genealogies in inference of coalescence times.

    Genetics 2003, 164:1677-1682. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  24. He M, Gitschier J, Zerjal T, de Knijff P, Tyler-Smith C, Xue Y: Geographical affinities of the HapMap samples.

    PLoS One 2009, 4:e4684. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  25. Ray N, Excoffier L: Inferring past demography using spatially explicit population genetic models.

    Hum Biol 2009, 81:141-157. PubMed Abstract | Publisher Full Text OpenURL

  26. The 1000 Genomes Project Consortium: A map of human genome variation from population-scale sequencing.

    Nature 2010, 467:1061-1073. PubMed Abstract | Publisher Full Text | PubMed Central Full Text OpenURL

  27. The 1000 Genomes Project Consortium: An integrated map of genetic variation from 1,092 human genomes.

    Nature 2012, 56-65. OpenURL

  28. Poznik GD, Henn BM, Yee MC, Sliwerska E, Euskirchen GM, Lin AA, Snyder M, Quintana-Murci L, Kidd JM, Underhill PA, Bustamante CD: Sequencing Y chromosomes resolves discrepancy in time to common ancestor of males versus females.

    Science 2013, 341:562-565. PubMed Abstract | Publisher Full Text OpenURL

  29. Francalacci P, Morelli L, Angius A, Berutti R, Reinier F, Atzeni R, Pilu R, Busonero F, Maschio A, Zara I, Sanna D, Useli A, Urru MF, Marcelli M, Cusano R, Oppo M, Zoledziewska M, Pitzalis M, Deidda F, Porcu E, Poddie F, Kang HM, Lyons R, Tarrier B, Gresham JB, Li B, Tofanelli S, Alonso S, Dei M, Lai S, et al.: Low-pass DNA sequencing of 1200 Sardinians reconstructs European Y-chromosome phylogeny.

    Science 2013, 341:565-569. PubMed Abstract | Publisher Full Text OpenURL

  30. Rohde DL, Olson S, Chang JT: Modelling the recent common ancestry of all living humans.

    Nature 2004, 431:562-566. PubMed Abstract | Publisher Full Text OpenURL