- Research article
- Open Access
Y-chromosomal diversity in the population of Guinea-Bissau: a multiethnic perspective
BMC Evolutionary Biologyvolume 7, Article number: 124 (2007)
The geographic and ethnolinguistic differentiation of many African Y-chromosomal lineages provides an opportunity to evaluate human migration episodes and admixture processes, in a pan-continental context. The analysis of the paternal genetic structure of Equatorial West Africans carried out to date leaves their origins and relationships unclear, and raises questions about the existence of major demographic phenomena analogous to the large-scale Bantu expansions. To address this, we have analysed the variation of 31 binary and 11 microsatellite markers on the non-recombining portion of the Y chromosome in Guinea-Bissau samples of diverse ethnic affiliations, some not studied before.
The Guinea-Bissau Y chromosome pool is characterized by low haplogroup diversity (D = 0.470, sd 0.033), with the predominant haplogroup E3a*-M2 shared among the ethnic clusters and reaching a maximum of 82.2% in the Mandenka people. The Felupe-Djola and Papel groups exhibit the highest diversity of lineages and harbor the deep-rooting haplogroups A-M91, E2-M75 and E3*-PN2, typical of Sahel's more central and eastern areas. Their genetic distinction from other groups is statistically significant (P = 0.01) though not attributable to linguistic, geographic or religious criteria. Non sub-Saharan influences were associated with the presence of haplogroup R1b-P25 and particular lineages of E3b1-M78.
The predominance and high diversity of haplogroup E3a*-M2 suggests a demographic expansion in the equatorial western fringe, possibly supported by a local agricultural center. The paternal pool of the Mandenka and Balanta displays evidence of a particularly marked population growth among the Guineans, possibly reflecting the demographic effects of the agriculturalist lifestyle and their putative relationship to the people that introduced early cultivation practices into West Africa. The paternal background of the Felupe-Djola and Papel ethnic groups suggests a better conserved ancestral pool deriving from East Africa, from where they have supposedly migrated in recent times. Despite the overall homogeneity in a multiethnic sample, which contrasts with their social structure, minor clusters suggest the imprints of multiple peoples at different timescales: traces of ancestral inhabitants in haplogroups A-M91 and B-M60, today typical of hunter-gatherers; North African influence in E3b1-M78 Y chromosomes, probably due to trans-Saharan contacts; and R1b-P25 lineages reflecting European admixture via the North Atlantic slave trade.
Many genetic studies of sub-Saharan Y chromosome variation have paid special attention to the large-scale Bantu expansions, and the particular pool of the "relic" Central African Pygmies and the South African Khoisan [1–7], while little is known about the events that have shaped the paternal structure of Equatorial West Africans. Although anthropological evidence is scarce, the earliest traces of West Atlantic occupation by modern humans dates back 40 ky [8, 9]. Later climatic changes, when around 9 kya the Sahara was at its wettest , created conditions for both the massive displacement of people and the spread of agriculture, reaching previously uninhabited areas and promoting admixture with isolated populations [11–14]. Although the farming practices in Sahel could have started earlier than 6 kya [15, 16], firm archaeological evidence points to the domestication of local sorghum, millet and yams ~4 kya . Together with the introduction of iron-smelting techniques ~2.7 kya, agriculture led ultimately to the large-scale Bantu migrations from the Gulf of Guinea to the south of the continent . From the perspective of Y chromosome genetic variation, such movements are believed to have erased much of the pre-existing diversity, replacing it with the now dominant haplogroup E3a-M2 lineages [4, 19, 20].
The inhabitants of the Guinea-Bissau area have certainly been under the influence of several demographic events since prehistorical times, as a result of migratory movements, trade networks and consecutive invasions. The first recorded influx of ethnically defined groups is the arrival of Fulbe people in the 8th century AD, from a Central African epicenter . First contact with the North African Berbers dates back to at least the 9th century, and was repeated in the 11th century when, pushed by the Omníades, these people came to occupy the vicinity of Senegal . The economic shift in the Sahel allowed more centralized states to form (namely the "Black Kingdoms" in the period between the 8th and 16th centuries, ), linked by a trading corridor reaching from Mauritania to Niger . In the following centuries pastoral Fulbe arrived again slowly but en masse, together with the Mandenka, and became the most prevalent people in Guinea-Bissau territory. Oral tradition also states that the Djola people – Felupe-Djola, Baiote and possibly Beafada – came from Sudan in the 15th-16th centuries . As for the Balanta, Sudanese or Bantu affinities may argue for their cultural and phenotypic aspects. Though research on the background of the Nalú is less advanced, Teixeira da Mota  considers them to be the autochthonous people of the region. The same author identifies Bijagós as a separated branch of Djola or relatives of Papel and Nalú. The main ethnic groups now present in Guinea-Bissau (Figure 1; see Additional file 1) were already settled in the region in the 15th century, at the time of arrival of the Portuguese. With the establishment of the Atlantic slave trade the region experienced an input of Europeans, in their vast majority males, whose genetic imprint is undetermined. Many of the ethnic barriers were brought down, in particular the endogamic practices, promoting an intense cultural contact and higher levels of admixture between groups than before.
The present study intends to characterize the paternal genetic pool of Guineans, focusing on their ethnic affiliation, by the use of binary markers and microsatellites on the non-recombining region of the Y chromosome (NRY). Our sample (n = 282) extends significantly the Y-chromosomal coverage of West African populations (Senegal , Gambia/Senegal Wolof and Mandenka , Mali  and Dogon , Burkina-Faso [1, 26], Ghana Ewe, Ga and Fante ) both in size and number of surveyed ethnic groups. The unique features of the Y chromosome system, namely its haploid and non-recombining nature and paternal inheritance, provide an opportunity to evaluate the temporal and spatial aspects of population movements, in the light of the available non-genetic evidence.
Results and discussion
Y chromosome haplogroup variation
The fairly homogeneous paternal structure of Guinea-Bissau (D = 0.470, sd 0.033), is not surprising given the general landscape of sub-Saharan low Y-chromosomal haplogroup diversity [2, 27] and its reported east-to-west decline along a Central African corridor . Responsibility for the low diversity is attributed to the highly frequent E3a*-M2 and E1*-M33 lineages (72.0% and 15.6%, respectively) that are shared among all ethnic clusters (Figure 2). In our dataset the Mandenka harbor the highest frequency (82.2%) of the E3a*-M2 paragroup, fitting the context of its closest neighbors (~80% in Senegalese  and Gambia/Senegal Mandinka ). The lack of diversity of West African Y chromosomes together with the predominance of E3a*-M2 lineages (assuming a frequency peak only equivalent to that in Central Africa; Figure 3) reinforces its link to agricultural expansion [3, 4, 19, 20] and hint at the existence of a large local center of cultivation [14–16, 18, 29]. We hypothesized that the newly adopted lifestyle created conditions for major demographic growth, obscuring earlier patterns of lineages. Alternatively, a moderate farming expansion may have occurred on a background of reduced diversity, following the 5.5 kya savanna retreat  or the malarial epidemic episodes which were an outcome of pastoral habits .
The lifestyle transition in West Africa was most likely promoted by people other than the Bantu, as no relevant westwards migrations of these people are reported and none or few Bantu languages are found in the area today. In fact, the West African center may date earlier than that documented for Central Africa and may have acted as a western source of knowledge [14–16]. Based on the high frequency and microsatellite diversity of E3a*-M2 in the Mandenka and Balanta (Figure 2; see Additional file 2), we suggest that these people may have experienced a particular benefit from food production. If so, this might associate their ancestors with the people who implemented the farming habits in the Guinea-Bissau area. The Mandenka are physically and culturally descendants of the Mande, protagonists of agricultural population expansions in Niger/Mali/Burkina-Faso region  and rulers of the West African Black Empires, based on trade and agriculture. For the Balanta, the cultural and physical affinities with Bantu suggest a common origin at the end of the Pleistocene , so it may be that different peoples jointly learnt the agricultural techniques. The E3a7-M191 lineages of one Fulbe and two Mandenka individuals of Guinea-Bissau are undoubtedly representatives of a Central African lineage that followed a trajectory to the west [2, 3, 5, 32].
Haplogroup E1*-M33, of probable local radiation (5–7% in Senegal and Burkina-Faso [2, 3, 5, 7], 40.4% in Mali and 52.9% in Fulbe of Cameroon [1, 26]), is surprisingly frequent in Felupe-Djola and Papel (34.0% and 20.3%). Both ethnic groups exhibit the highest haplogroup diversity (0.5 < D < 0.6) and the deepest-rooting phylogenetic types in our dataset – haplogroups A-M91, E2-M75 and E3*-PN2 – some with occasional occurrences in Fulbe and Balanta (Figure 2). These minor imprints may represent movements from Sahel's more central and eastern parts, seen, for example, in the typically Ethiopian/Sudanese E3*-PN2 lineages that have reached Senegambia [2, 3, 5]. The Djola's oral tradition claims an arrival from Sudan in the 15th-16th centuries which is supported by their carrying the lowest fraction of E3a* in our dataset (58.0%). At the same time, the relatively short time of residence and/or the genetic isolation on cultural grounds has not contributed to a greater homogeneity among the peoples. The Papel, curiously also affiliated to the Bak-speakers, may either represent a legacy left by earlier inhabitants of the Guinean delta, survivors of an ancient pool through demographic reductions and expansions, or later arrivers who have preserved a more discrete genetic identity.
Of greater prevalence in the East quadrant of Africa and among South African Khoisan (~12% and 15%, respectively; [2, 5]) the paragroup E3b*-M35 is common to Felupe-Djola and Papel (~2%) but is also found among Fulbe and Mandenka (~4%). Its presence at ~2% in Guinea-Bissau and ~5% in Senegal may also indicate loose relationships to the North, where it is widespread at rather low frequencies (2–4%, [1, 26, 33–35]. A similar scenario of Eastern prevalence and North African spread traces the African distribution of E3b1-M78 (~26% in Sudan and Ethiopia and 19% in NW-African Arabs), not to mention the ~7% in the Near Eastern and European people [1, 5, 26, 33–35]. In Guinea-Bissau this haplogroup attains the highest frequency so far reported for West Africa (~4%).
The remainder of binary marker variation falls into haplogroups A, B and R, each detected at marginal frequencies (0.4–3.9%). Clades A-M91 and B-M60, the most divergent of the haplogroups of the Y chromosome tree, are associated with the earliest modern human diversification and are putative markers of the first pan-African dispersals of hunter-gatherers [2, 3, 7, 20, 36]. However, the Guinea-Bissau A-M91 lineages do not belong to the widespread A3-M32 but to the A1-M31 subcluster, with reported marginal presence in Mali (2.0% [2, 7]), Gambia/Senegal Mandinka (5.1% ) and North African Berbers (3.1% [1, 33–35]). Any association of Balanta to the Sudanese-speakers is traceable only in the A3b2-M13 and E3* Y chromosomes. The B-M60 variant observed in almost all sub-Saharan collections  was only found in Nalú. One other Nalú individual belongs to the rare and deep-rooting DE* paragroup described in five Nigerians  and thus representing a coalescent "missing link", paraphyletic to haplogroups D and E. The two Western European R1b-P25 lineages in Fulbe and Bijagós are best explained by recent European influence, at the time of the slave trade. A partial introduction through North African pastoral immigrants can not be rejected, where the 3–12% of R1b-P25 are due to the geographic proximity and the long reported contacts with Europe and Middle-East . The European source seems nevertheless more likely: firstly, Y chromosome signatures of European presence have a reported great expression in the nearby Cape Verdians  and secondly, highly frequent North African haplogroups that would have been equally carried by the migrants (e.g. E3b2-M81) are absent in Guineans. The M173 and P25 derived states in both our samples rule out a relationship to the R1*-M173 lineage previously found in Cameroon, Oman, Egypt and Rwanda, and adduced to support the "Back-to-Africa" theory [3, 28].
Pairwise FST analysis of haplogroup frequencies reveals the Felupe-Djola as the only group statistically significantly different from others, namely Bijagós (FST = 0.095, P = 0.027), Fulbe (FST = 0.081, P = 0.004) and Mandenka (FST = 0.107, P = 0.004). The exact test of population differentiation reveals similar information, further distinguishing Papel from Bijagós (P = 0.01), Fulbe (P = 0.003) and Mandenka (P = 0.04). These results are in agreement with principal components analysis (PCA; see below) and the interpretation of the greater distinctiveness of the paternal pool of Felupe-Djola and Papel among other Guineans.
PCA and AMOVA analysis
A PCA of Guinean and other African populations Y chromosome haplogroup frequencies is depicted in Figure 4a [see Additional files 3 and 4 for population details]. The 1st PC clearly separates the Afro-Asiatic speakers from other linguistic families, independently of their geographic location. Consistent with geographical grouping, North and West Africans cluster in independent and tighter groups. The coordinates of North Africans are attributable to haplogroups E3b2-M81 and J-12f2 while West Africans' Y chromosomes cluster largely due to E3a*-M2, and E1-M33 to a lesser extent. Central and South African people are more dispersed in the plot, many lying closer to the Eastern populations (due to the presence of R-M207, A3-M32 and B2-M182 lineages) while others lie closer to the Western cluster. A linguistic correlation is hypothesized to underlie the genetic proximity of Bantu-speakers occupying different quadrants of the continent, driven by the E3a7-M191. Guinea-Bissau groups are included in the western cluster of populations, in close vicinity to Gambia/Senegal Wolof and Mandinka  and Senegalese  with which they share numerous population groups. It is noteworthy that the Guinea-Bissau Fulbe show a distinct pool from other Fulbe people, namely the ones in Burkina-Faso and Cameroon, and are integrated within the Guinea-Bissau variation. A PCA of Guinea-Bissau ethnic groups is illustrated in Figure 4b, less biased by the major influence of haplogroup E3a*-M2 and where the influence of minor clusters is emphasized. The Felupe-Djola and Papel have distinctive positions, largely a result of the high frequency of haplogroup E1-M33. The Bijagós, inhabitants of the archipelago, are placed apart in closer relation to the mainland Fulbe. The position of Mandenka is clearly defined by its E3a*-M2 composition.
The AMOVA yielded no statistically significant results for ethnic group distinction on any of the defined criteria, with ~97% of the variance occurring at the within-population level (P < 0.05; see Additional file 5). These results suggest that in spite of obvious sociocultural differences among groups, marked by the supposedly strict admixture barriers, their Y chromosome gene pool remains largely shared, because of common origin or common history of genetic admixture without language shift.
Microsatellite haplotypes within haplogroups
Y-chromosomal microsatellites provide further haplotype resolution, and are of particular use when, as in this case, some haplogroups are very prevalent. The E3a*-M2 microsatellite profiles of Mandenka and Balanta are the most diverse among our data (RST average gene diversity, see Additional file 2) and attest to an earlier origin or more pronounced expansion. Since the corresponding parameter in Fulbe is less diverse we consider this to signal either a genetic bottleneck or their more recent expansion and late arrival in the West. The data are consistent with the less diverse E3a-M2 profile in Central and South Africans (data not shown). Haplotypes within E3b1-M78 are supposed to represent distinct clusters of local genetic drift . The rare DYS439 allele 10 of a so-called E3b1-β cluster particularly widespread among Moroccan Arabs defines a contribution to the Guinean Fulbe and Bijagós from North West Africans who have crossed the Sahara. The hypothesis of much later European contribution is valid though the remaining variability is absent (except for two R1b-P25 chromosomes) and none of the Guinean haplotypes carry the A7.1 allele with size 9, characteristic of Europe . Microsatellite networks for paragroup E3a*-M2 and haplogroup E3b1-M78 are not informative due to multiple reticulations and the absence of a clear haplotype sub-structure particularly associated to ethnic groups [see Additional file 6]. Further refinement awaits the finding of new markers especially within paragroup E3a*-M2. The microsatellite profile of the DE* individual is one mutational step away from the allelic state described for Nigerians (DYS390*21, DYS388 not tested; , therefore suggesting a common ancestry but not elucidating the phylogenetics.
The Fulbe E3a*-M2 extended haplotypes find exact matches in Equatorial Guinea, Mozambique, Angola and Xhosa (H61, H48, H69 [40, 41]; see Additional file 7) supporting their broad distribution. The Mandenka share E3a*-M2 variants with all other groups in Guinea-Bissau and do not match types outside Central-West Africa (except H67 in Mozambique), a sign of localized expansion and increased influence over their ethnic neighbors. The Felupe-Djola, Balanta and Papel each share one microsatellite haplotype (H49, H46 and H127, respectively) with Mozambique and Angola. Several E3a* eight-loci profiles matched Europeans (H29, H38, H44, H30, H152, H153 and H55), most likely descendants of incoming slaves. Three Fulbe E3b1-M78 haplotypes (H155 and H156) were found to match Spanish haplotypes  and samples in Central Portugal, Macedonia, Romania and Poland (YHRD database ). Both profiles present the A7.1 allele 12, quite frequent in Equatorial Guinea . The R1b-P25 H165 has a 10-loci haplotype found in 68 worldwide populations, of which 53 are European (nine matches in Portugal, YHRD). The picture for H166-R1b is quite different since on a 7-loci basis it matches four Europeans and two individuals from the Reunion Islands, known to have a European-permeable culture.
MtDNA haplogroup variation
Comparisons between mtDNA and Y-chromosomal diversity are hindered because of the very different mutational properties of their SNPs and Y microsatellites, and because of SNP ascertainment bias on the Y chromosome. Therefore, caution is needed when interpreting the results.
The maternal inheritance of Guineans is markedly West African, with haplogroups coalescing at distinct timeframes, from the initial occupation of the area to the later inputs of people . Of relevance for comparison with the paternal counterpart are the signatures of recent expansion in haplogroups frequent in Senegambia, namely haplogroups L2a-L2c, the latter displaying an almost starlike phylogeny and being particularly frequent in the Mandenka (; Tajima's D and Fu's Fs, our unpublished data). An intriguing increased frequency of L0a1 in the Balanta might parallel A1-M31 and A3b2-M13 Y chromosomes in representing East African traces. Although the founder L0a1 haplotype is shared in an east-to-west corridor, the emerging lineages are exclusive of Guineans, indicating a rapid spread and local expansion after arrival. These may therefore reflect the arrival of their ancestors in the Holocene (at about 7 kya, ). Moreover, the exact matches found between Balanta and North Africans in haplogroups L2a, L2b and L3b may represent evidence for their contact and long residence in the territory. L3e4 lineages, thought to signal the western expansion of food-production and iron-smelting, show a moderate frequency of 8% in the Balanta. The absence of mtDNA Bantu-markers [46–49] suggests either that Bantu people contributed very little to the maternal gene pool of Guineans, or that they had a different pool from that associated with the southwards migrations .
The widespread L3e2b is mainly a Felupe-Djola and Papel cluster with probable links to their homeland mirrored in exact matches with East and Central African haplotypes. Lineages within L3h, coalescing at the late Pleistocene/early Holocene in Guineans , exhibit one of the highest found frequencies among the Felupe-Djola (8%). Their increased frequency of West African mtDNA haplogroups L2b and L3d and Y chromosome E1*-M33 could be due to amplification in small founder groups, as these are absent in East Africa.
The mtDNA haplotypes in Guinean Fulbe exhibit a wide range of matches supported by their wide distribution and massive movements in recent history (e.g. ). The high frequency of L1b is otherwise a constant in the Fulbe "world" . Conversely to what is seen on the paternal side, this is the only group that retains statistically significant differences in mtDNA lineages from its ethnic neighbors. As for the Y chromosome, the mtDNA pool of Bijagós shows higher affinity to that of Fulbe, making less likely any connections to the Djola, Papel or Nalú .
The North African mtDNA haplogroups demonstrates partial diffusion to Sahel, namely U6 found in Fulbe and Mandenka and M1b present in Guinea-Bissau Atlantic Bak-speakers ([51, 52]; previously referred to as M1 in ). The U5b1b lineages in Fulbe and Papel are representatives of a link between the Scandinavian Saami and the North African Berbers, emphasizing the great importance of post-glacial expansions . These lineages have most likely crossed the strait of Gibraltar and developed into local clusters, one of which is in West Africa. They do not seem to result from recent gene flow given that the North African Euroasiatic haplogroups H, J and T are absent in our sample.
The analysis of our data provides further evidence for the homogeneity of the Y chromosome gene pool of sub-Saharan West Africans, due to the high frequency of haplogroup E3a-M2. Its frequency and diversity in West Africa are among the highest found, suggesting an early local origin and expansion in the last 20–30 ky. Hypothesizing on the existence of an important local agricultural centre, this could have supported a demographic expansion, on an E3a-M2 background, that almost erased the pre-existing Y chromosome diversity. Its pattern of diversity within Mandenka and Balanta hints at a more marked populational growth, these people possibly related to the local diffusion of agricultural expertise. The Papel and Felupe-Djola people retain traces of their East African relatives, to which the short timescale of residence in Guinea-Bissau and higher isolation from major influences have contributed. In the near absence of archaeological data, the signatures of North, Central and East Africans, traceable in less frequent extant paternal haplogroups, fit well with the linguistic and historical evidence regarding the origin and admixture processes of particular ethnic groups. Minor influences of North and East Africa, in particular, are corroborated by mtDNA data.
A total of 282 Guinea-Bissau unrelated healthy males were analyzed in this survey for the Y chromosome biallelic markers. The sample constitutes a subset of that typed for mtDNA  and therefore follows similar selection criteria and DNA extraction procedures. The present data are published as a Cape Verde source population  but here samples are described by ethnic affiliation. In the aforementioned article the authors were alerted to slight inconsistencies in Figure 2, which are corrected in the present work (see Figure 2), such as the missing 44 haplogroup E1*-M33 individuals. Note that discrepancies were not due to sample mistyping but to typographical errors in the original table.
In order to have a manageable number of units with reasonable sample size, many of the Guinean ethnic groups were clustered: Felupe-Djola includes the homonymous group, Baiote, Cassanga and Beafada; Papel includes Papel, Manjaco and Mancanha; Fulbe clusters Fulbe, Futa-Fulbe, Fulbe-Preto and Fulbe-Forro; Mandenka joins Mandenka, Mansonca, and Sussu; Balanta, Bijagós and Nalú were considered independently. The clustering is not without controversy, but follows pertinent information related either to history, anthropology or linguistics [24, 54–59].
Typing of Y chromosome Binary and Microsatellite Polymorphisms
The hierarchical selection of the following 31 Y chromosome binary markers according to the Y Chromosome Consortium phylogeny [60, 61] allowed the inclusion of each Y chromosome into specific haplogroups: YAP , 92R7 , SRY4064, SRY10831 , P25 , PN2 , M2, M9, M10, M13, M14, M31, M32, M33, M35, M44, M60, M75, M78, M81, M89, M91, M116, M123, M130, M155, M168, M173, M174 and M191 [2, 20]. The typing details of restriction fragment length polymorphisms (RFLPs) and direct sequencing analysis are available from the authors. The Wisconsin Package Version 10.0  was employed to align DNA sequences. The nomenclature and phylogenetic relationship of lineages followed the guidelines proposed by the YCC , referred to in the text by the (sub)haplogroup and the terminal mutation.
The microsatellite variation, previously determined for a subset of 215 individuals  and newly typed for five samples, was associated to the haplogroups. Typing methodology of microsatellites DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS385, DYS437, DYS438 and DYS439 is published elsewhere . An additional GATA microsatellite A7.1 (DYS460 ) was tested for E3b1-M35 chromosomes.
A graphical representation of the haplogroup phylogeny and distribution among ethnic clusters was built in netViz 6.5 . Arlequin program ver. 2.000  was used for the summary statistics on both haplogroup and microsatellite haplotype frequencies for each population unit: diversity indexes ; FST and RST calculation; exact test of population differentiation (DYS385 omitted from the analysis) ; AMOVA tests  with hierarchical clustering of the ethnic groups on geographical, linguistic and religious criteria. PCAs were performed with the software MSVP Version 3.13 m  for haplogroup frequencies of our data and a wide selection of African populations (units as in Figure 4; see Additional files 3 and 4), to generate a more complete picture of the African Y-haplogroup variation and the phylogeographic relationships.
Haplotype networks of microsatellite data were drawn using the Network 188.8.131.52 program . Information on seven microsatellites (DYS19, DYS389I, DYS389II, DYS390, DYS391, DYS392 and DYS393) was sequentially submitted to reduced-median and median joining algorithms [76, 77]. Singletons were excluded from the analysis and the threshold level of 2 was set, with weighted STR loci . The YHRD database and published sources were consulted for exact matches of eight and ten microsatellites (minimal and extended haplotypes, respectively).
(kilo) thousand years ago
Principal Components Analysis
Analysis of Molecular Variance.
Scozzari R, Cruciani F, Santolamazza P, Malaspina P, Torroni A, Sellitto D, Arredi B, Destro-Bisol G, De Stefano G, Rickards O, Martinez-Labarga C, Modiano D, Biondi G, Moral P, Olckers A, Wallace DC, Novelletto A: Combined use of biallelic and microsatellite Y-chromosome polymorphisms to infer affinities among African populations. Am J Hum Genet. 1999, 65: 829-846. 10.1086/302538.
Underhill PA, Shen P, Lin AA, Jin L, Passarino G, Yang WH, Kauffman E, Bonne-tamir B, Bertranpetit J, Francalacci P, Ibrahim M, Jenkins T, Kidd JR, Mehdi SQ, Seielstad MT, Wells RS, Piazza A, Davis RW, Feldman MW, Cavalli-Sforza LL, Oefner PJ: Y chromosome sequence variation and the history of human populations. Nat Genet. 2000, 26: 358-361. 10.1038/81685.
Cruciani F, Santolamazza P, Shen P, Macaulay V, Moral P, Olckers A, Modiano D, Holmes S, Destro-Bisol G, Coia V, Wallace DC, Oefner PJ, Torroni A, Cavalli-Sforza LL, Scozzari R, Underhill PA: A back migration from Asia to sub-Saharan Africa is supported by high-resolution analysis of human Y-chromosome haplotypes. Am J Hum Genet. 2002, 70: 1197-1214. 10.1086/340257.
Pereira L, Gusmao L, Alves C, Amorim A, Prata MJ: Bantu and European Y-lineages in sub-Saharan Africa. Ann Hum Genet. 2002, 66: 369-378. 10.1046/j.1469-1809.2002.00130.x.
Semino O, Santachiara-Benerecetti AS, Falaschi F, Cavalli-Sforza LL, Underhill PA: Ethiopians and Khoisan share the deepest clades of the human Y-chromosome phylogeny. Am J Hum Genet. 2002, 70: 265-268. 10.1086/338306.
Knight A, Underhill PA, Mortensen HM, Zhivotovsky LA, Lin AA, Henn BM, Louis D, Ruhlen M, Mountain JL: African Y chromosome and mtDNA divergence provides insight into the history of click languages. Curr Biol. 2003, 13: 464-473. 10.1016/S0960-9822(03)00130-1.
Wood ET, Stover DA, Ehret C, Destro-Bisol G, Spedini G, McLeod H, Louie L, Bamshad M, Strassmann BI, Soodyall H, Hammer MF: Contrasting patterns of Y chromosome and mtDNA variation in Africa: evidence for sex-biased demographic processes. Eur J Hum Genet. 2005, 13: 867-876. 10.1038/sj.ejhg.5201408.
Alimen H: Evolution du climat et des civilisations depuis 40000 ans du nord au sud du Sahara occidental, Premières conceptions confrontées aux données récentes. Bull L'Assoc Franç L'Êtude Quarternaire. 1987, 4: 215-227.
Mercader J, Martí R: The Middle Stone Age occupation of Atlantic central Africa: new evidence from Equatorial Guinea and Cameroon. Under the Canopy. Edited by: Mercader J. 2003, New Brunswick, Rutgers University Press, 93-118.
Aumassip G, Ferhat N, Heddouche A, Vernet R: Le milieu saharien aux temps préhistoriques. Milieux, hommes et techniques du Sahara préhistorique. Problèmes actuels. Edited by: Aumassip G and a . 1994, Paris, L'Harmattan, 9-29.
Camps G: Les Civilisations Préhistorique de l'Áfrique du Nord et du Sahara. 1974, Paris, Doin
Hassan FA: Archaeological Explorations of the Siwa Oasis Region, Egypt. Curr Anthropol. 1978, 19: 146-148. 10.1086/202020.
Dutour O, Vernet R, Aumassip G: Le peuplement préhistorique du Sahara. Milieux, hommes et techniques du Sahara préhistorique. Problèmes actuels. Edited by: Aumassip G and al . 1988, Paris, L'Harmattan, 39-52.
Clark JD: Africa: From the appearance of Homo sapiens sapiens to the beginnings of food production. Volume I - Prehistory and the Beginnings of Civilization. Edited by: De Laet SJ, co-edited by Dani AH, Lorenzo JL and Nunoo RB. 1994, New York, Routledge, 191-206.
Atherton JH: Excavations at Kamabai and Yagala rock shelters, Sierra Leone. West Afr J Archaeol. 1972, 2: 39-74.
Calvocoressi D, David N: A new survey of radiocarbon and thermoluminescence dates for West Africa. J Afr Hist. 1979, 20: 1-29.
Stahl AB: Reinvestigation of Kintampo 6 rock shelter, Ghana: Implications for the nature of culture change. Afr Archaeol Rev. 1985, 3: 117-150. 10.1007/BF01117457.
Cavalli-Sforza LL, Menozzi P, Piazza A: The History and Geography of Human Genes. 1994, Princeton, Princeton University Press
Passarino G, Semino O, Quintana-Murci L, Excoffier L, Hammer MF, Santachiara-Benerecetti AS: Different genetic components in the Ethiopian population, identified by mtDNA and Y-chromosome polymorphisms. Am J Hum Genet. 1998, 62: 420-434. 10.1086/301702.
Underhill PA, Passarino G, Lin AA, Shen P, Mirazon Lahr M, Foley R, Oefner PJ, Cavalli-Sforza LL: The phylogeography of Y chromosome binary haplotypes and the origins of modern human populations. Ann Hum Genet. 2001, 65: 43-62. 10.1046/j.1469-1809.2001.6510043.x.
Carreira A, Meireles M: Notas sobre os movimentos migratórios da população natural da Guiné-Portuguesa. Bol Cult Guiné Port. 1959, XIV: 7-20.
Moreira JM: Os Fulas da Guiné Portuguesa na panorâmica geral do mundo Fula: os Fulas segundo os nossos cronistas. Bol Cult Guiné Port. 1964, XIX: 417-432.
Fage J: A history of Africa. 1995, London, Routledge
Quintino F: Os povos da Guiné: estrutura social. Bol Cult Guiné Port. 1969, XXIV: 861-915.
Teixeira da Mota A: Guiné Portuguesa. 1954, Lisboa, Agência Geral do Ultramar
Scozzari R, Cruciani F, Santolamazza P, Sellitto D, Cole DE, Rubin LA, Labuda D, Marini E, Succa V, Vona G, Torroni A: mtDNA and Y chromosome-specific polymorphisms in modern Ojibwa: implications about the origin of their gene pool. Am J Hum Genet. 1997, 60: 241-244.
Shen P, Wang F, Underhill PA, Franco C, Yang WH, Roxas A, Sung R, Lin AA, Hyman RW, Vollrath D, Davis RW, Cavalli-Sforza LL, Oefner PJ: Population genetic implications from sequence variation in four Y chromosome genes. Proc Natl Acad Sci U S A. 2000, 97: 7354-7359. 10.1073/pnas.97.13.7354.
Luis JR, Rowold DJ, Regueiro M, Caeiro B, Cinnioglu C, Roseman C, Underhill PA, Cavalli-Sforza LL, Herrera RJ: The Levant versus the Horn of Africa: evidence for bidirectional corridors of human migrations. Am J Hum Genet. 2004, 74: 532-544. 10.1086/382286.
Jobling MA, Hurles ME, Tyler-Smith C: Human Evolutionary Genetics - Origins, Peoples & Disease. 2004, New York, Garland Publishing
Adams JM, Faure H: Review and Atlas of Palaeovegetation: Preliminary land ecosystem maps of the world since the Last Glacial Maximum. 1997, [http://www.esd.ornl.gov/ern/qen/adams1.html]
Kwiatkowski DP: How malaria has affected the human genome and what human genetics can teach us about malaria. Am J Hum Genet. 2005, 77: 171-192. 10.1086/432519.
Zhivotovsky LA, Underhill PA, Cinnioglu C, Kayser M, Morar B, Kivisild T, Scozzari R, Cruciani F, Destro-Bisol G, Spedini G, Chambers GK, Herrera RJ, Yong KK, Gresham D, Tournev I, Feldman MW, Kalaydjieva L: The effective mutation rate at Y chromosome short tandem repeats, with application to human population-divergence time. Am J Hum Genet. 2004, 74: 50-61. 10.1086/380911.
Bosch E, Calafell F, Comas D, Oefner PJ, Underhill PA, Bertranpetit J: High-resolution analysis of human Y-chromosome variation shows a sharp discontinuity and limited gene flow between northwestern Africa and the Iberian Peninsula. Am J Hum Genet. 2001, 68: 1019-1029. 10.1086/319521.
Scozzari R, Cruciani F, Pangrazio A, Santolamazza P, Vona G, Moral P, Latini V, Varesi L, Memmi MM, Romano V, De Leo G, Gennarelli M, Jaruzelska J, Villems R, Parik J, Macaulay V, Torroni A: Human Y-chromosome variation in the western Mediterranean area: implications for the peopling of the region. Hum Immunol. 2001, 62: 871-884. 10.1016/S0198-8859(01)00286-5.
Arredi B, Poloni ES, Paracchini S, Zerjal T, Fathallah DM, Makrelouf M, Pascali VL, Novelletto A, Tyler-Smith C: A predominantly Neolithic origin for Y-chromosomal DNA variation in North Africa. Am J Hum Genet. 2004, 75: 338-345. 10.1086/423147.
Semino O, Passarino G, Oefner PJ, Lin AA, Arbuzova S, Beckman LE, De Benedictis G, Francalacci P, Kouvatsi A, Limborska S, Marcikiae M, Mika A, Mika B, Primorac D, Santachiara-Benerecetti AS, Cavalli-Sforza LL, Underhill PA: The genetic legacy of Paleolithic Homo sapiens sapiens in extant Europeans: a Y chromosome perspective. Science. 2000, 290: 1155-1159. 10.1126/science.290.5494.1155.
Weale ME, Shah T, Jones AL, Greenhalgh J, Wilson JF, Nymadawa P, Zeitlin D, Connell BA, Bradman N, Thomas MG: Rare deep-rooting Y chromosome lineages in humans: lessons for phylogeography. Genetics. 2003, 165: 229-234.
Goncalves R, Rosa A, Freitas A, Fernandes A, Kivisild T, Villems R, Brehm A: Y-chromosome lineages in Cabo Verde Islands witness the diverse geographic origin of its first male settlers. Hum Genet. 2003, 113: 467-472. 10.1007/s00439-003-1007-4.
Cruciani F, La Fratta R, Santolamazza P, Sellitto D, Pascone R, Moral P, Watson E, Guida V, Colomb EB, Zaharova B, Lavinha J, Vona G, Aman R, Cali F, Akar N, Richards M, Torroni A, Novelletto A, Scozzari R: Phylogeographic analysis of haplogroup E3b (E-M215) Y chromosomes reveals multiple migratory events within and out of Africa. Am J Hum Genet. 2004, 74: 1014-1022. 10.1086/386294.
Alves C, Gusmão L, Barbosa J, Amorim A: Evaluating the informative power of Y-STRs: a comparative study using European and new African haplotype data. Forensic Sci Int. 2003, 134: 126-133. 10.1016/S0379-0738(03)00127-0.
Leat N, Benjeddou M, Davison S: Nine-locus Y-chromosome STR profiling of Caucasian and Xhosa populations from Cape Town, South Africa. Forensic Sci Int. 2004, 144: 73-75. 10.1016/j.forsciint.2004.02.022.
Zarrabeitia MT, Riancho JA, Gusmao L, Lareu MV, Sanudo C, Amorim A, Carracedo A: Spanish population data and forensic usefulness of a novel Y-STR set (DYS437, DYS438, DYS439, DYS460, DYS461, GATA A10, GATA C4, GATA H4). Int J Legal Med. 2003, 117: 306-311. 10.1007/s00414-003-0393-4.
YHRD - Y Chromosome Haplotype Reference Database. 2006, [http://www.yhrd.org]
Arroyo-Pardo E, Gusmao L, Lopez-Parra AM, Baeza C, Mesa MS, Amorim A: Genetic variability of 16 Y-chromosome STRs in a sample from Equatorial Guinea (Central Africa). Forensic Sci Int. 2005, 149: 109-113. 10.1016/j.forsciint.2004.06.006.
Rosa A, Brehm A, Kivisild T, Metspalu E, Villems R: MtDNA profile of West Africa Guineans: towards a better understanding of the Senegambia region. Ann Hum Genet. 2004, 68: 340-352. 10.1046/j.1529-8817.2004.00100.x.
Soodyall H, Vigilant L, Hill AV, Stoneking M, Jenkins T: mtDNA control-region sequence variation suggests multiple independent origins of an "Asian-specific" 9-bp deletion in sub-Saharan Africans. Am J Hum Genet. 1996, 58: 595-608.
Watson E, Forster P, Richards M, Bandelt HJ: Mitochondrial footprints of human expansions in Africa. Am J Hum Genet. 1997, 61: 691-704.
Bandelt HJ, Alves-Silva J, Guimaraes PE, Santos MS, Brehm A, Pereira L, Coppa A, Larruga JM, Rengo C, Scozzari R, Torroni A, Prata MJ, Amorim A, Prado VF, Pena SD: Phylogeography of the human mitochondrial haplogroup L3e: a snapshot of African prehistory and Atlantic slave trade. Ann Hum Genet. 2001, 65: 549-563. 10.1046/j.1469-1809.2001.6560549.x.
Pereira L, Macaulay V, Torroni A, Scozzari R, Prata MJ, Amorim A: Prehistoric and historic traces in the mtDNA of Mozambique: insights into the Bantu expansions and the slave trade. Ann Hum Genet. 2001, 65: 439-458. 10.1046/j.1469-1809.2001.6550439.x.
Cerny V, Hajek M, Bromova M, Cmejla R, Diallo I, Brdicka R: MtDNA of Fulani nomads and their genetic relationships to neighboring sedentary populations. Hum Biol. 2006, 78: 9-27. 10.1353/hub.2006.0024.
Plaza S, Calafell F, Helal A, Bouzerna N, Lefranc G, Bertranpetit J, Comas D: Joining the pillars of Hercules: mtDNA sequences show multidirectional gene flow in the western Mediterranean. Ann Hum Genet. 2003, 67: 312-328. 10.1046/j.1469-1809.2003.00039.x.
Olivieri A, Achilli A, Pala M, Battaglia V, Fornarino S, Al-Zahery N, Scozzari R, Cruciani F, Behar DM, Dugoujon JM, Coudray C, Santachiara-Benerecetti AS, Semino O, Bandelt HJ, Torroni A: The mtDNA legacy of the Levantine early Upper Palaeolithic in Africa. Science. 2006, 314: 1767-1770. 10.1126/science.1135566.
Achilli A, Rengo C, Battaglia V, Pala M, Olivieri A, Fornarino S, Magri C, Scozzari R, Babudri N, Santachiara-Benerecetti AS, Bandelt HJ, Semino O, Torroni A: Saami and Berbers - an unexpected mitochondrial DNA link. Am J Hum Genet. 2005, 76: 883-886. 10.1086/430073.
Almada AA: Tratado breve dos rios da Guiné do Cabo Verde. 1964, Lisboa, Editorial LIAM, 156-2nd
Carreira A, Quintino FR: Antroponímia da Guiné Portuguesa. Memórias da Junta de Investigação do Ultramar. 1964, Lisboa, Junta de Investigações do Ultramar
Hair PE: Ethnolinguistic continuity on the Guinea Coast. J Afr Hist. 1967, VIII: 247-268.
Quintino F: Os povos da Guiné. Bol Cult Guiné Port. 1967, XXII: 5-40.
Diallo T: Les institutions politiques du Fouta Djallon. Initiat Études Afr. 1972, Dakar, IFAN, 28:
Lopes C: Kaabunké, espaço, território e poder na Guiné-Bissau, Gâmbia e Casamance pré-coloniais. 1999, Lisboa, Comissão Nacional para as Comemorações dos Descobrimentos Portugueses
Consortium YC: A nomenclature system for the tree of human Y-chromosomal binary haplogroups. Genome Res. 2002, 12: 339-348. 10.1101/gr.217602.
Jobling MA, Tyler-Smith C: The human Y chromosome: an evolutionary marker comes of age. Nat Rev Genet. 2003, 4: 598-612. 10.1038/nrg1124.
Hammer MF, Horai S: Y chromosomal DNA variation and the peopling of Japan. Am J Hum Genet. 1995, 56: 951-962.
Mathias N, Bayes M, Tyler-Smith C: Highly informative compound haplotypes for the human Y chromosome. Hum Mol Genet. 1994, 3: 115-123. 10.1093/hmg/3.1.115.
Whitfield LS, Sulston JE, Goodfellow PN: Sequence variation of the human Y chromosome. Nature. 1995, 378: 379-380. 10.1038/378379a0.
Hammer MF, Redd AJ, Wood ET, Bonner MR, Jarjanazi H, Karafet T, Santachiara-Benerecetti AS, Oppenheim A, Jobling MA, Jenkins T, Ostrer H, Bonne-Tamir B: Jewish and Middle Eastern non-Jewish populations share a common pool of Y-chromosome biallelic haplotypes. Proc Natl Acad Sci U S A. 2000, 97: 6769-6774. 10.1073/pnas.100115997.
Group GC: Wisconsin Package Version 10.0. 2005, Madison, Wisconsin
Rosa A, Ornelas C, Brehm A, Villems R: Population data on 11 Y-chromosome STRs from Guiné-Bissau. Forensic Sci Int. 2006, 157: 210-217. 10.1016/j.forsciint.2005.04.005.
White PS, Tatum OL, Deaven LL, Longmire JL: New, male-specific microsatellite markers from the human Y chromosome. Genomics. 1999, 57: 433-437. 10.1006/geno.1999.5782.
NetViz 6.5 - NetViz LCC Corporation. 2002, [http://www.netviz.com]
Schneider S, Roessli D, Excoffier L: Arlequin version 2.000: a software for population genetics data analysis. 2000, [http://anthropologie.unige.ch/arlequin]
Nei M: Molecular Evolutionary Genetics. 1987, New York, Columbia University Press, 145-63.
Raymond C, Rousset F: An exact test for population differentiation. Evolution. 1995, 49: 1280-1283. 10.2307/2410454.
Excoffier L, Smouse PE, Quattro JM: Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics. 1992, 131: 479-491.
Kovach WL, Services KC: MVSP - A multi-variate statistical package for Windows ver 3.13m. 2004, [http://www.kovcomp.co.uk/mvsp/index.html]
Network 184.108.40.206 - Fluxus Technology Ltd. 2004, [http://www.fluxus-engineering.com]
Bandelt HJ, Forster P, Sykes BC, Richards MB: Mitochondrial portraits of human populations using median networks. Genetics. 1995, 141: 743-753.
Bandelt HJ, Forster P, Röhl A: Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol. 1999, 16: 37-48.
Helgason A, Sigurdardottir S, Nicholson J, Sykes B, Hill EW, Bradley DG, Bosnes V, Gulcher JR, Ward R, Stefansson K: Estimating Scandinavian and Gaelic ancestry in the male settlers of Iceland. Am J Hum Genet. 2000, 67: 697-717. 10.1086/303046.
Hammer MF, Karafet TM, Redd AJ, Jarjanazi H, Santachiara-Benerecetti AS, Soodyall H, Zegura SL: Hierarchical patterns of global human Y-chromosome diversity. Mol Biol Evol. 2001, 18: 1189-1203.
Semino O, Magri C, Benuzzi G, Lin AA, Al-Zahery N, Battaglia V, Maccioni L, Triantaphyllidis C, Shen P, Oefner PJ, Zhivotovsky LA, King R, Torroni A, Cavalli-Sforza LL, Underhill PA, Santachiara-Benerecetti AS: Origin, diffusion, and differentiation of Y-chromosome haplogroups E and J: inferences on the Neolithization of Europe and later migratory events in the Mediterranean area. Am J Hum Genet. 2004, 74: 1023-1034. 10.1086/386295.
The authors are grateful for permissions to collect blood samples by the Chairman of the Joint Chiefs of Staff and the Ministry of Health of the Republic of Guinea-Bissau. AMI – Assistência Médica Internacional gave local support. The laboratory work has been possible thanks to the technical help of Siiri Rootsi and Jüri Parik from the Department of Evolutionary Biology, EBC, Estonia and Ana Teresa Fernandes and Rita Gonçalves from the Human Genetics Laboratory, University of Madeira. Alexandra Rosa is beneficiary of the fellowship grant SFRH/BD/12173/2003 from FCT, Fundação para a Ciência e Tecnologia. MAJ is supported by a Wellcome Trust Senior Fellowship in Basic Biomedical Science (057559). António Brehm received a grant from the Regional Government of Madeira (Portugal).
AR conceived the study design and together with CO carried out the molecular genetic typing. AR performed the statistical analysis and interpreted the data to draft the manuscript. MJ, AB and RV have been involved in drafting and revising the manuscript, whose final version was read and approved by all.