The genetics of East African populations: a Nilo-Saharan component in the African genetic landscape [supplementary information]

DOI

Includes Supplementary Figures S1-S13, Supplementary Tables S1-S8 and Supplementary Methods. Suppl. Fig. S1: Principal component analysis of the new populations genotyped from the Sudanese region; Suppl. Fig. S2: Principal component analysis of six world–wide populations from 1000 Genomes Project using different number of SNPs; Suppl. Fig. S3: Principal component analysis of the new populations genotyped from the Sudanese region; Suppl. Fig. S4: Principal component analysis of the populations from the Sudanese region in the context of the African continent with 14 samples identified as outliers with respect to their populations of origin; Suppl. Fig. S5: Pairwise FST values between the 14 populations; Suppl. Fig. S6: Cross-validation error estimates of the new nine genotyped populations for the ADMIXTURE analysis; Suppl. Fig. S7: ADMIXTURE results for k = 2 through k = 10 for the Sudanese populations; Suppl. Fig. S8: Cross-validation error estimates of the 14 populations for the ADMIXTURE analysis; Suppl. Fig. S9: ADMIXTURE results for k = 2 through k = 10 for the 14 populations using all 921 individuals; Suppl. Fig. S10: ADMIXTURE results for k = 2 through k = 10 for populations from the Sudanese region in the context of other external populations; Suppl. Fig. S11: Principal component analysis of the populations from the Sudanese region in the context of the African continent with an European population added; Suppl. Fig. S12: ADMIXTURE results for k = 2 through k = 10 for populations from the Sudanese region in the context of other external populations; Suppl. Fig. S13: Sampling distribution of the sample mean Global FST between Sudanese populations. Suppl. Table S1: Detailed sample information of the populations analysed in the present study, including sampling location and total number of individuals; Suppl. Table S2: Pairwise FST comparisons among the Sudanese ethnolinguistic groups and neighbouring populations; Suppl. Table S3: Three–population test with Yoruba as outgroup to estimate mixing proportions; Suppl. Table S4: Three–population test with Luya as outgroup to estimate mixing proportions; Suppl. Table S5: List of genes related to resistance to malaria present in the Immunochip; Suppl. Table S6: List of genes belonging to pathways related to antibacterial host defence present in the Immunochip; Suppl. Table S7: List of genes belonging to fungi host defence present in the Immunochip; Suppl. Table S8: Summary statistics of SNPs of disease-related genes from African populations of 1000 Genomes Project compared to the portion of those SNPs genotyped in the Immunochip. The compressed file contains the Sudan Inmunochip dataset in XLSX, BED, BIM and FAM formats.

Identifier
DOI https://doi.org/10.34810/data402
Metadata Access https://dataverse.csuc.cat/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.34810/data402
Provenance
Creator Dobon, Begoña ORCID logo; Hassan, Hisham Y. ORCID logo; Laayouni, Hafid, 1968- ORCID logo; Luisi, Pierre, 1985- ORCID logo; Ricaño Ponce, Isis ORCID logo; Zhernakova, Alexandra; Wijmenga, Cisca ORCID logo; Tahir, Hanan ORCID logo; Comas, David, 1969- ORCID logo; Netea, Mihai G; Bertranpetit, Jaume, 1952- ORCID logo
Publisher CORA.Repositori de Dades de Recerca
Publication Year 2022
Funding Reference European Commission 322698 ; MINECO BFU2013-43726-P ; European Commission 310372
Rights CC BY 4.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/licenses/by/4.0
OpenAccess true
Representation
Resource Type Experimental data; Dataset
Format text/plain; application/vnd.realvnc.bed; application/octet-stream; text/x-c; application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size 3480018; 22054887; 5113054; 9353; 182943; 37462; 27713
Version 1.0
Discipline Geography; Geosciences; Geospheric Sciences; Life Sciences; Medicine; Natural Sciences