Scripts and data of the genetic analysis of Syrah x Grenache progeny

DOI

R-scripts and data associated with the publication "Harnessing multivariate, penalized regression methods for genomic prediction and QTL detection of drought-related traits in grapevine". Article abstract: Viticulture has to cope with climate change and decrease pesticide inputs, while maintaining yield and wine quality. Breeding is a key lever to meet this challenge, and genomic prediction is a promising tool to accelerate breeding programs. Multivariate methods are potentially more accurate than univariate ones. Moreover, some prediction methods also provide marker selection, thus allowing quantitative trait loci (QTLs) detection and allowing the identification of positional candidate genes. To study both genomic prediction and QTL detection for drought-related traits in grapevine, we applied several methods, interval mapping as well as univariate and multivariate penalized regression, in a bi-parental progeny. We used a new denser genetic map, simulated two traits under four QTL configurations, and re-analyzed 14 traits measured in semi-controlled conditions under different watering conditions. According to our simulations, we recommend the penalized regression method Elastic Net (EN) for genomic prediction, and controlling the marginal False Discovery Rate on EN selected markers to prioritize the QTLs. Indeed, penalized methods were more powerful than interval mapping for QTL detection across various genetic architectures. Multivariate prediction did not perform better than its univariate counterpart, despite strong genetic correlation between traits. Using experimental data, penalized regression methods proved as very efficient for intra-population prediction whatever the genetic architecture of the trait, with accuracies reaching 0.68. These methods applied on the denser map found new QTLs controlling traits linked to drought tolerance and provided relevant candidate genes. Overall, these findings provide a strong evidence base for implementing genomic prediction in grapevine breeding.

Scripts and data associated with the SNP genetic map of Syrah x Grenache progeny are available at: https://doi.org/10.15454/QEDX2V.

Raw phenotypic data of Syrah x Grenache progeny in the phenotypic platform PhenoArch are available at: https://doi.org/10.15454/YTRKV6.

R, 3.6.3

RStudio, 1.2.5033

Identifier
DOI https://doi.org/10.15454/NOUQY2
Metadata Access https://entrepot.recherche.data.gouv.fr/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.15454/NOUQY2
Provenance
Creator Brault, Charlotte ORCID logo; Flutre, Timothée ORCID logo; Doligez, Agnès ORCID logo
Publisher Recherche Data Gouv
Contributor Brault, Charlotte; Doligez, Agnès
Publication Year 2021
Rights etalab 2.0; info:eu-repo/semantics/openAccess; https://spdx.org/licenses/etalab-2.0.html
OpenAccess true
Contact Brault, Charlotte (INRAE - Institut national de recherche pour l'agriculture, l'alimentation et l'environnement); Doligez, Agnès (INRAE - Institut national de recherche pour l'agriculture, l'alimentation et l'environnement)
Representation
Resource Type Software; Dataset
Format text/html; text/markdown; text/x-r-markdown; application/x-shellscript; text/tab-separated-values; image/png; text/x-r-source; text/tsv; application/pdf; text/plain; application/octet-stream
Size 31033; 22059; 14956; 33581; 50075; 743; 2581; 56892; 39952; 340355; 20783; 18166; 30213; 11102; 4202; 18020; 10362; 8589; 9545; 11303; 10087; 11056; 12604; 8206; 9216; 11067; 14298; 16147; 10389; 8639; 9640; 11598; 10135; 8274; 9437; 10967; 10069; 8218; 9291; 10457; 8194; 9229; 10580; 14479; 16089; 9033; 10142; 15266; 8894; 15344; 16229; 4694; 18328; 14664; 7501; 14621; 107800; 9268; 172367; 4047; 2102; 1914; 2072; 3128; 7781486; 29329; 28070; 111505; 18035; 6211540; 183791; 12437608; 393518; 91303; 2763; 1767; 1554248; 2340844; 121960; 1736; 10511829; 52651; 36499456
Version 1.2
Discipline Agriculture, Forestry, Horticulture; Computer Science; Life Sciences; Agricultural Sciences; Agriculture, Forestry, Horticulture, Aquaculture; Agriculture, Forestry, Horticulture, Aquaculture and Veterinary Medicine; Basic Biological and Medical Research; Biology; Medicine; Omics; Plant Science; Information Science