TCGA case study for ASTERICS

DOI

This dataset is issued from the public repository TCGA (https://portal.gdc.cancer.gov/) and contain several files, each corresponding to a given omic on the same individuals with breast cancer. Raw data have been obtained from the mixOmics case study described in http://mixomics.org/mixdiablo/case-study-tcga/ [link accessed on August 18, 2021] and were made available by the package authors at http://mixomics.org/wp-content/uploads/2016/08/TCGA.normalised.mixDIABLO.RData_.zip (R data format). Data in the zip file had been normalised for technical biases by the package authors. Data from the train and test sets were exported as TXT/CSV files and completed with miRNA expression on the smae individuals and toy datasets to handle missing value cases and alike. They serve as a basis for the illustration of the web data analysis tool ASTERICS (Project 20008788 funded by Région Occitanie).

R, 4.0.4

The Cancer Genome Atlas (TCGA) https://portal.gdc.cancer.gov/ Data dictionnary is available on TCGA website https://docs.gdc.cancer.gov/Data_Dictionary/viewer/

The origin of sources is a public repository where raw original data may be retrieved. Data were preprocessed (normalized) by the mixOmics package authors as described in Supplementary Section S2 of [Singh et al, 2019], where the origin of the dataset is also fully described.

Identifier
DOI https://doi.org/10.15454/YNMQUY
Related Identifier https://doi.org/10.1093/bioinformatics/bty1054
Related Identifier https://doi.org/10.1038/nature11412
Metadata Access https://entrepot.recherche.data.gouv.fr/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.15454/YNMQUY
Provenance
Creator Vialaneix, Nathalie ORCID logo
Publisher Recherche Data Gouv
Contributor Vialaneix, Nathalie
Publication Year 2021
Funding Reference Région Occitanie 20008788
Rights info:eu-repo/semantics/openAccess
OpenAccess true
Contact Vialaneix, Nathalie (INRAE)
Representation
Resource Type Dataset
Format text/csv; type/x-r-syntax; text/x-r-source; text/comma-separated-values; text/plain
Size 2752164; 2148636; 864; 1088; 808595; 33405040; 1003170; 1003176; 812120; 8901
Version 3.0
Discipline Life Sciences; Medicine