Motivation: Massive amounts of high-throughput genomics data profiled from tumor samples were made publicly available by the Cancer Genome Atlas (TCGA). Results: We have developed an open source software package, TCGA2STAT, to obtain the TCGA data, wrangle it, and pre-process it into a format ready for multivariate and integrated statistical analysis in the R environment. In a user-friendly format with one single function call, our package downloads and fully processes the desired TCGA data to be seamlessly integrated into a computational analysis pipeline. No further technical or biological knowledge is needed to utilize our software, thus making TCGA data easily accessible to data scientists without specific domain knowledge. Availability and implementation: TCGA2STAT is available from the https://cran.r-project.org/web/packages/TCGA2STAT/index.html. Supplementary information: Supplementary data are available at Bioinformatics online. Contact:
CITATION STYLE
Wan, Y. W., Allen, G. I., & Liu, Z. (2016). TCGA2STAT: Simple TCGA data access for integrated statistical analysis in R. Bioinformatics, 32(6), 952–954. https://doi.org/10.1093/bioinformatics/btv677
Mendeley helps you to discover research relevant for your work.