Identifying differentially expressed genes from cross-site integrated data based on relative expression orderings

21Citations
Citations of this article
25Readers
Mendeley users who have this article in their library.

Abstract

It is a basic task in high-throughput gene expression profiling studies to identify differentially expressed genes (DEGs) between two phenotypes. But the weakly differential expression signals between two phenotypes are hardly detectable with limited sample sizes. To solve this problem, many researchers tried to combine multiple independent datasets using meta-analysis or batch effect adjustment algorithms. However, these algorithms may distort true biological differences between two phenotypes and introduce unacceptable high false rates, as demonstrated in this study. These problems pose critical obstacles for analyzing the transcriptional data in The Cancer Genome Atlas where there are many small-scale batches of data. Previously, we developed RankComp to detect DEGs for individual disease samples through exploiting the incongruous relative expression orderings between two phenotypes and further improved it here to identify DEGs using multiple independent datasets. We demonstrated the improved RankComp can directly analyze integrated cross-site data to detect DEGs between two phenotypes without the need of batch effect adjustments. Its usage was illustrated in detecting weak differential expression signals of breast cancer drug-response data using combined datasets from multiple experiments.

References Powered by Scopus

Gene ontology: Tool for the unification of biology

32172Citations
N/AReaders
Get full text

Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation

12231Citations
N/AReaders
Get full text

Mapping and quantifying mammalian transcriptomes by RNA-Seq

10989Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Batch effect detection and correction in RNA-seq data using machine-learning-based automated assessment of quality

30Citations
N/AReaders
Get full text

MrGPS: an m6A-related gene pair signature to predict the prognosis and immunological impact of glioma patients

12Citations
N/AReaders
Get full text

Identification of population-level differentially expressed genes in one-phenotype data

6Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Cai, H., Li, X., Li, J., Liang, Q., Zheng, W., Guan, Q., … Wang, X. (2018). Identifying differentially expressed genes from cross-site integrated data based on relative expression orderings. International Journal of Biological Sciences, 14(8), 892–900. https://doi.org/10.7150/ijbs.24548

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 12

80%

Professor / Associate Prof. 2

13%

Researcher 1

7%

Readers' Discipline

Tooltip

Biochemistry, Genetics and Molecular Bi... 6

43%

Agricultural and Biological Sciences 3

21%

Medicine and Dentistry 3

21%

Computer Science 2

14%

Save time finding and organizing research with Mendeley

Sign up for free