Positive and negative forms of replicability in gene network analysis

Abstract

Motivation: Gene networks have become a central tool in the analysis of genomic data but are widely regarded as hard to interpret. This has motivated a great deal of comparative evaluation and research into best practices. We explore the possibility that this may lead to overfitting in the field as a whole.

Results: We construct a model of 'research communities' sampling from real gene network data and machine learning methods to characterize performance trends. Our analysis reveals an important principle limiting the value of replication, namely that targeting it directly causes 'easy' or uninformative replication to dominate analyses. We find that when sampling across network data and algorithms with similar variability, the relationship between replicability and accuracy is positive (Spearman's correlation, rs ∼ 0.33) but where no such constraint is imposed, the relationship becomes negative for a given gene function (rs ∼ −0.13). We predict factors driving replicability in some prior analyses of gene networks and show that they are unconnected with the correctness of the original result, instead reflecting replicable biases. Without these biases, the original results also vanish replicably. We show these effects can occur quite far upstream in network data and that there is a strong tendency within protein-protein interaction data for highly replicable interactions to be associated with poor quality control.
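The abstract summarizes the relationship between replicability and accuracy with Spearman's rank correlation (rs). As a minimal sketch of how such a statistic is computed, the Python below ranks two lists and takes the Pearson correlation of the ranks. The function names and the toy numbers are illustrative assumptions only; they are not the authors' code or data, and the printed value has no bearing on the reported results.

```python
from statistics import mean


def ranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result


def spearman(x, y):
    """Spearman's r_s: Pearson correlation of the rank-transformed inputs."""
    rx, ry = ranks(x), ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Made-up toy scores for five hypothetical analyses (not from the paper).
replicability = [0.2, 0.4, 0.5, 0.7, 0.9]
accuracy = [0.55, 0.50, 0.65, 0.60, 0.70]
print(spearman(replicability, accuracy))
```

A positive value means analyses that replicate more also tend to score as more accurate; a negative value means the opposite trend, which is the situation the abstract describes when sampling is unconstrained.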

Citation

Verleyen, W., Ballouz, S., & Gillis, J. (2016). Positive and negative forms of replicability in gene network analysis. Bioinformatics, 32(7), 1065–1073. https://doi.org/10.1093/bioinformatics/btv734
