Pitfalls of ascertainment biases in genome annotations—computing comparable protein domain distributions in eukarya

Arli A. Parikesit; Lydia Steiner; Peter F. Stadler; Sonja J. Prohaska

Journal ArticleOPEN ACCESS

Pitfalls of ascertainment biases in genome annotations—computing comparable protein domain distributions in eukarya

A. Parikesit A
Steiner L
F. Stadler P
et al.

Malaysian Journal of Fundamental and Applied Sciences (2014) 10(2)

DOI: 10.11113/mjfas.v10n2.57

N/ACitations

12Readers

Abstract

Most investigations into the large-scale patterns of protein evolution are based on gene annotations that have been compiled in reference databases. The use of these resources for quantitative comparisons, however, is complicated by sometimes vast differences in coverage. More importantly, however, we also observe substantial ascertainment biases that cannot be removed by simple normalization procedures. A striking example is provided by the correlations between protein domains. We observe that statistics derived from different computational gene annotation procedure show dramatic discrepancies, and even qualitative changes from negative to positive correlation, when compared to statistics obtained from annotation databases.________________________________________GRAPHICAL ABSTRACT

Cite

CITATION STYLE

APA

A. Parikesit, A., Steiner, L., F. Stadler, P., & J. Prohaska, S. (2014). Pitfalls of ascertainment biases in genome annotations—computing comparable protein domain distributions in eukarya. Malaysian Journal of Fundamental and Applied Sciences, 10(2). https://doi.org/10.11113/mjfas.v10n2.57

Pitfalls of ascertainment biases in genome annotations—computing comparable protein domain distributions in eukarya

Abstract

Cite

Register to see more suggestions