Most investigations into the large-scale patterns of protein evolution are based on gene annotations that have been compiled in reference databases. The use of these resources for quantitative comparisons, however, is complicated by sometimes vast differences in coverage. More importantly, however, we also observe substantial ascertainment biases that cannot be removed by simple normalization procedures. A striking example is provided by the correlations between protein domains. We observe that statistics derived from different computational gene annotation procedure show dramatic discrepancies, and even qualitative changes from negative to positive correlation, when compared to statistics obtained from annotation databases.________________________________________GRAPHICAL ABSTRACT
CITATION STYLE
A. Parikesit, A., Steiner, L., F. Stadler, P., & J. Prohaska, S. (2014). Pitfalls of ascertainment biases in genome annotations—computing comparable protein domain distributions in eukarya. Malaysian Journal of Fundamental and Applied Sciences, 10(2). https://doi.org/10.11113/mjfas.v10n2.57
Mendeley helps you to discover research relevant for your work.