In the age of post-genomics, the task of improving existing annotation is one of the major challenge. The sequenced transcriptome allows to revisit the annotated sequenced genome of the corresponding organism and improve the existing gene models. In addition, misleading annotations propagate in multiple databases by comparative approaches of annotation, automatic annotation, and lack of curating power in the face of large data volume. In this pursuit, re-annotated improved gene models can prevent misleading structural and functional annotation of genes and proteins. In this chapter, we will highlight annotation and re-annotation procedures and will explain how annotations can be improved using computational methods. Our integrative workflow can be used to re-annotate genomes of any sequenced eukaryotic organism. We describe the annotation of splice sites, open reading frames, encoded proteins and peptides, hints for functional annotation including phylogenetic and domain analysis as well as critical evaluation of data transfer procedures, and the genome annotation process.
CITATION STYLE
Gupta, S. K., Bencurova, E., Srivastava, M., Pahlavan, P., Balkenhol, J., & Dandekar, T. (2016). Improving re-annotation of annotated eukaryotic genomes. In Big Data Analytics in Genomics (pp. 171–195). Springer International Publishing. https://doi.org/10.1007/978-3-319-41279-5_5
Mendeley helps you to discover research relevant for your work.