To address the shortage of high-quality proteomic data spanning the animal tree, we have implemented a pipeline that generates de novo assemblies from publicly available data in the NCBI Sequence Read Archive, yielding a comprehensive collection of proteomes from 100 species spanning 21 animal phyla. We have also created the Animal Proteome Database (AniProtDB), a resource that provides open access to this collection of high-quality metazoan proteomes, along with information on predicted proteins and protein domains for each taxonomic classification and the ability to perform sequence similarity searches against all proteomes generated using this pipeline. This resource greatly increases the utility of these data by removing the barrier to access for research groups that lack the expertise or resources to generate the data themselves, and it enables the use of data from nontraditional research organisms that have the potential to address key questions in biomedicine.
Barreira, S. N., Nguyen, A. D., Fredriksen, M. T., Wolfsberg, T. G., Moreland, R. T., & Baxevanis, A. D. (2021). AniProtDB: A Collection of Consistently Generated Metazoan Proteomes for Comparative Genomics Studies. Molecular Biology and Evolution, 38(10), 4628–4633. https://doi.org/10.1093/molbev/msab165