The conserved domain database in 2023

Jiyao Wang; Farideh Chitsaz; Myra K. Derbyshire; Noreen R. Gonzales; Marc Gwadz; Shennan Lu; Gabriele H. Marchler; James S. Song; Narmada Thanki; Roxanne A. Yamashita; Mingzhang Yang; Dachuan Zhang; Chanjuan Zheng; Christopher J. Lanczycki; Aron Marchler-Bauer

Journal ArticleOPEN ACCESS

The conserved domain database in 2023

Nucleic Acids Research (2023) 51(D1) D384-D388

DOI: 10.1093/nar/gkac1096

771Citations

461Readers

Abstract

NLM's conserved domain database (CDD) is a collection of protein domain and protein family models constructed as multiple sequence alignments. Its main purpose is to provide annotation for protein and translated nucleotide sequences with the location of domain footprints and associated functional sites, and to define protein domain architecture as a basis for assigning gene product names and putative/predicted function. CDD has been available publicly for over 20 years and has grown substantially during that time. Maintaining an archive of pre-computed annotation continues to be a challenge and has slowed down the cadence of CDD releases. CDD curation staff builds hierarchical classifications of large protein domain families, adds models for novel domain families via surveillance of the protein 'dark matter' that currently lacks annotation, and now spends considerable effort on providing names and attribution for conserved domain architectures. CDD can be accessed at https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

Cite

CITATION STYLE

APA

Wang, J., Chitsaz, F., Derbyshire, M. K., Gonzales, N. R., Gwadz, M., Lu, S., … Marchler-Bauer, A. (2023). The conserved domain database in 2023. Nucleic Acids Research, 51(D1), D384–D388. https://doi.org/10.1093/nar/gkac1096

The conserved domain database in 2023

Abstract

Cite

Register to see more suggestions