CDD: a Conserved Domain Database for the functional annotation of proteins

Marchler-Bauer, Aron; Lu, Shennan; Anderson, John B.; Chitsaz, Farideh; Derbyshire, Myra K.; DeWeese-Scott, Carol; Fong, Jessica H.; Geer, Lewis Y.; Geer, Renata C.; Gonzales, Noreen R.; Gwadz, Marc; Hurwitz, David I.; Jackson, John D.; Ke, Zhaoxi; Lanczycki, Christopher J.; Lu, Fu; Marchler, Gabriele H.; Mullokandov, Mikhail; Omelchenko, Marina V.; Robertson, Cynthia L.; Song, James S.; Thanki, Narmada; Yamashita, Roxanne A.; Zhang, Dachuan; Zhang, Naigong; Zheng, Chanjuan; Bryant, Stephen H.

Published in

Oxford University Press, Nucleic Acids Research, Database(39), p. D225-D229, 2010

DOI: 10.1093/nar/gkq1189

Tools

Export citation

Search in Google Scholar

CDD: a Conserved Domain Database for the functional annotation of proteins

Journal article published in 2010 by Aron Marchler-Bauer

, Shennan Lu, John B. Anderson, Farideh Chitsaz, Myra K. Derbyshire, Carol DeWeese-Scott, Jessica H. Fong, Lewis Y. Geer, Renata C. Geer, Noreen R. Gonzales, Marc Gwadz, David I. Hurwitz, John D. Jackson, Zhaoxi Ke, Christopher J. Lanczycki and other authors.

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.

Published in

Links

Tools

CDD: a Conserved Domain Database for the functional annotation of proteins

Abstract