CDD: a conserved domain database for interactive domain family analysis

Marchler-Bauer, Aron; Anderson, John B.; Derbyshire, Myra K.; DeWeese-Scott, Carol; Gonzales, Noreen R.; Gwadz, Marc; Hao, Luning; He, Siqian; Hurwitz, David I.; Jackson, John D.; Ke, Zhaoxi; Krylov, Dmitri; Lanczycki, Christopher J.; Liebert, Cynthia A.; Liu, Chunlei; Lu, Fu; Lu, Shennan; Marchler, Gabriele H.; Mullokandov, Mikhail; Song, James S.; Thanki, Narmada; Yamashita, Roxanne A.; Yin, Jodie J.; Zhang, Dachuan; Bryant, Stephen H.

Published in

Oxford University Press, Nucleic Acids Research, Database(35), p. D237-D240, 2007

DOI: 10.1093/nar/gkl951

Tools

Export citation

Search in Google Scholar

CDD: a conserved domain database for interactive domain family analysis

Journal article published in 2007 by Aron Marchler-Bauer

, John B. Anderson, Myra K. Derbyshire, Carol DeWeese-Scott, Noreen R. Gonzales, Marc Gwadz, Luning Hao, Siqian He, David I. Hurwitz, John D. Jackson, Zhaoxi Ke, Dmitri Krylov, Christopher J. Lanczycki, Cynthia A. Liebert, Chunlei Liu and other authors.

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

The conserved domain database (CDD) is part of NCBI's Entrez database system and serves as a primary resource for the annotation of conserved domain footprints on protein sequences in Entrez. Entrez's global query interface can be accessed at and will search CDD and many other databases. Domain annotation for proteins in Entrez has been pre-computed and is readily available in the form of ‘Conserved Domain’ links. Novel protein sequences can be scanned against CDD using the CD-Search service; this service searches databases of CDD-derived profile models with protein sequence queries using BLAST heuristics, at . Protein query sequences submitted to NCBI's protein BLAST search service are scanned for conserved domain signatures by default. The CDD collection contains models imported from Pfam, SMART and COG, as well as domain models curated at NCBI. NCBI curated models are organized into hierarchies of domains related by common descent. Here we report on the status of the curation effort and present a novel helper application, CDTree, which enables users of the CDD resource to examine curated hierarchies. More importantly, CDD and CDTree used in concert, serve as a powerful tool in protein classification, as they allow users to analyze protein sequences in the context of domain family hierarchies.

Published in

Links

Tools

CDD: a conserved domain database for interactive domain family analysis

Abstract