Using the CATH domain database to assign structures and functions to the genome sequences

Pearl, F.; Todd, A. E.; Ae, Todd; Bray, J. E.; Martin, A. C. R.; Aa, Salamov; Salamov, A. A.; Suwa, M.; Swindells, M. B.; Thornton, J. M.; Ca, Orengo; Orengo, C. A.

Published in

Portland Press, Biochemical Society Transactions, 2(28), p. 269-275, 2000

DOI: 10.1042/bst0280269

Tools

Export citation

Search in Google Scholar

Using the CATH domain database to assign structures and functions to the genome sequences

Journal article published in 2000 by F. Pearl, A. E. Todd, Todd Ae, J. E. Bray

, A. C. R. Martin, Salamov Aa, A. A. Salamov, M. Suwa, M. B. Swindells, J. M. Thornton, Orengo Ca, C. A. Orengo

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving forbidden

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

The CATH database of protein structures contains approximately 18000 domains organized according to their (C)lass, (A)rchitecture, (T)opology and (H)omologous superfamily. Relationships between evolutionary related structures (homologues) within the database have been used to test the sensitivity of various sequence search methods in order to identify relatives in Genbank and other sequence databases. Subsequent application of the most sensitive and efficient algorithms, gapped blast and the profile based method, Position Specific Iterated Basic Local Alignment Tool (PSI-BLAST), could be used to assign structural data to between 22 and 36 % of microbial genomes in order to improve functional annotation and enhance understanding of biological mechanism. However, on a cautionary note, an analysis of functional conservation within fold groups and homologous superfamilies in the CATH database, revealed that whilst function was conserved in nearly 55% of enzyme families, function had diverged considerably, in some highly populated families. In these families, functional properties should be inherited far more cautiously and the probable effects of substitutions in key functional residues carefully assessed.

Published in

Links

Tools

Using the CATH domain database to assign structures and functions to the genome sequences

Abstract