SIMAP - The similarity matrix of proteins

Arnold, Roland; Rattei, Thomas; Tischler, Patrick; Truong, Minh-Duc; Stümpflen, Volker; Mewes, Werner

Published in

Oxford University Press, Nucleic Acids Research, 90001(34), p. D252-D256, 2006

DOI: 10.1093/nar/gkj106

Oxford University Press (OUP), Bioinformatics, Suppl 2(21), p. ii42-ii46

DOI: 10.1093/bioinformatics/bti1107

Journal article published in 2005 by Roland Arnold, Thomas Rattei, Patrick Tischler, Minh-Duc Truong, Volker Stümpflen, Werner Mewes

This paper is made freely available by the publisher.

Abstract

Similarity Matrix of Proteins (SIMAP) (http://mips.gsf.de/simap) provides a database based on a pre-computed similarity matrix covering the similarity space formed by >4 million amino acid sequences from public databases and completely sequenced genomes. The database is capable of handling very large datasets and is updated incrementally. For sequence similarity searches and pairwise alignments, we implemented a grid-enabled software system, which is based on FASTA heuristics and the Smith-Waterman algorithm. Our ProtInfo system allows querying by protein sequences covered by the SIMAP dataset as well as by fragments of these sequences, highly similar sequences and title words. Each sequence in the database is supplemented with pre-calculated features generated by detailed sequence analyses. By providing WWW interfaces as well as web-services, we offer the SIMAP resource as an efficient and comprehensive tool for sequence similarity searches.
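
The abstract names the Smith-Waterman algorithm and an incrementally updated, pre-computed similarity matrix. The following is a minimal illustrative sketch of those two ideas in Python, not SIMAP's actual implementation. The scoring values (match, mismatch, linear gap penalty) and the helper names `smith_waterman_score` and `add_sequence` are assumptions made for this example; a real protein search would normally use a substitution matrix and affine gap penalties, and SIMAP additionally applies FASTA heuristics before alignment.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Assumed scoring scheme: fixed match/mismatch scores and a linear
    gap penalty (illustrative values, not taken from the paper).
    """
    # prev holds the previous row of the dynamic-programming matrix.
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            # Local alignment: scores never drop below zero.
            curr[j] = max(0, diag, up, left)
            best = max(best, curr[j])
        prev = curr
    return best


def add_sequence(sequences, scores, name, seq):
    """Hypothetical incremental update: align the new sequence only
    against sequences already stored, then register it."""
    for other_name, other_seq in sequences.items():
        scores[frozenset((name, other_name))] = smith_waterman_score(seq, other_seq)
    sequences[name] = seq


sequences, scores = {}, {}
add_sequence(sequences, scores, "p1", "ACDE")
add_sequence(sequences, scores, "p2", "ACDF")
print(scores[frozenset(("p1", "p2"))])
```

Because existing pairs are never recomputed, adding one sequence to a set of n costs n alignments rather than a full rebuild, which is the general motivation for an incremental update scheme.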
