RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures

Paladin, Lisanna; Bevilacqua, Martina; Errigo, Sara; Piovesan, Damiano; Mičetić, Ivan; Necci, Marco; Monzon, Alexander Miguel; Fabre, Maria Laura; Lopez, Jose Luis; Nilsson, Juliet F.; Rios, Javier; Menna, Pablo Lorenzano; Cabrera, Maia; Buitron, Martin Gonzalez; Kulik, Mariane Gonçalves; Fernandez-Alberti, Sebastian; Fornasari, Maria Silvina; Parisi, Gustavo; Lagares, Antonio; Hirsh, Layla; Andrade-Navarro, Miguel A.; Kajava, Andrey V.; Tosatto, Silvio C. E.

Published in

Oxford University Press, Nucleic Acids Research, 49(D1), p. D452-D457, 2020

DOI: 10.1093/nar/gkaa1097


This paper is made freely available by the publisher.

Preprint: archiving allowed

Postprint: archiving allowed

Published version: archiving allowed

Abstract

The RepeatsDB database (URL: https://repeatsdb.org/) provides annotations and classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein tandem repeats are ubiquitous in all branches of the tree of life. The accumulation of solved repeat structures provides new possibilities for classification and detection, but also increases the need for annotation. Here we present RepeatsDB 3.0, which addresses these challenges and presents an extended classification scheme. The major conceptual change compared to the previous version is the hierarchical classification, which combines top levels based solely on structural similarity (Class > Topology > Fold) with two new levels (Clan > Family) that require sequence similarity and describe repeat motifs in collaboration with Pfam. Data growth has been addressed with improved mechanisms for browsing the classification hierarchy. A new UniProt-centric view unifies the increasingly frequent annotation of structures from identical or similar sequences. This update of RepeatsDB aligns with our commitment to develop a resource that extracts, organizes and distributes specialized information on tandem repeat protein structures.
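
The five-level hierarchy described in the abstract can be pictured as a simple path-like record. The following is a minimal sketch, not the RepeatsDB data model or API: only the level names (Class, Topology, Fold, Clan, Family) and the split between structure-based and sequence-based levels come from the abstract. The `Classification` type, its methods, and the labels such as "class-A" are hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional

# Level names are taken from the abstract; everything else is illustrative.
LEVELS = ("class", "topology", "fold", "clan", "family")
STRUCTURE_LEVELS = LEVELS[:3]  # based solely on structural similarity
SEQUENCE_LEVELS = LEVELS[3:]   # additionally require sequence similarity


@dataclass(frozen=True)
class Classification:
    """One path through the hierarchy; deeper levels may be unassigned."""
    class_: str
    topology: Optional[str] = None
    fold: Optional[str] = None
    clan: Optional[str] = None
    family: Optional[str] = None

    def path(self) -> List[str]:
        """Return assigned labels from the top down, stopping at the first gap."""
        assigned = []
        for value in (self.class_, self.topology, self.fold, self.clan, self.family):
            if value is None:
                break
            assigned.append(value)
        return assigned

    def deepest_level(self) -> str:
        """Name of the deepest level that has a label."""
        return LEVELS[len(self.path()) - 1]

    def uses_sequence_similarity(self) -> bool:
        """True once the path reaches the Clan or Family level."""
        return len(self.path()) > len(STRUCTURE_LEVELS)


# Hypothetical entries: placeholder labels, not real RepeatsDB identifiers.
structural_only = Classification("class-A", "topology-A1", "fold-A1a")
with_family = Classification("class-A", "topology-A1", "fold-A1a",
                             "clan-X", "family-X1")
```

In this sketch an entry classified only down to Fold reports `deepest_level()` as "fold" and does not depend on sequence similarity, while an entry with Clan and Family labels does. Stopping at the first unassigned level keeps a path from skipping a level, which mirrors the strict nesting implied by the ">" notation.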
