Hidden Markov Model-derived structural alphabet for proteins: The learning of protein local shapes captures sequence specificity

Camproux, A. C.; Tufféry, P.

Published in

Elsevier, BBA - General Subjects, 3(1724), p. 394-403

DOI: 10.1016/j.bbagen.2005.05.019

Tools

Export citation

Search in Google Scholar

Hidden Markov Model-derived structural alphabet for proteins: The learning of protein local shapes captures sequence specificity

Journal article published in 2005 by A. C. Camproux, P. Tufféry

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving forbidden

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence.

Published in

Links

Tools

Hidden Markov Model-derived structural alphabet for proteins: The learning of protein local shapes captures sequence specificity

Abstract