Characterizing and Predicting Catalytic Residues in Enzyme Active Sites Based on Local Properties: A Machine Learning Approach

Bobadilla, Leonardo; Nino, Fernando; Cepeda, Edilberto; Patarroyo, Manuel A.

Published in

2007 IEEE 7th International Symposium on BioInformatics and BioEngineering

DOI: 10.1109/bibe.2007.4375671

Tools

Export citation

Search in Google Scholar

Characterizing and Predicting Catalytic Residues in Enzyme Active Sites Based on Local Properties: A Machine Learning Approach

Proceedings article published in 2007 by Leonardo Bobadilla, Fernando Nino, Edilberto Cepeda, Manuel A. Patarroyo

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Developing computational methods for assigning protein function from tertiary structure is a very important problem, predicting a catalytic mechanism based only on structural information being a particularly challenging task. This work focuses on helping to understand the molecular basis of catalysis by exploring the nature of catalytic residues, their environment and characteristic properties in a large data set of enzyme structures and using this information to predict enzyme structures' active sites. A machine learning approach that performs feature extraction, clustering and classification on a protein structure data set is proposed. The 6,376 residues directly involved in enzyme catalysis, present in more than 800 proteins structures in the PDB were analyzed. Feature extraction provided a description of critical features for each catalytic residue, which were consistent with prior knowledge about them. Results from k-fold-cross-validation for classification showed more than 80% accuracy. Complete enzymes were scanned using these classifiers to locate catalytic residues.

Published in

Links

Tools

Characterizing and Predicting Catalytic Residues in Enzyme Active Sites Based on Local Properties: A Machine Learning Approach

Abstract