Motivation: The prediction of protein stability change upon mutations is a key problem for understanding protein folding and misfolding. Presently methods are available to predict stability changes only when the atomic structure of the protein is available. Methods addressing the same task starting from the protein sequence are however necessary in order to complete genome annotation, especially in relation to single nucleotide polymorphisms (SNPs) and related diseases. Results: We develop a method based on support vector machines (SVM) the starting from the protein sequence predicts the sign and the value of free energy stability change upon single point mutation. We show that the accuracy of our predictor is as high as 77% in the specific task of predicting the ddG sign related to the corresponding protein stability. When predicting the ddG values, a satisfying correlation agreement with the experimental data is also found. As a final blind benchmark, the predictor is applied to proteins with a set of disease-related SNPs, for which thermodynamics data are also known. We found that our predictions corroborate the view that disease-related mutations correspond to decrease of protein stability. Availability: www.gpcr2.biocomp.unibo.it

Predicting protein stability changes from sequence with Support Vector Machines

Fariselli P.;
2005-01-01

Abstract

Motivation: The prediction of protein stability change upon mutations is a key problem for understanding protein folding and misfolding. Presently methods are available to predict stability changes only when the atomic structure of the protein is available. Methods addressing the same task starting from the protein sequence are however necessary in order to complete genome annotation, especially in relation to single nucleotide polymorphisms (SNPs) and related diseases. Results: We develop a method based on support vector machines (SVM) the starting from the protein sequence predicts the sign and the value of free energy stability change upon single point mutation. We show that the accuracy of our predictor is as high as 77% in the specific task of predicting the ddG sign related to the corresponding protein stability. When predicting the ddG values, a satisfying correlation agreement with the experimental data is also found. As a final blind benchmark, the predictor is applied to proteins with a set of disease-related SNPs, for which thermodynamics data are also known. We found that our predictions corroborate the view that disease-related mutations correspond to decrease of protein stability. Availability: www.gpcr2.biocomp.unibo.it
2005
4th European Conference on Computational Biology, ECCB'05/jbi'05
Instituto Nacional de Bioinformatica (INB), Madrid (Spagna)
28/09-1/10 2005
Proceedings of the "4th European Conference on Computational Biology, ECCB'05/jbi'05"
54
54
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2318/1687497
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact