Header menu link for other important links
X
Feature selection and classification of protein-protein complexes based on their binding affinities using machine learning approaches
Published in John Wiley and Sons Inc.
2014
PMID: 24648146
Volume: 82
   
Issue: 9
Pages: 2088 - 2096
Abstract
Protein-protein interactions are intrinsic to virtually every cellular process. Predicting the binding affinity of protein-protein complexes is one of the challenging problems in computational and molecular biology. In this work, we related sequence features of protein-protein complexes with their binding affinities using machine learning approaches. We set up a database of 185 protein-protein complexes for which the interacting pairs are heterodimers and their experimental binding affinities are available. On the other hand, we have developed a set of 610 features from the sequences of protein complexes and utilized Ranker search method, which is the combination of Attribute evaluator and Ranker method for selecting specific features. We have analyzed several machine learning algorithms to discriminate protein-protein complexes into high and low affinity groups based on their Kd values. Our results showed a 10-fold cross-validation accuracy of 76.1% with the combination of nine features using support vector machines. Further, we observed accuracy of 83.3% on an independent test set of 30 complexes. We suggest that our method would serve as an effective tool for identifying the interacting partners in protein-protein interaction networks and human-pathogen interactions based on the strength of interactions. © 2014 Wiley Periodicals, Inc.
About the journal
JournalData powered by TypesetProteins: Structure, Function and Bioinformatics
PublisherData powered by TypesetJohn Wiley and Sons Inc.
ISSN08873585
Open AccessNo
Concepts (35)
  •  related image
    Amino acid
  •  related image
    Ligand
  •  related image
    Protein
  •  related image
    PROTEIN PROTEIN COMPLEX
  •  related image
    Unclassified drug
  •  related image
    MULTIPROTEIN COMPLEX
  •  related image
    Protein binding
  •  related image
    Amino acid sequence
  •  related image
    Area under the curve
  •  related image
    Article
  •  related image
    Binding affinity
  •  related image
    Binding site
  •  related image
    HOST PATHOGEN INTERACTION
  •  related image
    pH
  •  related image
    Priority journal
  •  related image
    Protein protein interaction
  •  related image
    Protein secondary structure
  •  related image
    Random forest
  •  related image
    Sensitivity and specificity
  •  related image
    Support vector machine
  •  related image
    Temperature
  •  related image
    Algorithm
  •  related image
    Artificial intelligence
  •  related image
    Biology
  •  related image
    Chemical structure
  •  related image
    Classification
  •  related image
    Metabolism
  •  related image
    Protein database
  •  related image
    Algorithms
  •  related image
    Computational biology
  •  related image
    Databases, protein
  •  related image
    Models, molecular
  •  related image
    Multiprotein complexes
  •  related image
    Proteins
  •  related image
    Support vector machines