Improving the Protein-Protein Interaction Prediction Engine (PIPE) with Protein Physicochemical Properties

Public Deposited
Resource Type
Creator
Abstract
  • Protein-protein interactions (PPI) serve an important role in both protein and cell function. They are difficult and time consuming to determine experimentally and thus benefit from in silico prediction methods. This thesis improves a high throughput, sequence-based protein-protein interaction prediction method called the protein-protein interaction engine (PIPE). A Python implementation of the scoring of PIPE was developed. Subsequently, a sequence-based solvent accessibility approach was integrated with PIPE, improving PPI prediction recall by 0.9% at 90% precision. Finally, 166 different sequence-based physicochemical properties were generated using the ProtDCal software tool and were integrated with PIPE using the framework developed in this thesis. The best of these properties improved the recall of PIPE by 2% at 90% precision. This improvement was shown to be statistically significant and was confirmed on a larger test set including 10,000 protein pairs known to interact and 10,000 randomly selected pairs, assumed not to interact.

Subject
Language
Publisher
Thesis Degree Level
Thesis Degree Name
Thesis Degree Discipline
Identifier
Rights Notes
  • Copyright © 2020 the author(s). Theses may be used for non-commercial research, educational, or related academic purposes only. Such uses include personal study, research, scholarship, and teaching. Theses may only be shared by linking to Carleton University Institutional Repository and no part may be used without proper attribution to the author. No part may be used for commercial purposes directly or indirectly via a for-profit platform; no adaptation or derivative works are permitted without consent from the copyright owner.

Date Created
  • 2020

Relations

In Collection:

Items