Sequence-based Protein Interaction Site Prediction using Computer Vision and Deep Learning

Public Deposited
Resource Type
Creator
Abstract
  • Computational prediction of protein-protein interaction (PPI) from protein sequence is important as many cellular functions are made possible through PPI. The Protein Interaction Prediction Engine (PIPE) software suite was developed at Carleton University for such predictions. This thesis aims to conduct a thorough performance assessment of the PIPE-Sites predictor through the use of a large high-quality set of known PPI sites. The results show that PIPE-Sites has relatively low accuracy even after retuning the inherent hyperparameters of the method. Furthermore, PIPE-Sites are shown to be ineffective when applied to similarity-weighted score data. Thus, three new sequence-based methods of predicting PPI sites are proposed and evaluated, including the Panorama, BrightSpot, and ClusterNet methods. The new methods leverage similarity-weighted score data to further increase performance. Ultimately, ClusterNet significantly outperforms the other methods over two different performance metrics when evaluated on both human and yeast data PPI site data.

Subject
Language
Publisher
Thesis Degree Level
Thesis Degree Name
Thesis Degree Discipline
Identifier
Rights Notes
  • Copyright © 2021 the author(s). Theses may be used for non-commercial research, educational, or related academic purposes only. Such uses include personal study, research, scholarship, and teaching. Theses may only be shared by linking to Carleton University Institutional Repository and no part may be used without proper attribution to the author. No part may be used for commercial purposes directly or indirectly via a for-profit platform; no adaptation or derivative works are permitted without consent from the copyright owner.

Date Created
  • 2021

Relations

In Collection:

Items