University of Worcester Worcester Research and Publications
 
  USER PANEL:
  ABOUT THE COLLECTION:
  CONTACT DETAILS:

Predicting Primary Sequence-Based Protein-Protein Interactions Using a Mercer Series Representation of Nonlinear Support Vector Machine

Chatrabgoun, O. ORCID logoORCID: https://orcid.org/0000-0001-5025-4760, Daneshkhah, A., Esmaeilbeigi, M. ORCID logoORCID: https://orcid.org/0000-0001-5331-0393, Sohrabi Safa, Nader ORCID logoORCID: https://orcid.org/0000-0003-4897-0084, Alenezi, A. ORCID logoORCID: https://orcid.org/0000-0002-8469-880X and Rahman, A. (2022) Predicting Primary Sequence-Based Protein-Protein Interactions Using a Mercer Series Representation of Nonlinear Support Vector Machine. IEEE Access, 10. pp. 124345-124354. ISSN 2169-3536

[thumbnail of IEEE access.pdf]
Preview
Text
IEEE access.pdf - Published Version
Available under License Creative Commons Attribution.

Download (2MB) | Preview

Abstract

The prediction of protein-protein interactions (PPIs) is essential to understand the cellular processes from a medical perspective. Among the various machine learning techniques, kernel-based Support Vector Machine (SVM) has been commonly employed to discriminate between interacting and non-interacting protein pairs. The main drawback of employing the kernel-based SVM to datasets with many features, such as the primary sequence-based protein-protein dataset, is the significant increase in computational time of training stage. This increase in computational time is mainly due to the presence of the kernel in solving the quadratic optimisation problem (QOP) involved in nonlinear SVM. In order to fix this issue, we propose a novel and efficient computational algorithm by approximating the kernel-based SVM using a low-rank truncated Mercer series as well as desired. As a result, the QOP for the approximated kernel-based SVM will be very tractable in the sense that there is a significant reduction in computational time of training and validating stages. We illustrate the novelty of the proposed method by predicting the PPIs of “S. Cerevisiae” where the protein features extracted using the multiscale local descriptor (MLD), and then we compare the predictive performance of the proposed low-rank approximation with the existing methods. Finally, the new method results in significant reduction in computational time for predicting PPIs with almost as accuracy as kernel-based SVM.

Item Type: Article
Subjects: T Technology > T Technology (General)
Divisions: College of Business, Psychology and Sport > Worcester Business School
Related URLs:
Copyright Info: Open Access, This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
Depositing User: Nader Sohrabisafa
Date Deposited: 13 Jun 2025 13:53
Last Modified: 13 Jun 2025 13:53
URI: https://eprints.worc.ac.uk/id/eprint/15037

Actions (login required)

View Item View Item
 
     
Worcester Research and Publications is powered by EPrints 3 which is developed by the School of Electronics and Computer Science at the University of Southampton. More information and software credits.