A novel computational approach for predicting RNA binding proteins using optimum path forest with hybrid features

Thumbnail Image

Date

2019-09

Journal Title

Journal ISSN

Volume Title

Type

Article

Publisher

INNOVATIVE SCIENTIFIC INFORMATION & SERVICES NETWORK, INNOVATIVE SCIENTIFIC INFORMATION & SERVICES NETWORK,

Series Info

BIOSCIENCE RESEARCH;Volume: 16 Issue: 3 Pages: 2699-2709

Doi

Abstract

RNA - Protein interactions have vital roles in several cellular processes such as RNA transfer, gene regulation at the transcriptional processes and sequence encoding. RNA-binding prediction is a very important aspect of the analysis that helps in identifying the motifs that bind to DNA and for gene regulations. Predicting and recognizing the proteins that bind to RNA is a major challenging and complex process due to structural biology. Previously, several computational methods have been used and developed for predicting RNA-binding proteins (RBPs) using Support Vector Machine other than many other machine learning techniques. This paper proposes a novel computational approach for predicting RBPs using Optimum Path Forest (OPF) classifier in conjunction with the information of predicted RNA-binding residues. Moreover, the statistical information, mainly the singlet and doublet propensity, have been taken into consideration. For a given protein, its RNA-binding residues are predicted and then checked whether the protein binds to RNA or not through positive and negative samples based on the information from that prediction methodology. The results for the previous step can be classified as "Binding Protein", "Nonbinding Protein", "Binding Protein predicted as Non-Binding Protein" and "Nonbinding Protein predicted as Binding Protein", and in this case if the protein cannot be identified then the OPF classifier is used to determine the protein prediction status. The OPF classifier is used incorporated with the amino acid composition feature. The results showed that the statistical information and the binding propensity measures of the predicted RNA-binding residues especially contributed to the prediction process. In addition, the classifier has improved the overall performance of RBPs prediction process

Description

WOS:000498511100030

Keywords

October University for university of Gene Regulations, Optimum Path Forest Classifier, Prediction, RNA-binding proteins (RBPs), Non- binding proteins, Motifs, Transcriptional Processes, Biomolecules interactions, SITES, DNA, RECOGNITION, RESIDUES, SEQUENCE, SERVER

Citation