Applying machine learning techniques for classifying cyclin-dependent kinase inhibitors

dc.AffiliationOctober University for modern sciences and Arts (MSA)
dc.contributor.authorAbdelbaky I.Z.
dc.contributor.authorAl-Sadek A.F.
dc.contributor.authorBadr A.A.
dc.contributor.otherAgricultural Research Center
dc.contributor.otherCairo
dc.contributor.otherEgypt; Computer Science Department
dc.contributor.otherOctober University for Modern Sciences and Arts
dc.contributor.otherMSA
dc.contributor.otherEgypt; Computer Science department
dc.contributor.otherFaculty of Computers and Information
dc.contributor.otherCairo University Cairo
dc.contributor.otherEgypt
dc.date.accessioned2020-01-09T20:41:04Z
dc.date.available2020-01-09T20:41:04Z
dc.date.issued2018
dc.descriptionScopus
dc.description.abstractThe importance of protein kinases made them a target for many drug design studies. They play an essential role in cell cycle development and many other biological processes. Kinases are divided into different subfamilies according to the type and mode of their enzymatic activity. Computational studies targeting kinase inhibitors identification is widely considered for modelling kinase-inhibitor. This modelling is expected to help in solving the selectivity problem arising from the high similarity between kinases and their binding profiles. In this study, we explore the ability of two machine-learning techniques in classifying compounds as inhibitors or non-inhibitors for two members of the cyclin-dependent kinases as a subfamily of protein kinases. Random forest and genetic programming were used to classify CDK5 and CDK2 kinases inhibitors. This classification is based on calculated values of chemical descriptors. In addition, the response of the classifiers to adding prior information about compounds promiscuity was investigated. The results from each classifier for the datasets were analyzed by calculating different accuracy measures and metrics. Confusion matrices, accuracy, ROC curves, AUC values, F1 scores, and Matthews correlation, were obtained for the outputs. The analysis of these accuracy measures showed a better performance for the RF classifier in most of the cases. In addition, the results show that promiscuity information improves the classification accuracy, but its significant effect was notably clear with GP classifiers. � 2018 International Journal of Advanced Computer Science and Applications.en_US
dc.description.urihttps://www.scimagojr.com/journalsearch.php?q=21100867241&tip=sid&clean=0
dc.identifier.issn2158107X
dc.identifier.urihttps://t.ly/LX1B0
dc.language.isoEnglishen_US
dc.publisherScience and Information Organizationen_US
dc.relation.ispartofseriesInternational Journal of Advanced Computer Science and Applications
dc.relation.ispartofseries9
dc.subjectCDK inhibitorsen_US
dc.subjectGenetic programming classificationen_US
dc.subjectRandom forest classificationen_US
dc.titleApplying machine learning techniques for classifying cyclin-dependent kinase inhibitorsen_US
dc.typeArticleen_US
dcterms.isReferencedByEnna, S., Bylund, D., (2008) XPharm: The Comprehensive Pharmacology Reference, , Amsterdam: Elsevier; Park, H., Kim, K., Kim, C., Shin, J., No, K., 'Descriptor-Based Profile Analysis of Kinase Inhibitors to Predict Inhibitory Activity and to Grasp Kinase Selectivity' (2013) Bulletin of the Korean Chemical Society, 34 (9), pp. 680-2684; Hu, Y., Furtmann, N., Bajorath, J., 'Current Compound Coverage of the Kinome' (2014) Journal of Medicinal Chemistry, 58 (1), pp. 30-40; Malumbres, M., 'Cyclin-dependent kinases' (2014) Genome Biology, 15 (6), p. 122; Taylor, W., Grabovich, A., 'Targeting the Cell Cycle to Kill Cancer Cells' (2009) Pharmacology, pp. 429-453; Sayour, M., Basheer, I., Salam, R., Yamany, M., Badr, A., Al-sadek, A., El-awady, R., 'Potential mechanistic profiling of an otc analgesic as a cytotoxic agent in the treatment of hepatocellular carcinoma' (2018) ACTA Pharmaceutica Sciencia, 56 (1), p. 95; Cheung, Z., Ip, N., 'Cdk5: a multifaceted kinase in neurodegenerative diseases' (2012) Trends in Cell Biology, 22 (3), pp. 169-175; Arif, A., 'Extraneuronal activities and regulatory mechanisms of the atypical cyclin-dependent kinase Cdk5' (2012) Biochemical Pharmacology, 84 (8), pp. 985-993; Pons, T., Vazquez, M., Matey-Hernandez, M., Brunak, S., Valencia, A., Izarzugaza, J., 'KinMutRF: a random forest classifier of sequence variants in the human protein kinase superfamily' (2016) BMC Genomics, 17 (2); Cichonska, A., Ravikumar, B., Parri, E., Timonen, S., Pahikkala, T., Airola, A., Wennerberg, K., Aittokallio, T., 'Computational-experimental approach to drug-target interaction mapping: A case study on kinase inhibitors' (2017) PLOS Computational Biology, 13 (8); Metz, J.E., Soni, N., Merta, P., Kifle, L., Hajduk, P., 'Navigating the kinome' (2011) Nature Chemical Biology, 7 (4), pp. 200-202; Buchwald, F., Richter, L., Kramer, S., 'Predicting a small molecule-kinase interaction map: A machine learning approach' (2011) Journal of Cheminformatics, 3 (1), p. 22; McSkimming, D., Rasheed, K., Kannan, N., 'Classifying kinase conformations using a machine learning approach' (2017) BMC Bioinformatics, 18 (1); (1994) Koza, Genetic programming, , Cambridge, Mass.: MIT Press; Tan, M., Tan, J., Chang, S., Yap, H., Abdul Kareem, S., Zain, R., 'A genetic programming approach to oral cancer prognosis' (2016) PeerJ, 4; Breiman, L., 'Machine Learning,' (2001) Machine Learning, 45 (3), pp. 261-277; Boulesteix, A., Janitza, S., Kruppa, J., K�nig, I., 'Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics' (2012) Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 2 (6), pp. 493-507; Tetko, G.J., Todeschini, R., Mauri, A., Livingstone, D., Ertl, P., Palyulin, V., Radchenko, E., Prokopenko, V., 'Virtual Computational Chemistry Laboratory-Design and Description' (2005) Journal of Computer-Aided Molecular Design, 19 (6), pp. 453-463; Elyasaf, A., Sipper, M., 'Software review: the HeuristicLab framework' (2014) Genetic Programming and Evolvable Machines, 15 (2), pp. 215-218; Sing, T., Sander, O., Beerenwinkel, N., Lengauer, T., 'ROCR: visualizing classifier performance in R' (2005) Bioinformatics, 21 (20), pp. 3940-3941
dcterms.sourceScopus

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
avatar_scholar_128.png
Size:
2.73 KB
Format:
Portable Network Graphics
Description:
Loading...
Thumbnail Image
Name:
Paper_32-Applying_Machine_Learning_Techniques.pdf
Size:
595.87 KB
Format:
Adobe Portable Document Format
Description: