A new hybrid approach for feature selection and predicting of protein interaction network in lung cancer
Loading...
Date
2019
Journal Title
Journal ISSN
Volume Title
Type
Article
Publisher
ztaha
Series Info
Bioscience Research,;2019 volume 16(2): 1323-1336
Doi
Scientific Journal Rankings
Abstract
Different computational and evolutionary methods have been employed in the last decade for selecting important molecular features from biological data. Extracting information from microarray data is extremely important and complex task due to the high dimensionality of its datasets. Feature selection is a very important aspect of the analysis that helps in identifying the important genes that can be used in a further biological analysis. This paper proposes a new hybridization between the Flower Pollination and Differential Evolution algorithms for optimizing feature selection parameters and to find out the most important subset of features over gene expression profiles of lung cancer. The results showed that the hybrid approach has a better capability in searching for the best solutions compared to applying each algorithm independently. SLC5A1 gene was identified as a biomarker gene of lung cancer. By constructing the protein-protein interaction network for the extracted genes, a direct interaction has been detected between the SLC5A1 and EGFR genes, where the latter is known to have an important role in the mutation process of lung cells.
Description
MSA Google Scholar
Keywords
Evolutionary algorithms, Flower pollination algorithm
Citation
Abdel-Raouf, O., Abdel-Baset, M., El-Henawy, I. (2014). An Improved Flower Pollination Algorithm with Chaos. International Journal of Education and Management Engineering (IJEME), 4(2), 1-8. Abraham A., Das S., Roy S. (2008). Swarm Intelligence Algorithms for Data Clustering. In: Maimon O., Rokach L. (eds), Soft Computing for Knowledge Discovery and Data Mining. Springer, Boston, MA, 279-313 Akutekwe, A., Seker, H., Iliya, S. (2014). An optimized hybrid dynamic Bayesian network approach using differential evolution algorithm for the diagnosis of Hepatocellular Carcinoma. IEEE 6th International Conference on Adaptive Science & Technology (ICAST). Cai, Y., Du, J. (2014). Enhanced differential evolution with adaptive direction information. IEEE Congr Evol Comput (CEC), Beijing, 305-312. Coello, C., Lamont, G., Van Veldhuizen, D. (2015). Multi-objective Evolutionary Algorithms in Real-World Applications: Some Recent Results and Current Challenges. In Springer International Publishing, D.Greiner et al. (eds), Advances in Evolutionary and Deterministic Methods for Design, Optimization and Control in Engineering and Sciences (pp. 3-18). Das, S., Suganthan, P.N. (2011). Differential evolution: A survey of the state-of-the-art. IEEE Transactions on Evolutionary Computation 15, 4-31. Dorigo, M., Birattari, M., Stutzle, T. (2006). Ant colony optimization: Artificial ants as a computational intelligence technique. IEEE Computational Intelligence Magazine, 28-39. Gong, W., Cai, Z., Ling, C. X. (2010). DE/ BBO: A hybrid differential evolution with biogeography based optimization for global numerical optimization. Soft Computing 15, 645-665. Guyon, I., Weston, J., Barnhill, S. (2002). Gene selection for cancer classification using support vector machines. Machine learning, Vol. 46(1-3), (PP 389-422). He, X., Zhang, Q., Sun, N., Dong, Y. (2009). Feature selection with discrete binary differential evolution. In Proceedings of the International Conference on Artificial Intelligence and Computational Intelligence IEEE, Shanghai, (pp. 327-330). John, G.H., Kohavi, R., Pfleger, K. (1994). Irrelevant features and the subset selection problem. ICML 94, (pp. 121-129). Kancherla K., Mukkamala S. (2012) Feature Selection for Lung Cancer Detection Using SVM Based Recursive Feature Elimination Method. In: Giacobini M., Vanneschi L., Bush W.S. (eds) Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics. EvoBIO 2012. Lecture Notes in Computer Science, vol 7246. Springer, Berlin, Heidelberg (pp.168-176). Karaboga, D., Basturk, B.J (2007). A powerful and efficient algorithm for numerical function optimization: artificial bee colony (ABC) algorithm. Journal of Glob Optim vol. 39, no. 3, 459-471. Nagarajan, R., Scutari, M., Le’bre, S. (2013). Bayesian Networks in R with Applications in Systems Biology. Springer-Verlag New York, (pp. XIII, 157). Nakamura, R. Y., Pereira, L. A., Costa, K. A., Rodrigues, D., Papa, J.P., Yang, X.S., (2012).BBA: A binary bat algorithm for feature selection. In Proceedings of 25th the Conference on Graphics, Patterns and Images (SIBGRAPI), IEEE, Ouro Preto, Brazil, (pp. 291-297). Papa, J. P., Falcao, A. X., Suzuki, C. T. (2009). Supervised pattern classification based on optimum-path forest. Int. J. Imaging Syst. Technology. 19 (2), 120-131. Papa, J. P., Falcao, A. X., Albuquerque, V. H., Tavares., J. M. (2012). Efficient supervised optimum-path forest classification for large data sets. Pattern Recognition, 512-520. Powell, C., Spira, A., Derti, A. (2003). Gene expression in lung adenocarcinoma of smokers and nonsmokers. AJRCMB vol 29,