A comparative study to classify big data using fuzzy techniques

dc.AffiliationOctober University for modern sciences and Arts (MSA)
dc.contributor.authorLabib S.S.
dc.contributor.otherFaculty of Computer Science
dc.contributor.otherOctober University for Modern Science and Arts
dc.contributor.otherEgypt
dc.date.accessioned2020-01-09T20:41:24Z
dc.date.available2020-01-09T20:41:24Z
dc.date.issued2017
dc.descriptionScopus
dc.description.abstractIt is very difficult to implement an efficient analysis by using the customary techniques currently available; this is due to the fact that the data size has had a huge increase. Many complications were faced because of the numerous characteristics of big data; some of them include complexity, value, variability, variety, velocity, and volume. The objective of this paper is to implement classification techniques using the map reduce framework using fuzzy and crisp methods, also to arrange for a study that can compare and contrast the outcomes of the suggested systems against the methods appraised in the documented works. For this research the applied method for the fuzzy technique is the fuzzy k-nearest neighbor, and for the non-fuzzy techniques both the support vector machine and the k-nearest neighbor are used. The use of the map reduce paradigm is applied to be able to process big data. We also implemented an integrated system using the Support Vector Machine with the fuzzy soft label and Gaussian fuzzy membership. Results show that fuzzy k-nearest neighbor classifier gives higher accuracy but it takes a lot of time in classification compared to the other techniques. But the outcomes when projected onto other data sets demonstrate that the suggested method that used fuzzy logic in the Reducer function gives higher accuracy and lower time than the new suggested methods and the methods revised in the paper. � 2016 IEEE.en_US
dc.description.urihttps://www.scimagojr.com/journalsearch.php?q=20300195017&tip=sid&clean=0
dc.identifier.doihttps://doi.org/10.1109/ICEDSA.2016.7818508
dc.identifier.doiPubMed ID :
dc.identifier.isbn9.78E+12
dc.identifier.issn21592047
dc.identifier.otherhttps://doi.org/10.1109/ICEDSA.2016.7818508
dc.identifier.otherPubMed ID :
dc.identifier.urihttps://t.ly/VZ2gd
dc.language.isoEnglishen_US
dc.publisherIEEE Computer Societyen_US
dc.relation.ispartofseriesInternational Conference on Electronic Devices, Systems, and Applications
dc.subjectBig dataen_US
dc.subjectClassificationen_US
dc.subjectFuzzy k-nearest neighboren_US
dc.subjectFuzzy logicen_US
dc.subjectGaussian membership functionen_US
dc.subjectHadoopen_US
dc.subjectK-nearest neighboren_US
dc.subjectMapReduceen_US
dc.subjectSoft labelsen_US
dc.subjectSupport vector machineen_US
dc.subjectClassification (of information)en_US
dc.subjectComputer circuitsen_US
dc.subjectElectronic equipmenten_US
dc.subjectFuzzy logicen_US
dc.subjectMembership functionsen_US
dc.subjectMotion compensationen_US
dc.subjectNearest neighbor searchen_US
dc.subjectSupport vector machinesen_US
dc.subjectThermoelectric equipmenten_US
dc.subjectFuzzy k nearest neighbor (FKNN)en_US
dc.subjectGaussian membership functionen_US
dc.subjectHadoopen_US
dc.subjectK-nearest neighborsen_US
dc.subjectMap-reduceen_US
dc.subjectSoft labelsen_US
dc.subjectBig dataen_US
dc.titleA comparative study to classify big data using fuzzy techniquesen_US
dc.typeConference Paperen_US
dcterms.isReferencedByJacobs, A., The pathologies of big data (2009) Communications of the ACM, 52 (8), pp. 36-44; Dean, J., Ghemawat, S., MapReduce: A flexible data processing tool (2010) Communications of the ACM, 53 (1), pp. 72-77; Zuech, R., Khoshgoftaar, T.M., Wald, R., Intrusion detection and Big Heterogeneous Data: A Survey (2015) Journal of Big Data, 2 (1), pp. 1-41; Agrawal, D., Mind your Ps and Vs: A perspective on the challenges of big data management and privacy concerns (2015) Big Data and Smart Computing (BigComp), 2015 International Conference on, , IEEE; Aggarwal, A., Opportunities and challenges of big data in public sector (2015) Managing Big Data Integration in the Public Sector, p. 289; Juki, N., Augmenting data warehouses with big data (2015) Information Systems Management, , justaccepted; Russom, P., Big data analytics (2011) TDWI Best Practices Report, , Fourth Quarter; Puurtinen, J., (2014) Big Data Mining As Part of Substation Automation and Network Management.; Laney, D., (2001) 3D Data Management: Controlling Data Volume, Velocity and Variety, 6, p. 70. , META Group Research Note; Bhagattjee, B., (2014) Emergence and Taxonomy of Big Data As A Service., , Massachusetts Institute of Technology; Kumarasamy, M., AN ANALYSIS of BIG DATA DISCOVERY and COLLABORATION; Klir, G., Yuan, B., (1995) Fuzzy Sets and Fuzzy Logic., 4. , Prentice Hall New Jersey; Kamavisdar, P., Saluja, S., Agrawal, S., A survey on image classification approaches and techniques (2013) International Journal of Advanced Research in Computer and Communication Engineering, 2 (1), pp. 1005-1009; Lopez, V., On the use of MapReduce to build linguistic fuzzy rule based classification systems for big data (2014) Fuzzy Systems (FUZZ-IEEE), 2014 IEEE International Conference on, , IEEE; Kopczynski, M., Grzes, T., Stepaniuk, J., Computation of cores in big datasets: An FPGA approach (2015) Rough Sets and Knowledge Technology., pp. 153-163. , Springer; Del R�o, S., A mapreduce approach to address big data classification problems based on the fusion of linguistic fuzzy rules (2015) International Journal of Computational Intelligence Systems, 8 (3), pp. 422-437
dcterms.sourceScopus

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
avatar_scholar_256.png
Size:
6.31 KB
Format:
Portable Network Graphics
Description: