A NOVEL OVERSAMPLING TECHNIQUE TO HANDLE IMBALANCED DATASETS
No Thumbnail Available
Date
06/01/2020
Authors
Journal Title
Journal ISSN
Volume Title
Type
Article
Publisher
European Council for Modelling and Simulation
Series Info
Proceedings - European Council for Modelling and Simulation, ECMS;Volume 34, Issue 1, 1 June 2020, Pages 177-182 34th International ECMS Conference on Modelling and Simulation, ECMS 2020; Wildau; Germany; 9 June 2020 through 12 June 2020; Code 164036
Doi
Scientific Journal Rankings
Abstract
With the amount of data is growing extensively in different domains in the recent years, the data imbalance problem arises frequently. A dataset is called imbalanced when the data of a certain class has significantly more instances than that of other classes of the same dataset. This imbalanced nature of the data negatively affects the performance of a classifier since misclassification of data may cause data analysis results to be inaccurate and hence leads to wrong business decisions. This paper presents a study of the different techniques that are used to handle the imbalanced dataset, and finally proposes a novel oversampling technique to tackle the binary classification of imbalanced dataset problem. © ECMS Mike Steglich, Christian Mueller, Gaby Neumann, Mathias Walther (Editors).
Description
Scopus