A NOVEL OVERSAMPLING TECHNIQUE TO HANDLE IMBALANCED DATASETS

No Thumbnail Available

Date

06/01/2020

Journal Title

Journal ISSN

Volume Title

Type

Article

Publisher

European Council for Modelling and Simulation

Series Info

Proceedings - European Council for Modelling and Simulation, ECMS;Volume 34, Issue 1, 1 June 2020, Pages 177-182 34th International ECMS Conference on Modelling and Simulation, ECMS 2020; Wildau; Germany; 9 June 2020 through 12 June 2020; Code 164036

Doi

Abstract

With the amount of data is growing extensively in different domains in the recent years, the data imbalance problem arises frequently. A dataset is called imbalanced when the data of a certain class has significantly more instances than that of other classes of the same dataset. This imbalanced nature of the data negatively affects the performance of a classifier since misclassification of data may cause data analysis results to be inaccurate and hence leads to wrong business decisions. This paper presents a study of the different techniques that are used to handle the imbalanced dataset, and finally proposes a novel oversampling technique to tackle the binary classification of imbalanced dataset problem. © ECMS Mike Steglich, Christian Mueller, Gaby Neumann, Mathias Walther (Editors).

Description

Scopus

Keywords

Citation

Full Text link