TGT: A Novel Adversarial Guided Oversampling Technique for Handling Imbalanced Datasets

dc.AffiliationOctober University for modern sciences and Arts (MSA)
dc.contributor.authorMahmoud, Ayat
dc.contributor.authorEl-Kilany, Ayman
dc.contributor.authorAli, Farid
dc.contributor.authorMazen, Sherif
dc.date.accessioned2021-02-12T09:50:58Z
dc.date.available2021-02-12T09:50:58Z
dc.date.issued1/19/2021
dc.description.abstractWith the volume of data increasing exponentially, there is a growing interest in helping people to benefit from their data regardless of its poor quality. One of the major data quality problems is the imbalanced distribution of different categories existing in the data. Such problem would affect the performance of any possible of analysis and mining on the data. For instance, data with an imbalanced distribution has a negative effect on the performance achieved by most traditional classification techniques. This paper proposes TGT (Train Generate Test), a novel oversampling technique for handling imbalanced datasets problem. Using different learning strategies, TGT guarantees that the generated synthetic samples reside in minority regions. TGT showed a high improvement in performance of different classification techniques when was experimented with five imbalanced datasets of different types. 2021 Production and hosting by Elsevier B.V. on behalf of Faculty of Computers and Information, Cairo University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/ licenses/by-nc-nd/4.0/).en_US
dc.description.urihttps://www.scimagojr.com/journalsearch.php?q=19700182731&tip=sid&clean=0
dc.identifier.doihttps://doi.org/10.1016/j.eij.2021.01.002
dc.identifier.otherhttps://doi.org/10.1016/j.eij.2021.01.002
dc.identifier.urihttp://repository.msa.edu.eg/xmlui/handle/123456789/4429
dc.language.isoen_USen_US
dc.publisherElsevieren_US
dc.relation.ispartofseriesEgyptian Informatics Journal;
dc.subjectuniversityen_US
dc.subjectImbalanceen_US
dc.subjectOversamplingen_US
dc.subjectClassificationen_US
dc.titleTGT: A Novel Adversarial Guided Oversampling Technique for Handling Imbalanced Datasetsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1-s2.0-S1110866521000025-main.pdf
Size:
847.7 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
51 B
Format:
Item-specific license agreed upon to submission
Description: