Skip to main content

Research Repository

Advanced Search

CDSMOTE: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification.

Elyan, Eyad; Moreno-Garcia, Carlos Francisco; Jayne, Chrisina

Authors

Chrisina Jayne



Abstract

Class-imbalanced datasets are common across several domains such as health, banking, security, and others. The dominance of majority class instances (negative class) often results in biased learning models, and therefore, classifying such datasets requires employing some methods to compact the problem. In this paper, we propose a new hybrid approach aiming at reducing the dominance of the majority class instances using class decomposition and increasing the minority class instances using an oversampling method. Unlike other undersampling methods, which suffer data loss, our method preserves the majority class instances, yet significantly reduces its dominance, resulting in a more balanced dataset and hence improving the results. A large-scale experiment using 60 public datasets was carried out to validate the proposed methods. The results across three standard evaluation metrics show the comparable and superior results with other common and state-of-the-art techniques.

Citation

ELYAN, E., MORENO-GARCIA, C.F. and JAYNE, C. 2021. CDSMOTE: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification. Neural computing and applications [online], 33(7), pages 2839-2851. Available from: https://doi.org/10.1007/s00521-020-05130-z

Journal Article Type Article
Acceptance Date Jun 16, 2020
Online Publication Date Jul 18, 2020
Publication Date Apr 30, 2021
Deposit Date Jul 30, 2020
Publicly Available Date Mar 28, 2024
Journal Neural computing and applications
Print ISSN 0941-0643
Electronic ISSN 1433-3058
Publisher Springer
Peer Reviewed Peer Reviewed
Volume 33
Issue 7
Pages 2839-2851
DOI https://doi.org/10.1007/s00521-020-05130-z
Keywords Machine learning; Class-imbalance; Classification; Undersapmpling; Oversampling
Public URL https://rgu-repository.worktribe.com/output/933911

Files





You might also like



Downloadable Citations