Professor Eyad Elyan e.elyan@rgu.ac.uk
Professor
CDSMOTE: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification.
Elyan, Eyad; Moreno-Garcia, Carlos Francisco; Jayne, Chrisina
Authors
Dr Carlos Moreno-Garcia c.moreno-garcia@rgu.ac.uk
Associate Professor
Chrisina Jayne
Abstract
Class-imbalanced datasets are common across several domains such as health, banking, security, and others. The dominance of majority class instances (negative class) often results in biased learning models, and therefore, classifying such datasets requires employing some methods to compact the problem. In this paper, we propose a new hybrid approach aiming at reducing the dominance of the majority class instances using class decomposition and increasing the minority class instances using an oversampling method. Unlike other undersampling methods, which suffer data loss, our method preserves the majority class instances, yet significantly reduces its dominance, resulting in a more balanced dataset and hence improving the results. A large-scale experiment using 60 public datasets was carried out to validate the proposed methods. The results across three standard evaluation metrics show the comparable and superior results with other common and state-of-the-art techniques.
Citation
ELYAN, E., MORENO-GARCIA, C.F. and JAYNE, C. 2021. CDSMOTE: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification. Neural computing and applications [online], 33(7), pages 2839-2851. Available from: https://doi.org/10.1007/s00521-020-05130-z
Journal Article Type | Article |
---|---|
Acceptance Date | Jun 16, 2020 |
Online Publication Date | Jul 18, 2020 |
Publication Date | Apr 30, 2021 |
Deposit Date | Jul 30, 2020 |
Publicly Available Date | Jul 30, 2020 |
Journal | Neural computing and applications |
Print ISSN | 0941-0643 |
Electronic ISSN | 1433-3058 |
Publisher | Springer |
Peer Reviewed | Peer Reviewed |
Volume | 33 |
Issue | 7 |
Pages | 2839-2851 |
DOI | https://doi.org/10.1007/s00521-020-05130-z |
Keywords | Machine learning; Class-imbalance; Classification; Undersapmpling; Oversampling |
Public URL | https://rgu-repository.worktribe.com/output/933911 |
Files
ELYAN 2021 CDSMOTE
(632 Kb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by/4.0/
You might also like
A multimodel-based screening framework for C-19 using deep learning-inspired data fusion.
(2024)
Journal Article
Downloadable Citations
About OpenAIR@RGU
Administrator e-mail: publications@rgu.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search