UNENEIBOTEJIT OTOKWALA u.otokwala@rgu.ac.uk
Research Student
Improving intrusion detection through training data augmentation.
Otokwala, Uneneibotejit; Petrovski, Andrei; Kalutarage, Harsha
Authors
Dr Andrei Petrovski a.petrovski@rgu.ac.uk
Reader
Dr Harsha Kalutarage h.kalutarage@rgu.ac.uk
Lecturer
Contributors
Naghmeh Moradpoor
Editor
Atilla Elçi
Editor
Dr Andrei Petrovski a.petrovski@rgu.ac.uk
Editor
Abstract
Imbalanced classes in datasets are common problems often found in security data. Therefore, several strategies like class resampling and cost-sensitive training have been proposed to address it. In this paper, we propose a data augmentation strategy to oversample the minority classes in the dataset. Using our Sort-Augment-Combine (SAC) technique, we split the dataset into subsets of the class labels and then generate synthetic data from each of the subsets. The synthetic data were then used to oversample the minority classes. Upon the completion of the oversampling, the independent classes were combined to form an augmented training data for model fitting. Using performance metrics such as accuracy, recall (sensitivity) and true positives (specificity), the models trained using the augmented datasets show an improvement in performance metrics over the original dataset. Similarly, in a binary class dataset, SAC performed optimally and the combination of SAC and ROSE model shows an improvement in overall accuracy, sensitivity and specificity when compared with the performance of the Random Forest model on the original dataset, ROSE and SMOTE augmented datasets.
Citation
OTOKWALA, U., PETROVSKI, A. and KALUTARAGE, H. 2021. Improving intrusion detection through training data augmentation. In Moradpoor, N., Elçi, A. and Petrovski, A. (eds.) Proceedings of 14th International conference on Security of information and networks 2021 (SIN 2021), 15-17 December 2021, [virtual conference]. Piscataway: IEEE [online], article 17. Available from: https://doi.org/10.1109/SIN54109.2021.9699293
Conference Name | 14th International conference on Security of information and networks 2021 (SIN 2021) |
---|---|
Conference Location | [virtual conference] |
Start Date | Dec 15, 2021 |
End Date | Dec 17, 2021 |
Acceptance Date | Dec 7, 2021 |
Online Publication Date | Dec 17, 2021 |
Publication Date | Feb 10, 2022 |
Deposit Date | Feb 11, 2022 |
Publicly Available Date | Feb 11, 2022 |
Publisher | IEEE Institute of Electrical and Electronics Engineers |
Book Title | Proceedings of the 14th International conference on Security of information and networks 2021 (SIN 2021) |
ISBN | 9781728192666 |
DOI | https://doi.org/10.1109/SIN54109.2021.9699293 |
Keywords | Imbalanced data; Minority oversampling; Data augmentation; Intrusion detection |
Public URL | https://rgu-repository.worktribe.com/output/1592322 |
Files
OTOKWALA 2021 Improving intrusion detection (AAM)
(509 Kb)
PDF
Copyright Statement
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or
promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
You might also like
Effective detection of cyber attack in a cyber-physical power grid system.
(2021)
Conference Proceeding
On the class overlap problem in imbalanced data classification.
(2020)
Journal Article
Detection of false command and response injection attacks for cyber physical systems security and resilience.
(2020)
Conference Proceeding
Detecting malicious signal manipulation in smart grids using intelligent analysis of contextual data.
(2020)
Conference Proceeding