Multiple fake classes GAN for data augmentation in face image dataset.
Ali-Gombe, Adamu; Elyan, Eyad; Jayne, Chrisina
Doctor Eyad Elyan firstname.lastname@example.org
Class-imbalanced datasets often contain one or more class that are under-represented in a dataset. In such a situation, learning algorithms are often biased toward the majority class instances. Therefore, some modification to the learning algorithm or the data itself is required before attempting a classification task. Data augmentation is one common approach used to improve the presence of the minority class instances and rebalance the dataset. However, simple augmentation techniques such as applying some affine transformation to the data, may not be sufficient in extreme cases, and often do not capture the variance present in the dataset. In this paper, we propose a new approach to generate more samples from minority class instances based on Generative Adversarial Neural Networks (GAN). We introduce a new Multiple Fake Class Generative Adversarial Networks (MFC-GAN) and generate additional samples to rebalance the dataset. We show that by introducing multiple fake class and oversampling, the model can generate the required minority samples. We evaluate our model on face generation task from attributes using a reduced number of samples in the minority class. Results obtained showed that MFC-GAN produces plausible minority samples that improve the classification performance compared with state-of-the-art ACGAN generated samples.
|Start Date||Jul 14, 2019|
|Publication Date||Sep 30, 2019|
|Publisher||Institute of Electrical and Electronics Engineers|
|Series Title||Proceedings of International joint conference on neural networks|
|Institution Citation||ALI-GOMBE, A., ELYAN, E. and JAYNE, C. 2019. Multiple fake classes GAN for data augmentation in face image dataset. In Proceedings of the 2019 International joint conference on neural networks (IJCNN 2019), 14-19 July 2019, Budapest, Hungary. Piscataway: IEEE [online], article ID 8851953. Available from: https://doi.org/10.1109/IJCNN.2019.8851953|
|Keywords||Datasets; Learning algorithms; Generative adversarial neural networks (GAN)|
ALI-GOMBE 2019 Multiple fake classes
You might also like
Data stream mining: methods and challenges for handling concept drift.
Digitisation of assets from the oil and gas industry: challenges and opportunities.
Neighbourhood-based undersampling approach for handling imbalanced and overlapped data.