Vol.12, No.1, February 2023.                                                                                                                                                                              ISSN: 2217-8309

                                                                                                                                                                                                                        eISSN: 2217-8333


TEM Journal



Association for Information Communication Technology Education and Science

Data Generative Model to Detect the Anomalies for IDS Imbalance CICIDS2017 Dataset


Azhari Shouni Barkah, Siti Rahayu Selamat, Zaheera Zainal Abidin, Rizki Wahyudi


© 2023 Azhari Shouni Barkah, published by UIKTEN. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. (CC BY-NC-ND 4.0)


Citation Information: TEM Journal. Volume 12, Issue 1, Pages 80-89, ISSN 2217-8309, DOI: 10.18421/TEM121-11, February 2023.


Received: 11 October 2022.

Revised:   09 January 2023.
Accepted:  13 January 2023.
Published: 27 February 2023.




The system of intrusion detection dataset enables machine learning to recognize attack activity in the network. The intrusion, however, is naturally imbalanced, most of the traffic is normal traffic. Moreover, a certain attack is more popular than others. Therefore, the number of cases is highly imbalanced with the majority of attacks dominated by Distributed Denial of Services (DDoS), Denial of Service Hulk (DoS_Hulk), and PortScan more than 4.5% of attacks data. While the minority attack such as DoS_goldeneye, DoS_slowloris, DoS_slowhttptest, Web Attacks, Infiltration, Bot, and Heartbleed was only recorded in less than 1% of attack data. We propose data generative model (DGM) using the Conditional Generative Adversarial Network (CGAN) to improve the class of minorities of the IDS dataset. In this study, we tested the performance of the Data Generative Model based on CGAN in the CICIDS2017 dataset. There are new attacks in this dataset, including Bot, Web_attacks, Infiltration and Heartbleed. According to our experiments, the model successfully detect new attacks and improves the weighted f1-score by 99,92% compared to that of achievers by existing methods using the CICIDS2017 dataset.


Keywords –DGM, CGAN, imbalance data, IDS dataset.



Full text PDF >  



Copyright © 2023 UIKTEN
Copyright licence: All articles are licenced via Creative Commons CC BY-NC-ND 4.0 licence