Komparasi Information Gain, Gain Ratio, CFs-Bestfirst dan CFs-PSO Search Terhadap Performa Deteksi Anomali

Kurniabudi Kurniabudi; Abdul Harris; Albertus Edward Mintaria

doi:10.30865/mib.v5i1.2258

Authors

Kurniabudi Kurniabudi STIKOM Dinamika Bangsa, Jambi
Abdul Harris STIKOM Dinamika Bangsa, Jambi
Albertus Edward Mintaria STIKOM Dinamika Bangsa, Jambi

DOI:

https://doi.org/10.30865/mib.v5i1.2258

Keywords:

Feature Selection, Anomaly Detection, CICIDS-2017, Information Gain, Gain Ratio, Correlation-Based, PSO-Search

Abstract

Large data dimensionality is one of the issues in anomaly detection. One approach used to overcome large data dimensions is feature selection. An effective feature selection technique will produce the most relevant features and can improve the classification algorithm to detect attacks. There have been many studies on feature selection techniques, each using different methods and strategies to find the best and relevant features. In this study, a comparison of Information Gain, Gain Ratio, CFs-BestFirst and CFs-PSO Search techniques was compared. The selection features of the four techniques were further validated by the Naive Bayes classification algorithm, k-NN and J48. This study uses the ISCX CICIDS-2017 dataset. Based on the test results the feature selection techniques affect the performance of the Naive Bayes algorithm, k-NN and J48. Increasingly relevant and important features can improve detection performance. The test results also show that the number of features influences the processing / computing time. CFs-BestFirst produces a smaller number of features compared to CFs-PSO Search, Information Gain and Gain Ratio so it requires lower processing time. In addition, k-NN requires a higher processing time than Naive Bayes and J48

Author Biographies

Kurniabudi Kurniabudi, STIKOM Dinamika Bangsa, Jambi

Sistem Komputer

Abdul Harris, STIKOM Dinamika Bangsa, Jambi

Teknik Informatika

Albertus Edward Mintaria, STIKOM Dinamika Bangsa, Jambi

Sistem Komputer

References

J. Zhang, H. Li, Q. Gao, H. Wang, and Y. Luo, â€œDetecting anomalies from big network traffic data using an adaptive detection approach,â€ Inf. Sci. (Ny)., vol. 318, no. August, pp. 91â€“110, 2015.

G. Chandrashekar and F. Sahin, â€œA survey on feature selection methods,â€ Comput. Electr. Eng., vol. 40, no. 1, pp. 16â€“28, 2014.

Y. Dhote, S. Agrawal, and A. J. Deen, â€œA Survey on Feature Selection Techniques for Internet Traffic Classification,â€ Proc. - 2015 Int. Conf. Comput. Intell. Commun. Networks, CICN 2015, pp. 1375â€“1380, 2016.

R. F. Najeeb and B. N. Dhannoon, â€œClassification for Intrusion Detection with Different Feature Selection Methods : A Survey ( 2014-2016),â€ Int. J. Adv. Res. Comput. Sci. Softw. Eng., vol. 7, no. 5, pp. 305â€“311, 2017.

P. R. K. Varma, V. V. Kumari, and S. S. Kumar, A Survey of Feature Selection Techniques in Intrusion Detection System: A Soft Computing Perspective, vol. 710. Springer Singapore, 2018.

S. Aljawarneh, M. Aldwairi, and M. B. Yassein, â€œAnomaly-based intrusion detection system through feature selection analysis and building hybrid efficient model,â€ J. Comput. Sci., vol. 25, pp. 152â€“160, 2018.

M. El Boujnouni and M. Jedra, â€œNew Intrusion Detection System Based on Support Vector Domain Description with Information Gain Metric,â€ Int. J. Netw. Secur., vol. 20, no. 1, pp. 25â€“34, 2018.

N. AraÃºjo, â€œIdentifying Important Characteristics in the KDD99 Intrusion Detection Dataset by Feature Selection using a Hybrid Approach,â€ pp. 552â€“558, 2010.

P. Kushwaha, H. Buckchash, and B. Raman, â€œAnomaly based intrusion detection using filter based feature selection on KDD-CUP 99,â€ IEEE Reg. 10 Annu. Int. Conf. Proceedings/TENCON, vol. 2017-Decem, pp. 839â€“844, 2017.

N. Sainis, â€œFeature Classification and Outlier Detection to Increased Accuracy in Intrusion Detection System,â€ Int. J. Appl. Eng. Res., vol. 13, no. 10, pp. 7249â€“7255, 2018.

K. A. Taher, B. M. Yasin Jisan, and M. M. Rahman, â€œNetwork Intrusion Detection using Supervised Machine Learning Technique with Feature Selection,â€ 2019 Int. Conf. Robot. Signal Process. Tech., pp. 643â€“646, 2019.

V. Zhang and L. J. Zhang, â€œA rule generation model using S-PSO for Misuse Intrusion Detection,â€ ICCASM 2010 - 2010 Int. Conf. Comput. Appl. Syst. Model. Proc., vol. 3, no. Iccasm, pp. 418â€“423, 2010.

A. Panigrahi and M. R. Patra, â€œAn evolutionary computation based classification model for network intrusion detection,â€ Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 8956, pp. 318â€“324, 2015.

I. Sharafaldin, A. H. Lashkari, and A. A. Ghorbani, â€œToward generating a new intrusion detection dataset and intrusion traffic characterization,â€ ICISSP 2018 - Proc. 4th Int. Conf. Inf. Syst. Secur. Priv., vol. 2018-Janua, no. Cic, pp. 108â€“116, 2018.

K. Goeschel, â€œReducing false positives in intrusion detection systems using data-mining techniques utilizing support vector machines, decision trees, and naive Bayes for off-line analysis,â€ Conf. Proc. - IEEE SOUTHEASTCON, vol. 2016-July, 2016.

S. Mukherjee and N. Sharma, â€œIntrusion Detection using Naive Bayes Classifier with Feature Reduction,â€ vol. 4, pp. 119â€“128, 2012.

G. Serpen and E. Aghaei, â€œHost-based misuse intrusion detection using PCA feature extraction and kNN classification algorithms,â€ Intell. Data Anal., vol. 22, no. 5, pp. 1101â€“1114, 2018.

S. Sahu and B. M. Mehtre, â€œNetwork intrusion detection system using J48 Decision Tree,â€ 2015 Int. Conf. Adv. Comput. Commun. Informatics, ICACCI 2015, pp. 2023â€“2026, 2015.

N. F. Haq, A. R. Onik, and F. M. Shah, â€œAn ensemble framework of anomaly detection using hybridized feature selection approach (HFSA),â€ IntelliSys 2015 - Proc. 2015 SAI Intell. Syst. Conf., pp. 989â€“995, 2015.

S. Chormunge and S. Jena, â€œEfficient feature subset selection algorithm for high dimensional data,â€ Int. J. Electr. Comput. Eng., vol. 6, no. 4, pp. 1880â€“1888, 2016.

P. BereziÅ„ski, B. Jasiul, and M. Szpyrka, â€œAn entropy-based network anomaly detection method,â€ Entropy, vol. 17, no. 4, pp. 2367â€“2408, 2015.

H. EzzatIbrahim, S. M. Badr, and M. A. Shaheen, â€œAdaptive Layered Approach using Machine Learning Techniques with Gain Ratio for Intrusion Detection Systems,â€ Int. J. Comput. Appl., vol. 56, no. 7, pp. 10â€“16, 2012.

H. Chae and S. H. Choi, â€œFeature Selection for efficient Intrusion Detection using Attribute Ratio,â€ Int. J. Comput. Commun., vol. 8, pp. 134â€“139, 2014.

I. Syarif, â€œFeature Selection of Network Intrusion Data using Genetic Algorithm and Particle Swarm Optimization,â€ Emit. Int. J. Eng. Technol., vol. 4, no. 2, pp. 277â€“290, 2016.

A. I. Madbouly and T. M. Barakat, â€œEnhanced relevant feature selection model for intrusion detection systems,â€ Int. J. Intell. Eng. Informatics, vol. 4, no. 1, p. 21, 2016.

T. Ahmad and M. N. Aziz, â€œData preprocessing and feature selection for machine learning intrusion detection systems,â€ ICIC Express Lett., vol. 13, no. 2, pp. 93â€“101, 2019.

B. Dhruba K and K. Jugal K, Network Anomaly Detection A Machine Learning Perspective. 2014.

S. Agrawal and J. Agrawal, â€œSurvey on Anomaly Detection using Data Mining Techniques,â€ Procedia - Procedia Comput. Sci., vol. 60, pp. 708â€“713, 2015.

A. Buczak and E. Guven, â€œA survey of data mining and machine learning methods for cyber security intrusion detection,â€ IEEE Commun. Surv. Tutorials, vol. PP, no. 99, p. 1, 2015.

D. Summeet and D. Xian, Data Mining and Machine Learning in Cybersecurity. CRC Press, 2011.

S. Aljawarneh, M. B. Yassein, and M. Aljundi, â€œAn enhanced J48 classification algorithm for the anomaly intrusion detection systems,â€ Cluster Comput., pp. 1â€“17, 2017.

R. Goel, A. Sardana, and R. C. Joshi, â€œParallel Misuse and Anomaly Detection Model,â€ vol. 14, no. 4, pp. 211â€“222, 2012.

T. Garg and S. S. Khurana, â€œComparison of classification techniques for intrusion detection dataset using WEKA,â€ Int. Conf. Recent Adv. Innov. Eng. ICRAIE 2014, 2014.

B. Cui and S. He, â€œAnomaly detection model based on hadoop platform and weka interface,â€ Proc. - 2016 10th Int. Conf. Innov. Mob. Internet Serv. Ubiquitous Comput. IMIS 2016, pp. 84â€“89, 2016.

A. Abd and A. Hadi, â€œPerformance Analysis of Big Data Intrusion Detection System over Random Forest Algorithm,â€ Int. J. Appl. Eng. Res., vol. 13, no. 2, pp. 1520â€“1527, 2018.

Komparasi Information Gain, Gain Ratio, CFs-Bestfirst dan CFs-PSO Search Terhadap Performa Deteksi Anomali

Authors

DOI:

Keywords:

Abstract

Author Biographies

Kurniabudi Kurniabudi, STIKOM Dinamika Bangsa, Jambi

Abdul Harris, STIKOM Dinamika Bangsa, Jambi

Albertus Edward Mintaria, STIKOM Dinamika Bangsa, Jambi

References

Downloads

Published

Issue

Section

License