Analisis Penerapan Normalisasi Data Dengan Menggunakan Z-Score Pada Kinerja Algoritma K-NN

Raditya Galih Whendasmoro; Joseph Joseph

doi:10.30865/jurikom.v9i4.4526

Authors

Raditya Galih Whendasmoro Universitas Bung Karno, Jakarta
Joseph Joseph Universitas Bung Karno, Jakarta

DOI:

https://doi.org/10.30865/jurikom.v9i4.4526

Keywords:

Data Mining, Normalization, Data, Z-Score, K-NN

Abstract

The large volume of information in the data causes a lot of data to be stored in the dataset. The dataset consists of various attributes and attribute values which contain information stored in the dataset. Data mining is a process that can be used to search for information on datasets. However, the problems encountered in the dataset are often found to have abnormal data such as the range of values that are too far and different between dataset attributes. The value range that is too far causes the results of the information obtained to be not optimal, in data mining itself the process or results are good based on the quality of the data stored in the dataset. Data normalization is a preprocessing stage, where data normalization is scaled back to the range of values in the attribute. Z-Score Normalization is a statistical technique that can be used in data mining to preprocess data by performing data transformations. Z-Score Normalization can be combined with data mining classification techniques, where the role of Z-Score Normalization is to normalize data which is useful for improving the performance of data mining classification algorithms, especially the K-NN algorithm in this study. The results of the study show that Z-Score Normalization is useful for improving performance than the K-NN algorithm. This can be seen from the increase in the accuracy value obtained from the K-NN process before normalizing the dataset and after normalizing the dataset. The accuracy values respectively before normalizing the dataset were 95.13%, 95.83%, 96.11%, 95.77% and 95.81% after normalizing the dataset there was an increase in the accuracy value, namely 97.87%, 98, 57%, 98.77%, 97.23% and 98.11%.

References

C. Luo, J. Zhan, X. Xue, L. Wang, R. Ren, and Q. Yang, â€œCosine normalization: Using cosine similarity instead of dot product in neural networks,â€ Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 11139 LNCS, pp. 382â€“391, 2018, doi: 10.1007/978-3-030-01418-6_38.

D. A. Nasution, H. H. Khotimah, and N. Chamidah, â€œPerbandingan Normalisasi Data untuk Klasifikasi Wine Menggunakan Algoritma K-NN,â€ Comput. Eng. Sci. Syst. J., vol. 4, no. 1, p. 78, 2019, doi: 10.24114/cess.v4i1.11458.

S. Z. Rosiana and N. Laily, â€œAnalisis Altman Z-Score Untuk Memprediksi Kebangkrutan Perusahaan Kabel Di Indonesiaanalisis Altman Z-Score Untuk â€¦,â€ J. Ilmu dan Ris. â€¦, vol. 7, no. 1, 2018, [Online]. Available: http://jurnalmahasiswa.stiesia.ac.id/index.php/jirm/article/view/533.

Gde Agung Brahmana Suryanegara, Adiwijaya, and Mahendra Dwifebri Purbolaksono, â€œPeningkatan Hasil Klasifikasi pada Algoritma Random Forest untuk Deteksi Pasien Penderita Diabetes Menggunakan Metode Normalisasi,â€ J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 5, no. 1, pp. 114â€“122, 2021, doi: 10.29207/resti.v5i1.2880.

Henderi, T. Wahyuningsih, and E. Rahwanto, â€œComparison of Min-Max normalization and Z-Score Normalization in the K-nearest neighbor (kNN) Algorithm to Test the Accuracy of Types of Breast Cancer,â€ IJIIS Int. J. Informatics Inf. Syst., vol. 4, no. 1, pp. 13â€“20, 2021, doi: 10.47738/ijiis.v4i1.73.

M. R. A. Nasution and M. Hayaty, â€œPerbandingan Akurasi dan Waktu Proses Algoritma K-NN dan SVM dalam Analisis Sentimen Twitter,â€ J. Inform., vol. 6, no. 2, pp. 226â€“235, 2019, doi: 10.31311/ji.v6i2.5129.

S. Harlina, â€œData Mining Pada Penentuan Kelayakan Kredit Menggunakan Algoritma K-Nn Berbasis Forward Selection Data Mining on Credit Feasibility Determination Using K-Nn Algorithm Based on Forward Selection,â€ CCIT J., vol. 11, no. 2, pp. 236â€“244, 2018, doi: 10.33050/ccit.v11i2.591.

D. Noviana, Y. Susanti, and I. Susanto, â€œAnalisis Rekomendasi Penerima Beasiswa Menggunakan Algoritma K-Nearest Neighbor (K-NN) dan Algoritma C4.5,â€ Semin. Nas. Penelit. Pendidik. Mat. 2019 UMT, pp. 79â€“87, 2019.

T. T. Muryono and I. Irwansyah, â€œImplementasi Data Mining Untuk Menentukan Kelayakan Pemberian Kredit Dengan Menggunakan Algoritma K-Nearest Neighbors (K-Nn),â€ Infotech J. Technol. Inf., vol. 6, no. 1, pp. 43â€“48, 2020, doi: 10.37365/jti.v6i1.78.

Y. D. Atma and A. Setyanto, â€œPerbandingan algoritma c4.5 dan k-nn dalam identifikasi mahasiswa berpotensi drop out,â€ Metik J. ISSN 2580-1503, vol. 2, no. 2, pp. 31â€“37, 2018.

Analisis Penerapan Normalisasi Data Dengan Menggunakan Z-Score Pada Kinerja Algoritma K-NN

Authors

DOI:

Keywords:

Abstract

References

Additional Files

Published

How to Cite

Issue

Section

menujuribaru

template

sitasigs

member