Penerapan Algoritma Adaboost Untuk Peningkatan Kinerja Klasifikasi Data Mining Pada Imbalance Dataset Diabetes

Nia Novianti; Muhammad Zarlis; Poltak Sihombing

doi:10.30865/mib.v6i2.4017

Authors

Nia Novianti Universitas Sumatera Utara, Medan
Muhammad Zarlis Universitas Sumatera Utara, Medan
Poltak Sihombing Universitas Sumatera Utara, Medan

DOI:

https://doi.org/10.30865/mib.v6i2.4017

Keywords:

Improvement, Performance, Classification, Adaboost, Data Mining

Abstract

According to the World Health Organization (WHO), it has been recorded that up to now more than 150 million people have diabetes, whether they are elderly people, adults, teenagers, men or women. Early knowledge of diabetes can be seen based on data from patients who already have diabetes. The patient's disease data has previously been stored and arranged in a data warehouse or what is commonly referred to as a dataset. Therefore, it is necessary to process the data contained in the dataset. But the use of data mining techniques themselves must be assisted by using the techniques contained in the data mining, namely classification techniques. K-Nearest Neighbor (K-NN) is one of the methods used in the classification technique. In the results of the classification of the level of confidence obtained in the process, it is seen based on the amount of accuracy. However, there are important issues that need special attention. In the dataset used for the classification process, the data collected contains unbalanced class results (balance). The unbalanced data classification process becomes an important problem, this is because it can cause a decrease in performance. Adaboost is a technique in data mining that can be used to increase the level of accuracy in classification methods. The results showed that the adaboost algorithm can help improve classification performance. This can be seen from the increasing level of accuracy obtained from the process carried out before and after using the adaboost algorithm. The results obtained from the research show that the adaboost algorithm can be used properly to help the performance of the K-Nearest Neighbor algorithm for the classification process on diabetes datasets. It can be seen from 5 tests with values of K = 7, 13, 19, 25 and 31 there is an increase in the accuracy results obtained after using the adaboost algorithm.

References

S. Ucha Putri, E. Irawan, and F. Rizky, â€œImplementasi Data Mining Untuk Prediksi Penyakit Diabetes Dengan Algoritma C4.5,â€ Januari, vol. 2, no. 1, pp. 39â€“46, 2021.

F. Aris and Benyamin, â€œPenerapan Data Mining untuk Identifikasi Penyakit Diabetes Melitus dengan Menggunakan Metode Klasifikasi,â€ Router Res., vol. 1, no. 1, pp. 1â€“6, 2019, [Online]. Available: https://www.ejournal.stipwunaraha.ac.id/index.php/router/article/view/313.

R. Kajen, â€œPengertian Penyakit Diabetes, Faktor Risiko, dan Cara Pencegahannya,â€ RSUD Kajen, 2021. https://rsudkajen.id/pengertian-penyakit-diabetes-faktor-risiko-dan-cara-pencegahannya/ (accessed Mar. 08, 2022).

N. Sagala and H. Tampubolon, â€œKomparasi Kinerja Algoritma Data Mining pada Dataset Konsumsi Alkohol Siswa,â€ Khazanah Inform. J. Ilmu Komput. dan Inform., vol. 4, no. 2, p. 98, 2018, doi: 10.23917/khif.v4i2.7061.

D. P. Utomo and S. Aripin, â€œPenerapan Algoritma C5 . 0 Untuk Mengetahui Pola Kepuasan Mahasiswa di Masa Pembelajaran Daring,â€ in Seminar Nasional Riset Dan Information Science (SENARIS), 2021, vol. 3, pp. 7â€“12.

U. R. Amanda and D. P. Utomo, â€œPenerapan Data Mining Algoritma Hash Based Pada Data Pemesanan Buah Impor Cv. Green Uni Fruit,â€ KOMIK (Konferensi Nas. Teknol. Inf. dan Komputer), vol. 5, no. 1, pp. 86â€“93, 2021, doi: 10.30865/komik.v5i1.3653.

A. Handayanto, K. Latifa, N. D. Saputro, and R. R. Waliansyah, â€œAnalisis dan Penerapan Algoritma Support Vector Machine (SVM) dalam Data Mining untuk Menunjang Strategi Promosi,â€ JUITA J. Inform., vol. 7, no. 2, 2019, doi: 10.30595/juita.v7i2.4378.

R. R. Putra and C. Wadisman, â€œIMPLEMENTASI DATA MINING PEMILIHAN PELANGGAN POTENSIAL MENGGUNAKAN ALGORITMA K-MEANS,â€ Intecoms J. Inf. Technol. Comput. Sci., vol. 11, no. 1, pp. 1â€“5, 2018, [Online]. Available: http://link.springer.com/10.1007/978-3-319-59379-1%0Ahttp://dx.doi.org/10.1016/B978-0-12-420070-8.00002-7%0Ahttp://dx.doi.org/10.1016/j.ab.2015.03.024%0Ahttps://doi.org/10.1080/07352689.2018.1441103%0Ahttp://www.chile.bmw-motorrad.cl/sync/showroom/lam/es/.

H. D. Wijaya and S. Dwiasnati, â€œImplementasi Data Mining dengan Algoritma NaÃ¯ve Bayes pada Penjualan Obat,â€ J. Inform., vol. 7, no. 1, pp. 1â€“7, 2020, doi: 10.31311/ji.v7i1.6203.

H. H. Patel and P. Prajapati, â€œStudy and Analysis of Decision Tree Based Classification Algorithms,â€ Int. J. Comput. Sci. Eng., vol. 6, no. 10, pp. 74â€“78, 2018.

I. Ahmad, M. Basheri, M. J. Iqbal, and A. Rahim, â€œPerformance Comparison of Support Vector Machine, Random Forest, and Extreme Learning Machine for Intrusion Detection,â€ IEEE Access, vol. 6, pp. 33789â€“33795, 2018, doi: 10.1109/ACCESS.2018.2841987.

S. Mulyati, S. M. Husein, and Ramdhan, â€œRANCANG BANGUN APLIKASI DATA MINING PREDIKSI KELULUSAN UJIAN NASIONAL MENGGUNAKAN ALGORITMA (KNN) K-NEAREST NEIGHBOR DENGAN METODE EUCLIDEAN DISTANCE PADA SMPN 2 PAGEDANGAN,â€ J. Tek. Inform. Univ. Muhammadiyah Tangerang, vol. 4, no. 1, pp. 65â€“73, 2020.

I. A. Nikmatun and I. Waspada, â€œImplementasi Data Mining untuk Klasifikasi Masa Studi Mahasiswa Menggunakan Algoritma K-Nearest Neighbor,â€ J. SIMETRIS, vol. 10, no. 2, pp. 421â€“432, 2019.

A. M. Argina, â€œPenerapan Metode Klasifikasi K-Nearest Neigbor pada Dataset Penderita Penyakit Diabetes,â€ Indones. J. Data Sci., vol. 1, no. 2, pp. 29â€“33, 2020, doi: 10.33096/ijodas.v1i2.11.

N. M. Putry and B. N. Sari, â€œKOMPARASI ALGORITMA KNN DAN NAÃVE BAYES UNTUK KLASIFIKASI DIAGNOSIS PENYAKIT DIABETES MELITUS,â€ Evolusi J. Sains dan Manaj., vol. 10, no. 1, 2022.

R. Ahsana, R. R. Saedudin, and V. P. Widharta, â€œPerbandingan Akurasi Algoritma Adaboost Dan Algoritma Lightgbm Untuk Klasifikasi Penyakit Diabetes,â€ in e-Proceeding of Engineering, 2021, vol. 8, no. 5, pp. 9757â€“9764.

D. P. Utomo and Mesran, â€œAnalisis Komparasi Metode Klasifikasi Data Mining dan Reduksi Atribut Pada Data Set Penyakit Jantung,â€ Media Inform. Budidarma, vol. 4, no. 2, pp. 437â€“444, 2020.

D. P. Utomo, P. Sirait, and R. Yunis, â€œReduksi Atribut Pada Dataset Penyakit Jantung dan Klasifikasi Menggunakan Algoritma C5. 0,â€ J. Media Inform. Budidarma, vol. 4, no. 4, pp. 994â€“1006, 2020, doi: 10.30865/mib.v4i4.2355.

A. N. Kasanah, Muladi, and U. Pujianto, â€œPenerapan Teknik SMOTE untuk Mengatasi Imbalance Class dalam Klasifikasi Objektivitas Berita Online Menggunakan Algoritma KNN,â€ J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 3, no. 2, pp. 196â€“201, 2019.

Ardiyansyah, P. A. Rahayuningsih, and R. Maulana, â€œAnalisis Perbandingan Algoritma Klasifikasi Data Mining Untuk Dataset Blogger Dengan Rapid Miner,â€ J. Khatulistiwa Inform., vol. VI, no. 1, pp. 20â€“28, 2018.

A. Byna and M. Basit, â€œPenerapan Metode Adaboost Untuk Mengoptimasi Prediksi Penyakit Stroke Dengan Algoritma NaÃ¯ve Bayes,â€ J. Sisfokom (Sistem Inf. dan Komputer), vol. 9, no. 3, pp. 407â€“411, 2020, doi: 10.32736/sisfokom.v9i3.1023.

S. I. Gultom, â€œImplementasi Data Mining Menentukan Pola Hidup Sehat Bagi Pengguna KB Menggunakan Algoritma Adaboost ( Studi Kasus : Dinas Serdang Bedagai ),â€ J. Inf. dan Teknol. Ilm., vol. 7, no. 3, pp. 298â€“304, 2020.

Penerapan Algoritma Adaboost Untuk Peningkatan Kinerja Klasifikasi Data Mining Pada Imbalance Dataset Diabetes

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Menu Utama

flagcounter

template

statcounter

rji

terindex