Data Mining Dalam Analisis Faktor Drop Out Mahasiswa Menerapkan Algoritma Decision Tree

 (*)Mulkan Azhari Mail (Universitas Muhammadiyah Sumatera Utara, Medan, Indonesia)
 Halim Maulana (Universitas Muhammadiyah Sumatera Utara, Medan, Indonesia)
 Ferdy Riza (Universitas Muhammadiyah Sumatera Utara, Medan, Indonesia)

(*) Corresponding Author

Submitted: January 19, 2024; Published: April 30, 2024

Abstract

Graduation accuracy is one of the indicators used in assessing the suitability of undergraduate programs as a functional unit of higher education. Knowing the factors that influence student graduation time helps study programs and faculties make decisions to increase the number of students who graduate on time. The purpose of this research is to obtain an overview of the factors that influence students in the accuracy of completing the study time by using the Machine Learning algorithm, namely Decision Tree, which is expected to have high classification efficiency and good description so that it can increase the number of students graduating on time. The methods used to determine student dropout factors are Classification and Regression Tree (CART) and LightGBM. The data used is the data of undergraduate students of Universitas Muhammadiyah Sumatera Utara in 2019. The quality of classification can be read from the accuracy, sensitivity and specification values. The result using CART is 95.1% with the most influencing factors are GPA, faculty, lecture time and predicate while Lightgbm is 83% with the most influencing factors are GPA, gender, lecture time and faculty. Decision tree can be used to determine student dropout factors because of its high accuracy with GPA being the main factor.

Keywords


CART; Decision Tree; Dropout; Lightgbm

Full Text:

PDF


Article Metrics

Abstract view : 155 times
PDF - 42 times

References

H. Abijono, P. Santoso, and N. L. Anggreini, “Algoritma Supervised Learning Dan Unsupervised Learning Dalam Pengolahan Data,” J. Teknol. Terap. G-Tech, vol. 4, no. 2, pp. 315–318, 2021, doi: 10.33379/gtech.v4i2.635.

M. Azhari, S. Efendi, P. Purwanto, and H. Maulana, “Applying the simplex method to optimize employee distribution to increase sales internet data package,” Indones. J. Educ. Math. Sci., vol. 4, no. 3, pp. 143–148, 2023, doi: https://doi.org/10.30596/ijems.v4i3.15151.

S. Wiyono, D. S. Wibowo, M. F. Hidayatullah, and D. Dairoh, “Comparative Study of KNN, SVM and Decision Tree Algorithm for Student’s Performance Prediction,” Int. J. Comput. Sci. Appl. Math., vol. 6, no. 2, p. 50, 2020, doi: 10.12962/j24775401.v6i2.4360.

R. M. Aguilar-Chinea, I. C. Rodriguez, C. Expósito, B. Melian-Batista, and J. M. Moreno-Vega, “Using a decision tree algorithm to predict the robustness of a transshipment schedule,” Procedia Comput. Sci., vol. 149, pp. 529–536, 2019, doi: 10.1016/j.procs.2019.01.172.

R. Latifah, E. S. Wulandari, and P. E. Kreshna, “Model Decision Tree untuk Prediksi Jadwal Kerja menggunakan Scikit-Learn,” Semin. Nas. Sains dan Teknol., vol. 16, pp. 1–6, 2019.

D. Zhang et al., “An Interpretable Station Delay Prediction Model Based on Graph Community Neural Network and Time-Series Fuzzy Decision Tree,” IEEE Trans. Fuzzy Syst., vol. 31, no. 2, pp. 421–433, 2023, doi: 10.1109/TFUZZ.2022.3181453.

F. Manzella, G. Pagliarini, G. Sciavicco, and I. E. Stan, “The voice of COVID-19: Breath and cough recording classification with temporal decision trees and random forests,” Artif. Intell. Med., vol. 137, no. December 2022, p. 102486, 2023, doi: 10.1016/j.artmed.2022.102486.

J. Y. Nakayama, J. Ho, E. Cartwright, R. Simpson, and V. S. Hertzberg, “Predictors of progression through the cascade of care to a cure for hepatitis C patients using decision trees and random forests,” Comput. Biol. Med., vol. 134, no. April, p. 104461, 2021, doi: 10.1016/j.compbiomed.2021.104461.

H. Toyoda, H., Terai, H., Yamada, K., Kato, M., Suzuki, A., Takahashi, S., ... & Nakamura, “decision tree analysis to predict clinical outcome of minimally invasive lumbar decompression surgery for lumbar spinal stenosis with and without coexisting spondylolisthesis and scoliosis.,” Spine J., pp. 23(7), 973-981., 2023.

I. W. Misshuari, R. Herdiana, F. Farikhin, T. Afrizal, and J. Saputra, “Factors that affect customer credit payments during Covid 19 pandemic: An application of Light Gradient Boosting Machine (LightGBM) and Classification and Regression Tree (CART),” Ind. Eng. Oper. Manag. Singapore, pp. 5083–5091, 2021.

B. Charbuty and A. Abdulazeez, “Classification Based on Decision Tree Algorithm for Machine Learning,” J. Appl. Sci. Technol. Trends, vol. 2, no. 01, pp. 20–28, 2021, doi: 10.38094/jastt20165.

R. Ahmad, S. A. Po, and R. . Ageng, “Pemanfaatan machine learning dalam berbagai bidang: Review paper,” Indones. J. Comput. Inf. Technol., vol. 5, no. 1, pp. 75–82, 2020, doi: 10.31294/ijcit.v5i1.7951.

F. Dwi Meliani Achmad, Budanis, Slamat, “Klasifikasi Data Karyawan Untuk Menentukan Jadwal Kerja Menggunakan Metode Decision Tree,” J. IPTEK, vol. 16, no. 1, pp. 18–23, 2019, [Online]. Available: http://jurnal.itats.ac.id/wp-content/uploads/2013/06/3.-BUDANIS-FINAL-hal-17-23.pdf

Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Data Mining Dalam Analisis Faktor Drop Out Mahasiswa Menerapkan Algoritma Decision Tree

Refbacks

  • There are currently no refbacks.


Copyright (c) 2024 JURNAL MEDIA INFORMATIKA BUDIDARMA

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.



JURNAL MEDIA INFORMATIKA BUDIDARMA
STMIK Budi Darma
Secretariat: Sisingamangaraja No. 338 Telp 061-7875998
Email: mib.stmikbd@gmail.com

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.