Evaluasi Pengaruh RFE Terhadap Kinerja Random Forest dengan SVM pada Klasifikasi Kemiskinan Kabupaten/Kota Indonesia

Authors

  • Shafa Kirana Aralia Universitas Nahdlatul Ulama Sunan Giri, Bojonegoro
  • Mula Agung Barata Universitas Nahdlatul Ulama Sunan Giri, Bojonegoro
  • Ita Aristia Sa'ida Universitas Nahdlatul Ulama Sunan Giri, Bojonegoro

DOI:

https://doi.org/10.30865/jurikom.v13i1.9527

Keywords:

Recursive Feature Elimination, Random Forest, Support Vector Machine, Classificatio, Poverty

Abstract

Poverty is a socio-economic issue that remains a concern in Indonesia, with differences in development characteristics between districts/cities causing wide variations in indicators that are intercorrelated. Feature redundancy and the existence of extreme values have the potential to reduce the generalization ability of classification models and reduce the interpretability of results. Therefore, an approach is needed that not only produces high accuracy but is also capable of identifying the most relevant indicators. Therefore, an approach is needed that not only produces high accuracy but is also capable of identifying the most relevant indicators. This study aims to evaluate the effect of Recursive Feature Elimination (RFE) on the performance of Support Vector Machine (SVM) and Random Forest in classifying the poverty status of districts/cities in Indonesia. The dataset used consists of 514 observations with two target classes, namely non-poor and poor. The preprocessing stage included data cleaning and outlier handling using the IQR capping method, then the data was divided into 80% training data and 20% test data. Testing was conducted on four scenarios: SVM, SVM+RFE, Random Forest, and Random Forest+RFE. Evaluation used a confusion matrix, accuracy, precision, recall, and F1-score. The results show that RFE does not change the accuracy of SVM (0.971), but improves the performance of Random Forest from 0.981 to 0.99 and improves the precision of the minority class. The Random Forest+RFE combination is the most effective and efficient configuration for regional poverty classification.

References

[1] World Bank, “Poverty Overview: Data dan tren kemiskinan global, termasuk definisi garis kemiskinan ekstrem.” Accessed: Jan. 19, 2025. [Online]. Available: https://www.worldbank.org/en/topic/poverty/overview?utm_source=chatgpt.com

[2] A. Heryati and T. Setiawan Saputra, “Optimizing Socioeconomic Features for Poverty Prediction in South Sumatera,” TIERS Inf. Technol. J., vol. 6, no. 1, pp. 16–32, 2025.

[3] M. F. M. Khalik and F. Arifin, “Klasifikasi Indeks Kedalaman Kemiskinan Provinsi Sulawesi Selatan Berbasis Decision Tree, K-Nearest Neighbor, Naive Bayes, Neural Network, dan Random Forest,” J. Edukasi dan Penelit. Inform., vol. 9, no. 2, p. 282, 2023, doi: 10.26418/jp.v9i2.67492.

[4] M. Y. Sofian, R. Dwijaya, and S. Rachmalija, “Kebijakan Anti Kemiskinan Program Pemerintah Dalam Penananggulangan Kemiskinan Di Indonesia,” Vol. 2, No. 10, Pp. 3209–3218, 2022.

[5] N. Putu, N. Hendayanti, and M. Nurhidayati, “Regresi Logistik Biner dalam Penentuan Ketepatan Klasifikasi Tingkat Kedalaman Kemiskinan Provinsi-Provinsi di Indonesia,” vol. 12, no. 2, 2020.

[6] A. Of, D. Mining, F. To, and P. Data, “Penerapan Data mining Dan Forecasting Terhadap Data Kemiskinan Di Indonesia Application of Data mining and Forecasting To Poverty Data in,” no. September 2019, pp. 375–383, 2024.

[7] M. S. Hartawan, M. Erkamim, and S. R. Yahya, “Application of Supervised Learning Algorithm for Classification of Family Hope Program Penerapan Algoritma Supervised Learning untuk Klasifikasi Program Keluarga Harapan,” vol. 3, no. October, pp. 83–91, 2023.

[8] E. F. Laili et al., “Komparasi Algoritma Decision Tree Dan Support Vector Machine ( Svm ) Dalam,” Vol. 8, No. 1, Pp. 67–76, 2025.

[9] D. Daniel, “Poverty Prediction using Random Forest based Machine Learning Technique,” vol. 10, no. 04, pp. 153–157, 2021.

[10] L. Nuzula, A. Prahutama, A. R. Hakim, F. Sains, and U. Diponegoro, “Klasifikasi Status Kemiskinan Rumah Tangga Dengan Metode Support Vector Machines ( Svm ) Dan Classification And Regression Trees ( Cart ) Menggunakan Gui R ( Studi Kasus Di Kabupaten Wonosobo Tahun 2018 ),” Vol. 9, Pp. 525–534, 2020.

[11] V. No, Z. A. Mukharyahya, Y. P. Astuti, and O. N. Cahyani, “Edumatic : Jurnal Pendidikan Informatika Perbandingan Naive Bayes dan Support Vector Machine dalam Klasifikasi Tingkat Kemiskinan di Indonesia,” vol. 9, no. 1, pp. 119–128, 2025.

[12] C. Human and D. Index, “Perbandingan Metode Ensemble Learning : Random Forest , Support Vector Machine , AdaBoost pada Klasifikasi Indeks Pembangunan Manusia ( IPM ) Comparison of Ensemble Learning Method : Random Forest , Support Vector,” vol. 12, pp. 206–218, 2023.

[13] A. M. Priyatno, T. Widiyaningtyas, I. Engineering, and U. N. Malang, “A Systematic Literature Review : Recursive Feature,” Vol. 9, No. 2, Pp. 196–207, 2024, Doi: 10.33480/Jitk.V9i2.5015.Introduction.

[14] N. Aqmar, H. Wijayanto, and F. M. Afendi, “Performance Analysis of Machine Learning Models using RFE Feature Selection and Bayesian Optimization in Imbalanced Data Classification with Shap-Based Explanations,” vol. 12, no. 3, pp. 539–554, 2025, doi: 10.15294/sji.v12i3.31459.

[15] A. Amir, F. Fachruddin, F. Idris, M. Safriani, R. Saefuddin, and I. Sakti, “Perbandingan Model Decision Tree dan Random Forest,” vol. 12, no. 5, pp. 695–704, 2025, doi: 10.30865/jurikom.v12i5.8672.

[16] A. Informatics and A. Info, “Pendekatan Data - Driven untuk Pengembangan Model Prediksi Tingkat Kemiskinan di Provinsi Indonesia,” vol. 8, no. 1, pp. 84–92, 2025.

[17] A. R. Alsaber and J. Pan, “Handling Complex Missing Data Using Random Forest Approach for an Air Quality Monitoring Dataset : A Case Study of Kuwait Environmental Data ( 2012 to 2018 ),” 2021.

[18] B. Satria, T. Azhima, Y. Siswa, and W. J. Pranoto, “Optimasi Random Forest dengan Genetic Algorithm dan Recursive Feature Elimination pada High Dimensional Data Stunting Samarinda,” vol. 8, pp. 1778–1789, 2024, doi: 10.30865/mib.v8i3.7883.

[19] W. Apriliah, I. Kurniawan, M. Baydhowi, and T. Haryati, “Prediksi Kemungkinan Diabetes pada Tahap Awal Menggunakan Algoritma Klasifikasi Random Forest,” Sistemasi, vol. 10, no. 1, p. 163, 2021, doi: 10.32520/stmsi.v10i1.1129.

[20] I. Kurniawan et al., “Implementasi Algoritma Random Forest Untuk Menentukan Implementation Of Random Forest Algorithm For Determining Recipients Of Raskin,” Vol. 10, No. 2, Pp. 421–428, 2023, Doi: 10.25126/Jtiik.202396225.

[21] S. Adi, S. Mola, Y. C. Luttu, and D. N. Rumlaklak, “Perbandingan Metode Machine Learning dalam Analisis Sentimen Komentar Pengguna Aplikasi InDriver pada Dataset Tidak Seimbang,” vol. 03, 2024, doi: 10.21456/vol14iss3pp247-255.

[22] S. D. Amalia, M. A. Barata, and P. E. Yuwita, “Optimization of Random Forest Algorithm with Backward Elimination Method in Classification of Academic Stress Levels,” vol. 9, no. 3, 2025.

Additional Files

Published

2026-02-28

How to Cite

Shafa Kirana Aralia, Mula Agung Barata, & Ita Aristia Sa’ida. (2026). Evaluasi Pengaruh RFE Terhadap Kinerja Random Forest dengan SVM pada Klasifikasi Kemiskinan Kabupaten/Kota Indonesia. JURNAL RISET KOMPUTER (JURIKOM), 13(1), 285–295. https://doi.org/10.30865/jurikom.v13i1.9527