Analisis Perbandingan Algoritma SVM, Logistic Regression, Naive Bayes, dan XGBoost Untuk Deteksi Fake News

Authors

  • Umar Farid Al Faqihi Universitas Nahdlatul Ulama Sunan Giri, Bojonegoro
  • Afril Efan Pajri Universitas Nahdlatul Ulama Sunan Giri, Bojonegoro
  • Muhammad Jauhar Vikri Universitas Nahdlatul Ulama Sunan Giri, Bojonegoro

DOI:

https://doi.org/10.30865/jurikom.v13i1.9492

Keywords:

Fake news, SVM, Logistic Regression, Naive Bayes, XGBoost

Abstract

The rapid growth of digital technology and internet access has completely changed how information is shared, enabling content to spread quickly across various online platforms. However, these advancements have also made it easier for misleading or entirely fabricated news to circulate, posing serious risks to social stability, political environments, and public health. This study tackles this problem by employing several machine learning-based classification methods for analyzing textual data. Four algorithms Support Vector Machine (SVM), Logistic Regression (LR), Naive Bayes (NB), and Extreme Gradient Boosting (XGBoost) were applied to detect linguistic patterns that differentiate genuine news from fake content. A major contribution of this research is the creation of a custom dataset gathered directly from Indonesian online news portals, comprising a total of 4,909 entries. The evaluation results demonstrate exceptionally high accuracy across the models: 99.69% for SVM, 99.39% for LR, 99.29% for NB, and 99.19% for XGBoost. To verify reliability, each model was further evaluated using cross-validation, yielding average accuracy scores of 99.57% (SVM), 99.52% (LR), 99.44% (NB), and 99.49% (XGBoost). These findings confirm that all four classifiers are highly effective and well-suited for identifying fake news in text-based data.

References

[1] F. Olan, U. Jayawickrama, E. O. Arakpogun, J. Suklan, and S. Liu, “Fake news on Social Media: the Impact on Society,” Inf. Syst. Front., vol. 26, no. 2, pp. 443–458, 2024, doi: 10.1007/s10796-022-10242-z.

[2] O. N. Cahyani and F. Budiman, “Performa Logistic Regression dan Naive Bayes dalam Klasifikasi Berita Hoax di Indonesia,” Edumatic J. Pendidik. Inform., vol. 9, no. 1, pp. 60–68, 2025, doi: 10.29408/edumatic.v9i1.28987.

[3] Admidppid, “Rekap Isu Hoaks & Disinformasi Tahun 2025,” Ppid.Diskominfo.Jatengprov.Go.Id. Accessed: Nov. 18, 2025. [Online]. Available: https://ppid.diskominfo.jatengprov.go.id/rekap-isu-hoaks-disinformasi-tahun-2025/

[4] A. Sarjito, “Hoaks, Disinformasi, dan Ketahanan Nasional: Ancaman Teknologi Informasi dalam Masyarakat Digital Indonesia,” J. Gov. Local Polit., vol. 5, no. 2, pp. 175–186, 2021.

[5] H. S. Pratama, “Menghadapi Berita Palsu,” p. 24, 2019.

[6] C. Juditha and J. J. Darmawan, “Komunikasi Politik Terkait Hoaks Pada Pemilu Presiden Indonesia 2024,” J. Stud. Komun. dan Media Komdigi, vol. 28, p. 178, 2024, doi: 10.17933/jskm.2024.5682.

[7] A. Anshari, “Penyuluhan Hukum Dampak Beredarnya Berita Hoax dan Pencegahannya di Situasi Covid-19 Melalui Sosial Media,” J. Bul. Al-Ribaath, vol. 20, no. 1, p. 30, 2023, doi: 10.29406/br.v20i1.5802.

[8] N. Amaly and A. Armiah, “Peran Kompetensi Literasi Digital Terhadap Konten Hoaks dalam Media Sosial [The Role of Digital Literacy Competence in Hoax Content on Social Media],” Alhadharah J. Ilmu Dakwah, vol. 20, no. 2, pp. 43–52, 2021.

[9] N. Chamidah and R. Sahawaly, “Comparison Support Vector Machine and Naive Bayes Methods for Classifying Cyberbullying in Twitter,” J. Ilm. Tek. Elektro Komput. dan Inform., vol. 7, no. 2, p. 338, 2021, doi: 10.26555/jiteki.v7i2.21175.

[10] N. E. Febriyanty, M. A. Hariyadi, and C. Crysdian, “Hoax Detection News Using Naïve Bayes and Support Vector Machine Algorithm,” Int. J. Adv. Data Inf. Syst., vol. 4, no. 2, pp. 191–200, 2023, doi: 10.25008/ijadis.v4i2.1306.

[11] M. Jauhar Vikri, I. Wisma Dwi Prastya, U. Pradema Sanjaya, and M. Agung Barata, “Rice Quality Identification Built on Indonesian Food Standards Based on Electronic Nose using Naïve Bayes Algorithm,” INOVTEK Polbeng - Seri Inform., vol. 10, no. 1, pp. 49–60, 2025, doi: 10.35314/0y0xct32.

[12] A. M. Wahid, Turino, K. A. Nugroho, D. Titi Safitri4, and F. S. Utomo, “Optimasi Logistic Regression dan Random Forest untuk Deteksi Berita Hoax Optimasi Logistic Regression dan Random Forest untuk Deteksi Berita Hoax Berbasis Hyperparameter Optimization of Logistic Regression and Random Forest for Hoax News Detection Using T,” J. Pendidik. dan Teknol. Indones., no. January, 2025, doi: 10.52436/1.jpti.602.

[13] B. Hamdikatama, “Beyond Algorithms: an Integrated Approach To Fake News Detection Using Machine Learning Techniques,” JITK (Jurnal Ilmu Pengetah. dan Teknol. Komputer), vol. 10, no. 3, pp. 609–622, 2025, doi: 10.33480/jitk.v10i3.6061.

[14] F. Hukum, C. B. Devina, D. C. Iswari, G. Christian, B. Goni, and D. Kimberly, “Kosmik Hukum,” vol. 21, no. 1, pp. 44–58, 2021.

[15] Febriansyah Putra and H. Patra, “Analisis Hoax pada Pemilu: Tinjauan dari Perspektif Pendidikan Politik,” Naradidik J. Educ. Pedagog., vol. 2, no. 1, pp. 95–102, 2023, doi: 10.24036/nara.v2i1.119.

[16] Dinda Marta Almas Zakirah, “Pengaruh Hoax Di Media Sosial Terhadap Preferensi Sosial Politik Remaja Di Surabaya,” Mediakita, vol. 4, no. 1, pp. 37–36, 2020, doi: 10.30762/mediakita.v4i1.2446.

[17] I. A. Ropikoh, R. Abdulhakim, U. Enri, and N. Sulistiyowati, “Penerapan Algoritma Support Vector Machine (SVM) untuk Klasifikasi Berita Hoax Covid-19,” J. Appl. Informatics Comput., vol. 5, no. 1, pp. 64–73, 2021, doi: 10.30871/jaic.v5i1.3167.

[18] R. K. Putri and M. Athoillah, “Support Vector Machine Untuk Identifikasi Berita Hoax Terkait Virus Corona (Covid-19),” J. Inform. J. Pengemb. IT, vol. 6, no. 3, pp. 162–167, 2021, doi: 10.30591/jpit.v6i3.2489.

[19] S. Murugesan and K. P. Kaliyamurthie, “A Machine Learning Framework for Automatic Fake News Detection in Indian Tamil News Channels,” Ing. des Syst. d’Information, vol. 28, no. 1, pp. 205–209, 2023, doi: 10.18280/isi.280123.

[20] Y. Sibaroni and S. S. Prasetiyowati, “Buzzer Detection on Indonesian Twitter using SVM and Account Property Feature Extension,” J. RESTI, vol. 6, no. 4, pp. 663–669, 2022, doi: 10.29207/resti.v6i4.4338.

[21] H. A. Santoso, E. H. Rachmawanto, A. Nugraha, A. A. Nugroho, D. R. I. M. Setiadi, and R. S. Basuki, “Hoax classification and sentiment analysis of Indonesian news using Naive Bayes optimization,” Telkomnika (Telecommunication Comput. Electron. Control., vol. 18, no. 2, pp. 799–806, 2020, doi: 10.12928/TELKOMNIKA.V18I2.14744.

[22] N. W. S. Saraswati, I. P. K. S. Putra, I. D. M. K. Muku, and G. D. Pramitha, “Support Vector Machine For Hoax Detection,” SINTECH (Science Inf. Technol. J., vol. 6, no. 2, pp. 107–117, Aug. 2023, doi: 10.31598/sintechjournal.v6i2.1366.

[23] N. H. Ovirianti, M. Zarlis, and H. Mawengkang, “Support Vector Machine Using A Classification Algorithm,” SinkrOn, vol. 7, no. 3, pp. 2103–2107, 2022, doi: 10.33395/sinkron.v7i3.11597.

[24] D. K. Sharma and S. Garg, “IFND: a benchmark dataset for fake news detection,” Complex Intell. Syst., vol. 9, no. 3, pp. 2843–2863, 2023, doi: 10.1007/s40747-021-00552-1.

[25] M. Sudhakar and K. P. Kaliyamurthie, “Detection of fake news from social media using support vector machine learning algorithms,” Meas. Sensors, vol. 32, no. January, p. 101028, 2024, doi: 10.1016/j.measen.2024.101028.

[26] A. Frenica, L. Lindawati, L. Lindawati, S. Soim, and S. Soim, “Implementasi Algoritma Support Vector Machine (SVM) untuk Deteksi Banjir,” INOVTEK Polbeng - Seri Inform., vol. 8, no. 2, p. 291, 2023, doi: 10.35314/isi.v8i2.3443.

[27] K. K. Ray et al., “Guava leaf disease detection using support vector machine (SVM),” Smart Agric. Technol., vol. 12, Dec. 2025, doi: 10.1016/j.atech.2025.101190.

Additional Files

Published

2026-02-28

How to Cite

Umar Farid Al Faqihi, Afril Efan Pajri, & Muhammad Jauhar Vikri. (2026). Analisis Perbandingan Algoritma SVM, Logistic Regression, Naive Bayes, dan XGBoost Untuk Deteksi Fake News. JURNAL RISET KOMPUTER (JURIKOM), 13(1), 462–473. https://doi.org/10.30865/jurikom.v13i1.9492