Analisis Data Mining Klasifikasi Berita Hoax COVID 19 Menggunakan Algoritma Naive Bayes

 Fani Prasetya (Universitas Bina Darma, Palembang, Indonesia)
 (*)Ferdiansyah Ferdiansyah Mail (Universitas Bina Darma, Palembang, Indonesia)

(*) Corresponding Author

Submitted: September 16, 2022; Published: September 30, 2022


The rapid dissemination of information along with the rapid development of technology along with the massive speed of electronic media and the internet. But the rapid spread of news cannot guarantee that the information and news that we get can be validated from valid sources. Based on data released by Kominfo at the end of 2021, there were 1773 hoax news that were successfully clarified from the hoax news. Then during the Covid-19 pandemic itself, there were various hoaxes circulating in the community. Throughout 2021, the Ministry of Communications and Informatics discovered as many as 723 hoaxes about Covid-19. Based on the background above, the researchers and previous studies have discussed hoax detection in various fields. Such as, fraud detection in online writing style [1], classification of hoax news based on machine learning [3] and the application of nave Bayes and PSO algorithms for classification of hoax news on social media [4]. From here the researchers tried to carry out experiments on the nave Bayes classification algorithm to classify hoax covid 19 news. Based on the results of research that has been done, the nave Bayes model and cross validation can classify hoax news well, the resulting accuracy is 86.3% where 80-90% included in the good classification criteria. The data that is predicted to be incorrect is also not too much from a total of 300 datasets, only 41 are declared incorrect in labeling less than 2% of the total dataset, so it can be concluded that this model can be used as a reference if you want to proceed to a more complex prediction model, for example the model prediction using web-based machine learning.


Covid-19; Classification; Hoax; Naive Bayes

Full Text:


Article Metrics

Abstract view : 186 times
PDF - 96 times


S. Afroz, M. Brennan, and R. Greenstadt, “Detecting hoaxes, frauds, and deception in writing style online,” in 2012 IEEE Symposium on Security and Privacy, 2012, pp. 461–475.

S. Banerjee, A. Y. K. Chua, and J.-J. Kim, “Using supervised learning to classify authentic and fake online reviews,” in Proceedings of the 9th international conference on ubiquitous information management and communication, 2015, pp. 1–7.

E. Rasywir and A. Purwarianti, “Eksperimen pada sistem klasifikasi berita hoax berbahasa Indonesia berbasis pembelajaran mesin,” J. Cybermatika, vol. 3, no. 2, 2016.

R. Wati and others, “Penerapan Algoritma Naive Bayes Dan Particle Swarm Optimization Untuk Klasifikasi Berita Hoax Pada Media Sosial,” JITK (Jurnal Ilmu Pengetah. Dan Teknol. Komputer), vol. 5, no. 2, pp. 159–164, 2020.

L. Ishwara, Catatan-catatan jurnalisme dasar, vol. 1. Penerbit Buku Kompas, 2005.

S. Kasman, “Sistem Verifikasi Menangkal Berita Hoax di Media Cetak,” J. Mimb. Kesejaht. Sos., vol. 2, no. 1, 2019.

T. Kompas, “Data Sebaran Hoaks Sepanjang 2021, Terbanyak soal Pandemi Covid-19,” 2022. [Online]. Available: [Accessed: 16-May-2022].

Novrizaldi, “Tingkat Literasi Indonesia Memprihatinkan, Kemenko PMK Siapkan Peta Jalan Pembudayaan Literasi Nasional.” [Online]. Available: survei yang dilakukan Program,yang memiliki tingkat literasi rendah. [Accessed: 16-May-2022].

J. Han, M. Kamber, and J. Pei, “Data Mining: Concepts and Techniques Third Edition [M],” Morgan Kaufmann Ser. Data Manag. Syst., vol. 5, no. 4, pp. 83–124, 2011.

B. Liu, “Sentiment analysis and opinion mining,” Synth. Lect. Hum. Lang. Technol., vol. 5, no. 1, pp. 1–167, 2012.

M. W. Berry and J. Kogan, Text mining: applications and theory. John Wiley & Sons, 2010.

N. Herlinawati, Y. Yuliani, S. Faizah, W. Gata, and S. Samudi, “Analisis Sentimen Zoom Cloud Meetings di Play Store Menggunakan Na{"i}ve Bayes dan Support Vector Machine,” CESS (Journal Comput. Eng. Syst. Sci., vol. 5, no. 2, pp. 293–298.

L. L. Dhande and G. K. Patnaik, “Analyzing sentiment of movie review data using Naive Bayes neural classifier,” Int. J. Emerg. Trends & Technol. Comput. Sci., vol. 3, no. 4, pp. 313–320, 2014.

A. Wibowo, “10 FOLD-CROSS VALIDATION,” 2017.

S. Narkhede, “Understanding auc-roc curve,” Towar. Data Sci., vol. 26, pp. 220–227, 2018.

Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Analisis Data Mining Klasifikasi Berita Hoax COVID 19 Menggunakan Algoritma Naive Bayes


  • There are currently no refbacks.

Copyright (c) 2022 Fani Prasetya, Ferdiansyah

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

Jurnal Sistem Komputer dan Informatika (JSON)
Dikelola oleh STMIK Budi Darma
Sekretariat : Jln. Sisingamangaraja No. 338 Telp 061-7875998
email :

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.