Sentiment Analysis on Twitter Social Media towards Climate Change on Indonesia Using IndoBERT Model

Authors

  • Muhammad Fadhil Mubaraq Telkom University, Bandung
  • Warih Maharani Telkom University, Bandung

DOI:

https://doi.org/10.30865/mib.v6i4.4570

Keywords:

Climate Change, Sentiment Analysis, Twitter, Indobert, Bert

Abstract

The phenomenon of climate change is a change in temperature and weather patterns in the long term. This incident became a frightening specter for everyone because consciously or unconsciously the bad effects of climate change are already in sight. This has become an urgency for all levels of society so that this topic has become quite hot on Social Media, especially on Twitter. The topic of climate change in Indonesia on Twitter Social Media can be analyzed so that it can be seen how people's sentiments towards this phenomenon. This research utilizes the Transformer architecture, namely IndoBERT, IndoBERT itself is the development of the BERT architecture by the IndoNLU team which has 74 million words from various Bahasa Indonesia sources. Therefore, this method was chosen in the hope of helping sentiment analysis on the topic of climate change so that public sentiment can be mapped. The test results obtained an F1-Score values of 95.6% with a tuning parameter of 0.00002 learning rate and 16 of batch size. Hopefully the results of this research can be used in future research.

References

“What Is Climate Change? | United Nations.†https://www.un.org/en/climatechange/what-is-climate-change (accessed Aug. 07, 2022).

“About Twitter | Our company purpose, principles, leadership.†https://about.twitter.com/en/who-we-are/our-company (accessed Aug. 07, 2022).

“Twitter’s Daily Active Users Increase By 13 Percent In Q3 2021 / Digital Information World.†https://www.digitalinformationworld.com/2021/10/twitters-daily-active-users-increase-by.html (accessed Aug. 07, 2022).

N. Kankanamge, T. Yigitcanlar, A. Goonetilleke, and M. Kamruzzaman, “Determining disaster severity through social media analysis: Testing the methodology with South East Queensland Flood tweets,†Int. J. Disaster Risk Reduct., vol. 42, p. 101360, Jan. 2020, doi: 10.1016/J.IJDRR.2019.101360.

B. Liu, “Sentiment Analysis and Opinion Mining,†http://dx.doi.org/ 10.2200/S00416ED1V01Y201204HLT016 , vol. 5, no. 1, pp. 1–184, May 2012, doi: 10.2200/S00416ED1V01Y201204HLT016.

A. Layalia Safara Az-Zahra Gunawan and K. Muslim Lhaksamana, “Analisis Sentimen pada Media Sosial Twitter terhadap Penanganan Bencana Banjir di Jawa Barat dengan Metode Jaringan Saraf Tiruan Sentiment Analysis On Twitter Social Media On Flood Disaster Management In West Java With Neural Network Methodâ€.

F. Rozi et al., “ANALISIS SENTIMEN PADA TWITTER MENGENAI PASCA BENCANA MENGGUNAKAN METODE NAÃVE BAYES DENGAN FITUR N-GRAM,†J. Inform. Polinema, vol. 6, no. 2, pp. 33–39, Mar. 2020, doi: 10.33795/JIP.V6I2.316.

J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,†NAACL HLT 2019 - 2019 Conf. North Am. Chapter Assoc. Comput. Linguist. Hum. Lang. Technol. - Proc. Conf., vol. 1, no. Mlm, pp. 4171–4186, 2019.

R. Rahutomo and B. Pardamean, Finetunning IndoBERT to Understand Indonesian Stock Trader Slang Language.

B. Wilie et al., “IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding,†2020, [Online]. Available: http://arxiv.org/abs/2009.05387

F. Koto, A. Rahimi, J. H. Lau, and T. Baldwin, “IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP,†pp. 757–770, 2021, doi: 10.18653/v1/2020.coling-main.66.

E. Acuña, “PREPROCESSING IN DATA MININGâ€.

J. Cheng and R. Greiner, “Comparing Bayesian Network Classifiers,†pp. 101–108, 2013, [Online]. Available: http://arxiv.org/abs/1301.6684

M. E. Peters et al., “Deep contextualized word representations,†NAACL HLT 2018 - 2018 Conf. North Am. Chapter Assoc. Comput. Linguist. Hum. Lang. Technol. - Proc. Conf., vol. 1, pp. 2227–2237, Feb. 2018, doi: 10.18653/v1/n18-1202.

“GitHub - IndoNLP/indonlu: The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020).†https://github.com/IndoNLP/indonlu (accessed Aug. 07, 2022).

P. Singh, N. Singh, K. K. Singh, and A. Singh, “Diagnosing of disease using machine learning,†Mach. Learn. Internet Med. Things Healthc., pp. 89–111, Jan. 2021, doi: 10.1016/B978-0-12-821229-5.00003-3.

I. Menarianti, “Klasifikasi data mining dalam menentukan pemberian kredit bagi nasabah koperasi,†J. Ilm. Teknosains, vol. 1, no. 1, pp. 1–10, 2015, [Online]. Available: http://e-jurnal.upgrismg.ac.id/index.php/JITEK/article/view/836

I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, 2016.

Downloads

Published

2022-10-25