Sentiment Analysis on Twitter Social Media towards Climate Change on Indonesia Using IndoBERT Model

Muhammad Fadhil Mubaraq; Warih Maharani

doi:10.30865/mib.v6i4.4570

Authors

Muhammad Fadhil Mubaraq Telkom University, Bandung
Warih Maharani Telkom University, Bandung

DOI:

https://doi.org/10.30865/mib.v6i4.4570

Keywords:

Climate Change, Sentiment Analysis, Twitter, Indobert, Bert

Abstract

The phenomenon of climate change is a change in temperature and weather patterns in the long term. This incident became a frightening specter for everyone because consciously or unconsciously the bad effects of climate change are already in sight. This has become an urgency for all levels of society so that this topic has become quite hot on Social Media, especially on Twitter. The topic of climate change in Indonesia on Twitter Social Media can be analyzed so that it can be seen how people's sentiments towards this phenomenon. This research utilizes the Transformer architecture, namely IndoBERT, IndoBERT itself is the development of the BERT architecture by the IndoNLU team which has 74 million words from various Bahasa Indonesia sources. Therefore, this method was chosen in the hope of helping sentiment analysis on the topic of climate change so that public sentiment can be mapped. The test results obtained an F1-Score values of 95.6% with a tuning parameter of 0.00002 learning rate and 16 of batch size. Hopefully the results of this research can be used in future research.

References

â€œWhat Is Climate Change? | United Nations.â€ https://www.un.org/en/climatechange/what-is-climate-change (accessed Aug. 07, 2022).

â€œAbout Twitter | Our company purpose, principles, leadership.â€ https://about.twitter.com/en/who-we-are/our-company (accessed Aug. 07, 2022).

â€œTwitterâ€™s Daily Active Users Increase By 13 Percent In Q3 2021 / Digital Information World.â€ https://www.digitalinformationworld.com/2021/10/twitters-daily-active-users-increase-by.html (accessed Aug. 07, 2022).

N. Kankanamge, T. Yigitcanlar, A. Goonetilleke, and M. Kamruzzaman, â€œDetermining disaster severity through social media analysis: Testing the methodology with South East Queensland Flood tweets,â€ Int. J. Disaster Risk Reduct., vol. 42, p. 101360, Jan. 2020, doi: 10.1016/J.IJDRR.2019.101360.

B. Liu, â€œSentiment Analysis and Opinion Mining,â€ http://dx.doi.org/ 10.2200/S00416ED1V01Y201204HLT016 , vol. 5, no. 1, pp. 1â€“184, May 2012, doi: 10.2200/S00416ED1V01Y201204HLT016.

A. Layalia Safara Az-Zahra Gunawan and K. Muslim Lhaksamana, â€œAnalisis Sentimen pada Media Sosial Twitter terhadap Penanganan Bencana Banjir di Jawa Barat dengan Metode Jaringan Saraf Tiruan Sentiment Analysis On Twitter Social Media On Flood Disaster Management In West Java With Neural Network Methodâ€.

F. Rozi et al., â€œANALISIS SENTIMEN PADA TWITTER MENGENAI PASCA BENCANA MENGGUNAKAN METODE NAÃVE BAYES DENGAN FITUR N-GRAM,â€ J. Inform. Polinema, vol. 6, no. 2, pp. 33â€“39, Mar. 2020, doi: 10.33795/JIP.V6I2.316.

J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, â€œBERT: Pre-training of deep bidirectional transformers for language understanding,â€ NAACL HLT 2019 - 2019 Conf. North Am. Chapter Assoc. Comput. Linguist. Hum. Lang. Technol. - Proc. Conf., vol. 1, no. Mlm, pp. 4171â€“4186, 2019.

R. Rahutomo and B. Pardamean, Finetunning IndoBERT to Understand Indonesian Stock Trader Slang Language.

B. Wilie et al., â€œIndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding,â€ 2020, [Online]. Available: http://arxiv.org/abs/2009.05387

F. Koto, A. Rahimi, J. H. Lau, and T. Baldwin, â€œIndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP,â€ pp. 757â€“770, 2021, doi: 10.18653/v1/2020.coling-main.66.

E. AcuÃ±a, â€œPREPROCESSING IN DATA MININGâ€.

J. Cheng and R. Greiner, â€œComparing Bayesian Network Classifiers,â€ pp. 101â€“108, 2013, [Online]. Available: http://arxiv.org/abs/1301.6684

M. E. Peters et al., â€œDeep contextualized word representations,â€ NAACL HLT 2018 - 2018 Conf. North Am. Chapter Assoc. Comput. Linguist. Hum. Lang. Technol. - Proc. Conf., vol. 1, pp. 2227â€“2237, Feb. 2018, doi: 10.18653/v1/n18-1202.

â€œGitHub - IndoNLP/indonlu: The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020).â€ https://github.com/IndoNLP/indonlu (accessed Aug. 07, 2022).

P. Singh, N. Singh, K. K. Singh, and A. Singh, â€œDiagnosing of disease using machine learning,â€ Mach. Learn. Internet Med. Things Healthc., pp. 89â€“111, Jan. 2021, doi: 10.1016/B978-0-12-821229-5.00003-3.

I. Menarianti, â€œKlasifikasi data mining dalam menentukan pemberian kredit bagi nasabah koperasi,â€ J. Ilm. Teknosains, vol. 1, no. 1, pp. 1â€“10, 2015, [Online]. Available: http://e-jurnal.upgrismg.ac.id/index.php/JITEK/article/view/836

I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, 2016.