Telkom University Opinion Topic Modeling on Twitter Using Latent Dirichlet Allocation During Covid-19 Pandemic

Authors

  • Tandya Rizky Pratama Telkom University, Bandung
  • Donni Richasdy Telkom University, Bandung
  • Mahendra Dwifebri Purbolaksono Telkom University, Bandung

DOI:

https://doi.org/10.30865/mib.v6i4.4426

Keywords:

Topic Modelling, Latent Dirichlet Allocation (LDA), Topic coherence, Public opinion, Term Frequency — Inverse Document Frequency (TF-IDF)

Abstract

In the current digital era, the development of information technology is growing rapidly. The development of information technology is followed by the development of social media, one of the social media that is on the rise is Twitter. Because there are many Twitter users around the world, Twitter stores a lot of data that can be used for something, one of which is to determine the category of public opinion about a company or university, in this study the focus is more on the category of public opinion about Telkom University. The public opinion can be grouped or categorized to make it easier to determine the topic being discussed. Determining opinions manually will take a long time due to the large number of tweets. Therefore, there must be another method to determine the categories of public opinion on Twitter. One of them is the Latent Dirichlet Allocation (LDA) method with a dataset of tweets of Indonesian-language Twitter users. With this method, grouping tweets on a large scale is more efficient. From the modeling made, the most optimum results obtained with a coherence score using the c_umass method of -15.33029 with a combination of 9 topics, 0.31 alpha value, and 0.01 beta value.

References

Pestov, I. (n.d.). Today’s Incredible Numbers About Social Media | by Ilya Pestov | Medium. Retrieved June 30, 2022, from https://medium.com/@ipestov/todays-incredible-numbers-about-social-media-a6b1ff2ca887

Purwadi, M. (n.d.). Telkom University Kembali Jadi PTS Terbaik versi THE AUR 2022. Retrieved June 30, 2022, from https://edukasi.sindonews.com/read/787197/211/telkom-university-kembali-jadi-pts-terbaik-versi-the-aur-2022-1654225596

Institute of Electrical and Electronics Engineers. Indonesia Section., & Institute of Electrical and Electronics Engineers. (2017). 2017 International Conference on Sustainable Information Engineering and Technology (SIET) : proceedings : Batu City, Indonesia, November 24th-25th 2017. (A. F. and M. M. R. Hidayatullah, Ed.).

Putri, S. A., Daru Kusuma, P., & Setianingsih, C. (2021). CLUSTERING TOPIK PADA DATA SENTIMEN BPJS KESEHATAN MENGGUNAKAN METODE LATENT DIRICHLET ALLOCATION TOPIC CLUSTERING ON SENTIMENT DATA OF BPJS KESEHATAN USING LATENT DIRICHLET ALLOCATION METHOD.

Blei, D. M., & Lafferty, J. D. (2009). Topic Models.

Choirul Rahmadan, M., Nizar Hidayanto, A., Swadani Ekasari, D., Purwandari, B., & Theresiawati. (2020). Sentiment Analysis and Topic Modelling Using the LDA Method related to the Flood Disaster in Jakarta on Twitter. Proceedings - 2nd International Conference on Informatics, Multimedia, Cyber, and Information System, ICIMCIS 2020, 126–130. https://doi.org/10.1109/ICIMCIS51567.2020.9354320

Wicaksono Arianto, B., & Anuraga, G. (2020). Pemodelan Topik Pengguna Twitter Mengenai Aplikasi “Ruangguru†Topic Modeling for Twitter Users Regarding the “Ruanggguru†Application (Vol. 21, Issue 2).

Hidayatullah, A., & Pembrani, E. (2018). ICCCS 2018 : 2018 3rd International Conference on Computer and Communication Systems : April 27-30, 2018, Nagoya, Japan.

Anggraini, E. (n.d.). LATENT DIRICHLET ALLOCATION UNTUK PEMODELAN TOPIK ABSTRAK DOKUMEN SKRIPSI TUGAS AKHIR.

Cai, Y., & Sun, J.-T. (2009). Text Mining. In L. LIU & M. T. ÖZSU (Eds.), Encyclopedia of Database Systems (pp. 3061–3065). Springer US. https://doi.org/10.1007/978-0-387-39940-9_418

Feldman, R., & Sanger, J. (2007). The text mining handbook : advanced approaches in analyzing unstructured data. Cambridge University Press.

Robertson, S. (2005). Understanding Inverse Document Frequency: On theoretical arguments for IDF (Issue 5).

Doig, C. (n.d.). topic-modeling: Topic modeling with python. Retrieved June 30, 2022, from http://chdoig.github.io/pytexas2015-topic-%20modeling

Newman, D., Lau, J. H., Grieser, K., & Baldwin, T. (2010). Automatic Evaluation of Topic Coherence. Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, 100–108.

Kapadia, S. (2019). Evaluate Topic Models: Latent Dirichlet Allocation (LDA) | by Shashank Kapadia | Towards Data Science. https://towardsdatascience.com/evaluate-topic-model-in-python-latent-dirichlet-allocation-lda-7d57484bb5d0

Lee, D., Institute of Electrical and Electronics Engineers, IEEE ITSS, & Florida International University. (n.d.). IEEE ISI2018 : IEEE International Conference on Intelligence and Security Informatics : November 8-10, 2018, Florida International University, Miami FL.

Sievert, C., & Shirley, K. E. (2014). LDAvis: A method for visualizing and interpreting topics. 63–70. https://doi.org/10.13140/2.1.1394.3043

Downloads

Published

2022-10-25