Telkom University Opinion Topic Modeling on Twitter Using Latent Dirichlet Allocation During Covid-19 Pandemic
DOI:
https://doi.org/10.30865/mib.v6i4.4426
Keywords:
Topic Modelling, Latent Dirichlet Allocation (LDA), Topic coherence, Public opinion, Term Frequency-Inverse Document Frequency (TF-IDF)
Abstract
In the current digital era, information technology is developing rapidly, and social media has grown along with it. One platform on the rise is Twitter. Because Twitter has many users around the world, it stores a large amount of data that can be put to use, for example to determine the categories of public opinion about a company or university. This study focuses on public opinion about Telkom University. Grouping opinions into categories makes it easier to identify the topics being discussed, but doing so manually takes a long time because of the large number of tweets. An automated method is therefore needed to categorize public opinion on Twitter. One such method is Latent Dirichlet Allocation (LDA), applied here to a dataset of tweets from Indonesian-language Twitter users. With this method, tweets can be grouped more efficiently at large scale. Among the models built, the best result was a coherence score of -15.33029 using the c_umass method, obtained with a combination of 9 topics, an alpha value of 0.31, and a beta value of 0.01.
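To illustrate what a UMass-style coherence score measures, the following is a minimal sketch in plain Python. It is not the authors' code. The function name `umass_coherence`, the toy corpus `toy_docs`, and the word lists are hypothetical and chosen only for illustration. The sketch assumes the commonly cited formulation, in which each pair of top topic words contributes log((D(w_m, w_l) + 1) / D(w_l)), where D counts the documents containing the given words. Some libraries average over word pairs and topics rather than summing, so absolute values may differ from those reported in the abstract.

```python
import math


def umass_coherence(top_words, documents, eps=1.0):
    """Sum of log((D(w_m, w_l) + eps) / D(w_l)) over ordered word pairs.

    top_words: topic words, ordered from most to least probable.
    documents: list of tokenized documents (lists of strings).
    eps: smoothing constant that avoids log(0) when words never co-occur.
    """
    doc_sets = [set(doc) for doc in documents]

    def doc_freq(*words):
        # Number of documents that contain every word in `words`.
        return sum(1 for d in doc_sets if all(w in d for w in words))

    score = 0.0
    for m in range(1, len(top_words)):
        for l in range(m):
            d_l = doc_freq(top_words[l])
            if d_l == 0:
                # Skip words absent from the corpus (assumption: ignore them).
                continue
            d_ml = doc_freq(top_words[m], top_words[l])
            score += math.log((d_ml + eps) / d_l)
    return score


# Hypothetical toy corpus of tokenized Indonesian-language tweets.
toy_docs = [
    ["kampus", "kuliah", "mahasiswa"],
    ["kampus", "kuliah"],
    ["kampus", "wisuda"],
    ["vaksin", "covid"],
]

# Words that often appear together versus words that never co-occur.
coherent = umass_coherence(["kampus", "kuliah"], toy_docs)
incoherent = umass_coherence(["kampus", "vaksin"], toy_docs)
print(coherent, incoherent)
```

Under this formulation, scores are typically zero or negative, and values closer to zero indicate that a topic's top words co-occur more often. A search over the number of topics, alpha, and beta, as described in the abstract, would compare candidate models by a score of this kind.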
License

This work is licensed under a Creative Commons Attribution 4.0 International License