Twitter Sentiment Analysis on Online Transportation in Indonesia Using Ensemble Stacking

Authors

DOI:

https://doi.org/10.30865/mib.v6i3.4359

Keywords:

Ensemble Stacking, Machine Learning, Online Transportation, Sentiment Analysis, Twitter

Abstract

Online transportation is a transportation innovation that has emerged along with the development of online-based applications that provide many features and conveniences. In its development, many users wrote their responses to the application on social media such as twitter. Many opinions and responses are directly conveyed by users of online transportation modes to their official accounts. The responses given by these users are very large and can be used as sentiment analysis on online transportation. However, the analysis process cannot be done manually. Therefore, we need a system that can help analyze user responses on Twitter automatically. In this study, a sentiment analysis system was built for online transportation in Indonesia using the ensemble stacking algorithm, which will simplify and increase the accuracy of the sentiment analysis. Ensemble stacking is a solution for advanced machine learning methods that can improve the performance of the base classifier. The system built on ensemble stacking uses three base classifiers, namely SVM kernel RBF, SVM linear kernel, and logistic regression. The best accuracy result on the gojek dataset is 88%, and the best F1 score is 87%. Ensemble Stacking which is applied to the research that the author conducted on online transportation sentiment analysis on twitter, obtained better accuracy than the base classifier used.

Author Biography

Yahya Setiawan, Telkom University, Bandung

Prodi Informatika

References

M. Tika Adilah, H. Supendar, R. Ningsih, S. Muryani, and K. Solecha, “Sentiment Analysis of Online Transportation Service using the Naïve Bayes Methods,†Journal of Physics: Conference Series., vol. 1641, no. 1, 2020, doi: 10.1088/1742-6596/1641/1/012093.

I. P. Windasari, F. N. Uzzi, and K. I. Satoto, “Sentiment analysis on Twitter posts: An analysis of positive or negative opinion on GoJek,†Proceedings - 2017 4th International Conference on Information Technology, Computer, and Electrical Engineering, ICITACEE 2017., vol. 2018-Janua, pp. 266–269, 2017, doi: 10.1109/ICITACEE.2017.8257715.

M. I. Petiwi, A. Triayudi, and I. D. Sholihati, “Analisis Sentimen Gofood Berdasarkan Twitter Menggunakan Metode Naïve Bayes dan Support Vector Machine,†Jurnal Media Informatika Budidarma., vol. 6, no. 1, p. 542, 2022, doi: 10.30865/mib.v6i1.3530.

E. B. S. Mila Putri Kartika Dewi, “Feature Expansion Using Word2vec for Hate Speech Detection on Indonesian Twitter with Classification Using SVM and Random Forest,†Jurnal Media Informatika Budidarma., vol. 6, no. April, pp. 979–988, 2022.

T. B. N. Hoang and J. Mothe, “Predicting information diffusion on Twitter – Analysis of predictive features,†Journal of Computational Science., vol. 28, pp. 257–264, 2018, doi: 10.1016/j.jocs.2017.10.010.

M. S. Akhtar, A. Ekbal, and E. Cambria, “How Intense Are You? Predicting Intensities of Emotions and Sentiments using Stacked Ensemble [Application Notes],†IEEE Computational Intelligence Magazine., vol. 15, no. 1, pp. 64–75, 2020, doi: 10.1109/MCI.2019.2954667.

P. P. Tribhuvan, S. G. Bhirud, and R. R. Deshmukh, “Stacking Ensemble Model for Polarity Classification in Feature Based Opinion Mining,†Indian Journal of Computer Science and Engineering., vol. 9, no. 3, pp. 91–95, 2018, doi: 10.21817/indjcse/2018/v9i3/180903004.

R. N. Harahap and K. Muslim, “Peningkatan Akurasi pada Prediksi Kepribadian Mbti Pengguna Twitter Menggunakan Augmentasi Data,†Jurnal Teknologi Informasi dan Ilmu Komputer., vol. 7, no. 4, p. 815, 2020, doi: 10.25126/jtiik.2020743622.

M. Tika Adilah, H. Supendar, R. Ningsih, S. Muryani, and K. Solecha, “Sentiment Analysis of Online Transportation Service using the Naïve Bayes MethodsNo Title,†Journal of Physics: Conference Series., vol. 1641, no. 1, 2020.

A. Serna, A. Soroa, and R. Agerri, “Applying deep learning techniques for sentiment analysis to assess sustainable transport,†Sustainability (Switzerland)., vol. 13, no. 4, pp. 1–19, 2021, doi: 10.3390/su13042397.

K. Zvarevashe and O. O. Olugbara, “A framework for sentiment analysis with opinion mining of hotel reviews,†2018 Conference on Information Communications Technology and Society, ICTAS 2018 - Proceedings, pp. 1–4, 2018, doi: 10.1109/ICTAS.2018.8368746.

P. H. Lai and R. Alfred, “An optimized multi-layer ensemble framework for sentiment analysis,†Proceedings - 2019 1st International Conference on Artificial Intelligence and Data Sciences, AiDAS 2019, pp. 158–163, 2019, doi: 10.1109/AiDAS47888.2019.8970949.

K. Ramadhan and K. M. L, “Analisis Sentimen Terhadap Toko Online Menggunakan Naïve Bayes pada Media Sosial Twitter,†e-Proceeding of Engineering., vol. 5, no. 3, pp. 8141–8151, 2018.

M. Khan and A. Malviya, “Big data approach for sentiment analysis of twitter data using Hadoop framework and deep learning,†International Conference on Emerging Trends in Information Technology and Engineering, ic-ETITE 2020, pp. 1–5, 2020, doi: 10.1109/ic-ETITE47903.2020.201.

R. A. Rizal, I. S. Girsang, and S. A. Prasetiyo, “Klasifikasi Wajah Menggunakan Support Vector Machine (SVM),†REMIK (Riset dan E-Jurnal Manaj. Inform. Komputer), vol. 3, no. 2, p. 1, 2019, doi: 10.33395/remik.v3i2.10080.

S. Sperandei, “Understanding logistic regression analysis,†Biochemia Medica, vol. 24, no. 1, pp. 12–18, 2014, doi: 10.11613/BM.2014.003.

A. R. Prananda and I. Thalib, “Sentiment Analysis for Customer Review: Case Study of GO-JEK Expansion,†Journal of Information Systems Engineering and Business Intelligence., vol. 6, no. 1, p. 1, 2020, doi: 10.20473/jisebi.6.1.1-8.

Downloads

Published

2022-07-25

Issue

Section

Articles