Aspect-Based Sentiment Analysis on Twitter Using Long Short-Term Memory Method

Authors

  • Siti Inayah Putri Telkom University, Bandung
  • Erwin Budi Setiawan Telkom University, Bandung
  • Yuliant Sibaroni Telkom University, Bandung

DOI:

https://doi.org/10.30865/mib.v7i2.5637

Keywords:

Aspect-Based Analysis Sentiment, Movie Review, LSTM, Fasttext, TF-IDF, SMOTE

Abstract

Twitter is one of the most popular social media among Indonesian people. Due to the high number of users and the intensity of their use, Twitter can also be used to dig up information related to a topic or product with sentiment analysis. One of the most frequently discussed topics on Twitter is related to movie reviews. Everyone's opinion of movie reviews can refer to different aspects. So, aspect-based sentiment analysis can be applied to movie reviews to get more optimal results. Aspect-based sentiment analysis is a solution to find out the opinions of Twitter users on movie reviews based on the aspects. In this study, a system for aspect-based sentiment analysis was built with a dataset of Indonesian language movie reviews consisting of 3 aspects: plot, acting, and director. The classification model uses Long Short-Term Memory (LSTM) method with the application of TF-IDF feature extraction, fastText feature expansion, and handling of imbalanced data using SMOTE. The results of this study for the plot aspect obtained an accuracy score of 74.86% and F1-score of 74.74%, the acting aspect obtained an accuracy score of 94.80% and F1-score of 94.74%, and the director aspect obtained an accuracy score of 94.02% and F1-score of 93.89%.

References

S. A. el Rahman, F. A. AlOtaibi, and W. A. AlShehri, “Sentiment Analysis of Twitter Data,†in 2019 international conference on computer and information sciences (ICCIS), IEEE, 2019, pp. 1–4.

Z. Drus and H. Khalid, “Sentiment Analysis in Social Media and Its Application: Systematic Literature Review,†Procedia Comput Sci, vol. 161, pp. 707–714, 2019, doi: 10.1016/j.procs.2019.11.174.

F. Hemmatian and M. K. Sohrabi, “A survey on classification techniques for opinion mining and sentiment analysis,†Artif Intell Rev, vol. 52, no. 3, pp. 1495–1545, Oct. 2019, doi: 10.1007/s10462-017-9599-6.

N. S. Fathullah, Y. A. Sari, and P. P. Adikara, “Analisis Sentimen Terhadap Rating dan Ulasan Film dengan menggunakan Metode Klasifikasi Naïve Bayes dengan Fitur Lexicon-Based,†J. Pengemb. Teknol. Inf. dan Ilmu Komput, vol. 4, no. 2, pp. 590–593, 2020.

B. N. Saha and A. Senapati, “Long Short Term Memory (LSTM) based Deep Learning for Sentiment Analysis of English and Spanish Data,†in 2020 International Conference on Computational Performance Evaluation (ComPE), IEEE, 2020, pp. 442–446.

L. Zhang, S. Wang, and B. Liu, “Deep learning for sentiment analysis: A survey,†Wiley Interdiscip Rev Data Min Knowl Discov, vol. 8, no. 4, p. e1253, 2018.

L. C. Cheng and S. L. Tsai, “Deep learning for automated sentiment analysis of social media,†in Proceedings of the 2019 IEEE/ACM international conference on advances in social networks analysis and mining, 2019, pp. 1001–1004.

A. Yadav and D. K. Vishwakarma, “Sentiment analysis using deep learning architectures: a review,†Artif Intell Rev, vol. 53, no. 6, pp. 4335–4385, 2020.

F. Miedema, “Sentiment Analysis with Long Short-Term Memory networks,†Vrije Universiteit Amsterdam, vol. 1, pp. 1–17, 2018.

S. M. Qaisar, “Sentiment Analysis of IMDb Movie Reviews Using Long Short-Term Memory,†in 2020 2nd International Conference on Computer and Information Sciences (ICCIS), IEEE, 2020, pp. 1–4.

R. Ahuja, A. Chug, S. Kohli, S. Gupta, and P. Ahuja, “The Impact of Features Extraction on the Sentiment Analysis,†Procedia Comput Sci, vol. 152, pp. 341–348, 2019, doi: 10.1016/j.procs.2019.05.008.

R. DziseviÄ and D. Å eÅ¡ok, “Text Classification using Different Feature Extraction Approaches,†2019 Open Conference of Electrical, Electronic and Information Sciences (eStream), 2019.

E. Anggi, “Text Classification on Disaster Tweets with LSTM and Word Embedding | by Emmanuella Anggi | Towards Data Science,†2020. https://towardsdatascience.com/text-classification-on-disaster-tweets-with-lstm-and-word-embedding-df35f039c1db (accessed May 23, 2022).

H. R. Alhakiem and E. B. Setiawan, “Aspect-Based Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion,†Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 6, no. 5, pp. 840–846, Nov. 2022, doi: 10.29207/resti.v6i5.4429.

S. A. Alex, N. Z. Jhanjhi, M. Humayun, A. O. Ibrahim, and A. W. Abulfaraj, “Deep LSTM Model for Diabetes Prediction with Class Balancing by SMOTE,†Electronics (Switzerland), vol. 11, no. 17, Sep. 2022, doi: 10.3390/electronics11172737.

A. Fernández, S. García, F. Herrera, and N. v Chawla, “SMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-year Anniversary,†Journal of artificial intelligence research, vol. 61, pp. 863–905, 2018.

B. Athiwaratkun, A. G. Wilson, and A. Anandkumar, “Probabilistic FastText for Multi-Sense Word Embeddings,†2018.

B. Wang, A. Wang, F. Chen, Y. Wang, and C.-C. J. Kuo, “Evaluating word embedding models: methods and experimental results,†APSIPA Trans Signal Inf Process, vol. 8, 2019.

S. Seo, C. Kim, H. Kim, K. Mo, and P. Kang, “Comparative Study of Deep Learning-Based Sentiment Classification,†IEEE Access, vol. 8, pp. 6861–6875, 2020, doi: 10.1109/ACCESS.2019.2963426.

F. Landi, L. Baraldi, M. Cornia, and R. Cucchiara, “Working Memory Connections for LSTM,†Neural Networks, vol. 144, pp. 334–341, Dec. 2021, doi: 10.1016/j.neunet.2021.08.030.

A. Suresh, “What is a confusion matrix?,†2020. https://medium.com/analytics-vidhya/what-is-a-confusion-matrix-d1c0f8feda5 (accessed May 15, 2022).

Downloads

Published

2023-04-27

Issue

Section

Articles