Aspect-Based Sentiment Analysis on Twitter Using Long Short-Term Memory Method

Siti Inayah Putri; Erwin Budi Setiawan; Yuliant Sibaroni

doi:10.30865/mib.v7i2.5637

Authors

Siti Inayah Putri Telkom University, Bandung
Erwin Budi Setiawan Telkom University, Bandung
Yuliant Sibaroni Telkom University, Bandung

DOI:

https://doi.org/10.30865/mib.v7i2.5637

Keywords:

Aspect-Based Analysis Sentiment, Movie Review, LSTM, Fasttext, TF-IDF, SMOTE

Abstract

Twitter is one of the most popular social media among Indonesian people. Due to the high number of users and the intensity of their use, Twitter can also be used to dig up information related to a topic or product with sentiment analysis. One of the most frequently discussed topics on Twitter is related to movie reviews. Everyone's opinion of movie reviews can refer to different aspects. So, aspect-based sentiment analysis can be applied to movie reviews to get more optimal results. Aspect-based sentiment analysis is a solution to find out the opinions of Twitter users on movie reviews based on the aspects. In this study, a system for aspect-based sentiment analysis was built with a dataset of Indonesian language movie reviews consisting of 3 aspects: plot, acting, and director. The classification model uses Long Short-Term Memory (LSTM) method with the application of TF-IDF feature extraction, fastText feature expansion, and handling of imbalanced data using SMOTE. The results of this study for the plot aspect obtained an accuracy score of 74.86% and F1-score of 74.74%, the acting aspect obtained an accuracy score of 94.80% and F1-score of 94.74%, and the director aspect obtained an accuracy score of 94.02% and F1-score of 93.89%.

References

S. A. el Rahman, F. A. AlOtaibi, and W. A. AlShehri, â€œSentiment Analysis of Twitter Data,â€ in 2019 international conference on computer and information sciences (ICCIS), IEEE, 2019, pp. 1â€“4.

Z. Drus and H. Khalid, â€œSentiment Analysis in Social Media and Its Application: Systematic Literature Review,â€ Procedia Comput Sci, vol. 161, pp. 707â€“714, 2019, doi: 10.1016/j.procs.2019.11.174.

F. Hemmatian and M. K. Sohrabi, â€œA survey on classification techniques for opinion mining and sentiment analysis,â€ Artif Intell Rev, vol. 52, no. 3, pp. 1495â€“1545, Oct. 2019, doi: 10.1007/s10462-017-9599-6.

N. S. Fathullah, Y. A. Sari, and P. P. Adikara, â€œAnalisis Sentimen Terhadap Rating dan Ulasan Film dengan menggunakan Metode Klasifikasi NaÃ¯ve Bayes dengan Fitur Lexicon-Based,â€ J. Pengemb. Teknol. Inf. dan Ilmu Komput, vol. 4, no. 2, pp. 590â€“593, 2020.

B. N. Saha and A. Senapati, â€œLong Short Term Memory (LSTM) based Deep Learning for Sentiment Analysis of English and Spanish Data,â€ in 2020 International Conference on Computational Performance Evaluation (ComPE), IEEE, 2020, pp. 442â€“446.

L. Zhang, S. Wang, and B. Liu, â€œDeep learning for sentiment analysis: A survey,â€ Wiley Interdiscip Rev Data Min Knowl Discov, vol. 8, no. 4, p. e1253, 2018.

L. C. Cheng and S. L. Tsai, â€œDeep learning for automated sentiment analysis of social media,â€ in Proceedings of the 2019 IEEE/ACM international conference on advances in social networks analysis and mining, 2019, pp. 1001â€“1004.

A. Yadav and D. K. Vishwakarma, â€œSentiment analysis using deep learning architectures: a review,â€ Artif Intell Rev, vol. 53, no. 6, pp. 4335â€“4385, 2020.

F. Miedema, â€œSentiment Analysis with Long Short-Term Memory networks,â€ Vrije Universiteit Amsterdam, vol. 1, pp. 1â€“17, 2018.

S. M. Qaisar, â€œSentiment Analysis of IMDb Movie Reviews Using Long Short-Term Memory,â€ in 2020 2nd International Conference on Computer and Information Sciences (ICCIS), IEEE, 2020, pp. 1â€“4.

R. Ahuja, A. Chug, S. Kohli, S. Gupta, and P. Ahuja, â€œThe Impact of Features Extraction on the Sentiment Analysis,â€ Procedia Comput Sci, vol. 152, pp. 341â€“348, 2019, doi: 10.1016/j.procs.2019.05.008.

R. DziseviÄ and D. Å eÅ¡ok, â€œText Classification using Different Feature Extraction Approaches,â€ 2019 Open Conference of Electrical, Electronic and Information Sciences (eStream), 2019.

E. Anggi, â€œText Classification on Disaster Tweets with LSTM and Word Embedding | by Emmanuella Anggi | Towards Data Science,â€ 2020. https://towardsdatascience.com/text-classification-on-disaster-tweets-with-lstm-and-word-embedding-df35f039c1db (accessed May 23, 2022).

H. R. Alhakiem and E. B. Setiawan, â€œAspect-Based Sentiment Analysis on Twitter Using Logistic Regression with FastText Feature Expansion,â€ Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 6, no. 5, pp. 840â€“846, Nov. 2022, doi: 10.29207/resti.v6i5.4429.

S. A. Alex, N. Z. Jhanjhi, M. Humayun, A. O. Ibrahim, and A. W. Abulfaraj, â€œDeep LSTM Model for Diabetes Prediction with Class Balancing by SMOTE,â€ Electronics (Switzerland), vol. 11, no. 17, Sep. 2022, doi: 10.3390/electronics11172737.

A. FernÃ¡ndez, S. GarcÃa, F. Herrera, and N. v Chawla, â€œSMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-year Anniversary,â€ Journal of artificial intelligence research, vol. 61, pp. 863â€“905, 2018.

B. Athiwaratkun, A. G. Wilson, and A. Anandkumar, â€œProbabilistic FastText for Multi-Sense Word Embeddings,â€ 2018.

B. Wang, A. Wang, F. Chen, Y. Wang, and C.-C. J. Kuo, â€œEvaluating word embedding models: methods and experimental results,â€ APSIPA Trans Signal Inf Process, vol. 8, 2019.

S. Seo, C. Kim, H. Kim, K. Mo, and P. Kang, â€œComparative Study of Deep Learning-Based Sentiment Classification,â€ IEEE Access, vol. 8, pp. 6861â€“6875, 2020, doi: 10.1109/ACCESS.2019.2963426.

F. Landi, L. Baraldi, M. Cornia, and R. Cucchiara, â€œWorking Memory Connections for LSTM,â€ Neural Networks, vol. 144, pp. 334â€“341, Dec. 2021, doi: 10.1016/j.neunet.2021.08.030.

A. Suresh, â€œWhat is a confusion matrix?,â€ 2020. https://medium.com/analytics-vidhya/what-is-a-confusion-matrix-d1c0f8feda5 (accessed May 15, 2022).