Sentiment Analysis of the Jakarta - Bandung Fast Train Project Using the SVM Method

 Muhammad Daffa Dhiyaulhaq (Telkom University, Bandung, Indonesia)
 (*)Putu Harry Gunawan Mail (Telkom University, Bandung, Indonesia)

(*) Corresponding Author

Submitted: October 2, 2023; Published: October 31, 2023


Web growth contributes greatly to user-generated content such as user feedback, opinions and reviews. The construction of the Jakarta-Bandung High Speed Train is both an icon and a momentum for Indonesia to modernize mass transportation in an era of continuous progress. Sentiment analysis is one of the text-based research field solutions suitable for addressing satisfaction issues based on user reviews. In this research, the system will be made with review sentences from users and produce output in the form of positive and negative classes. The method used by the author is classification using the Support Vector Machine (SVM) method and Word2Vec extraction features. In addition, a comparison of the accuracy value between the Support Vector Machine method, Naïve Bayes method and TF-IDF extraction features is carried out. The data studied came from several news websites containing user reviews of the Jakarta-Bandung High Speed Train. This method is used because it represents words in a vector, besides that the training process is faster when compared to other extraction features. This research resulted in the performance of accurasy, precision, recall, and f1-score, namely accurasy of 82.74%, precision of 75.68%, recall of 97.67%, and f1-score of 85.28%. These results were obtained using the best tuning hyperparameters, namely ('C': 10, 'gamma': 0.1, 'kernel': 'rbf'). Then in the second scenario a comparison is made with the Naïve Bayes method. It was found that the accuracy of the Support Vector Machine method using the TF-IDF extraction feature obtained better and stable performance results than the other three performance results, which amounted to 86.90%. So the author concludes that the Support Vector Machine method using the TF-IDF extraction feature is better when compared to the Naïve Bayes method and the Word2vec extraction feature.


Fast Train; Sentiment Analysis; SVM; Word2vec

Full Text:


Article Metrics

Abstract view : 119 times
PDF - 27 times


B. Jonathan, J. I. Sihotang, and S. Martin, “Sentiment Analysis of Customer Reviews in Zomato Bangalore Restaurants Using Random Forest Classifier,” Abstract Proceedings International Scholars Conference, vol. 7, no. 1, pp. 1719–1728, Dec. 2019, doi: 10.35974/isc.v7i1.1003.

C. Nanda, M. Dua, and G. Nanda, “Sentiment analysis of movie reviews in hindi language using machine learning,” in 2018 International Conference on Communication and Signal Processing (ICCSP), 2018, pp. 1069–1072.

J. Nayak, B. Naik, and H. S. Behera, “A comprehensive survey on support vector machine in data mining tasks: applications & challenges,” International Journal of Database Theory and Application, vol. 8, no. 1, pp. 169–186, 2015.

P. H. Gunawan, T. D. Alhafidh, B. A. Wahyudi, and others, “The sentiment analysis of spider-man: No way home film based on imdb reviews,” Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), vol. 6, no. 1, pp. 177–182, 2022.

A. P. Perdana, P. H. Gunawan, and N. Aquarini, “Web-Based Machine Learning Bi-Class Congestion Identification Based on Audio Data Application,” in 2022 2nd International Conference on Intelligent Cybernetics Technology & Applications (ICICyTA), 2022, pp. 278–282.

A. M. Rahat, A. Kahir, and A. K. M. Masum, “Comparison of Naive Bayes and SVM Algorithm based on sentiment analysis using review dataset,” in 2019 8th International Conference System Modeling and Advancement in Research Trends (SMART), 2019, pp. 266–270.

E. M. Alshari, A. Azman, S. Doraisamy, N. Mustapha, and M. Alkeshr, “Effective method for sentiment lexical dictionary enrichment based on Word2Vec for sentiment analysis,” in 2018 fourth international conference on information retrieval and knowledge management (CAMP), 2018, pp. 1–5.

A. Firmanto, R. Sarno, and others, “Prediction of movie sentiment based on reviews and score on rotten tomatoes using sentiwordnet,” in 2018 International Seminar on Application for Technology of Information and Communication, 2018, pp. 202–206.

J. S. Lee, D. Zuba, and Y. Pang, “Sentiment analysis of Chinese product reviews using gated recurrent unit,” in Proceedings - 5th IEEE International Conference on Big Data Service and Applications, BigDataService 2019, Workshop on Big Data in Water Resources, Environment, and Hydraulic Engineering and Workshop on Medical, Healthcare, Using Big Data Technologies, Institute of Electrical and Electronics Engineers Inc., Apr. 2019, pp. 173–181. doi: 10.1109/BigDataService.2019.00030.

A. R. FITRIANSYAH, “Analisis Sentimen Terhadap Pembangunan Kereta Cepat Jakarta - Bandung pada Media Sosial Twitter Menggunakan Metode SVM dan GloVe Word Embedding,” Jurnal Tugas Akhir Fakultas Informatika, vol. 9, no. 5, pp. 6078–6083, 2022.

J. Kaur and P. K. Buttar, “A systematic review on stopword removal algorithms,” International Journal on Future Revolution in Computer Science & Communication Engineering, vol. 4, no. 4, pp. 207–210, 2018.

J. K. Raulji and J. R. Saini, “Stop-word removal algorithm and its implementation for Sanskrit language,” Int J Comput Appl, vol. 150, no. 2, pp. 15–17, 2016.

T. Rahman, F. E. M. Agustin, and N. F. Rozy, “Normalization of Unstructured Indonesian Tweet Text For Presidential Candidates Sentiment Analysis,” in 2019 7th International Conference on Cyber and IT Service Management (CITSM), 2019, pp. 1–6.

W. G. S. Parwita, “A document recommendation system of stemming and stopword removal impact: A web-based application,” in Journal of Physics: Conference Series, Institute of Physics Publishing, Feb. 2020. doi: 10.1088/1742-6596/1469/1/012050.

N. Yusliani, R. Primartha, and M. Diana, “Multiprocessing stemming: A case study of Indonesian stemming,” Int J Comput Appl, vol. 182, no. 40, pp. 15–19, 2019.

D. Khyani, B. S. Siddhartha, N. M. Niveditha, and B. M. Divya, “An interpretation of lemmatization and stemming in natural language processing,” Journal of University of Shanghai for Science and Technology, vol. 22, no. 10, pp. 350–357, 2021.

M. Yasen and S. Tedmori, “Movies reviews sentiment analysis and classification,” in 2019 IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology, JEEIT 2019 - Proceedings, 2019, pp. 860–865. doi: 10.1109/JEEIT.2019.8717422.

Q. Pan, H. Dong, Y. Wang, Z. Cai, L. Zhang, and M. Nogueira, “Recommendation of crowdsourcing tasks based on word2vec semantic tags,” Wirel Commun Mob Comput, vol. 2019, 2019, doi: 10.1155/2019/2121850.

S. M. Fati, A. Muneer, A. Alwadain, and A. O. Balogun, “Cyberbullying Detection on Twitter Using Deep Learning-Based Attention Mechanisms and Continuous Bag of Words Feature Extraction,” Mathematics, vol. 11, no. 16, p. 3567, 2023, doi: 10.3390/math11163567.

M. J. Lavin, Z. Leblanc, and Q. Dombrowski, “The Programming Historian Analyzing Documents with TF-IDF,” Programming Historian, pp. 1–21, 2020.

D. Darwis, N. Siskawati, and Z. Abidin, “Penerapan Algoritma Naive Bayes Untuk Analisis Sentimen Review Data Twitter Bmkg Nasional,” Jurnal Tekno Kompak, vol. 15, no. 1, p. 131, 2021, doi: 10.33365/jtk.v15i1.744.

Suhariyanto, A. Firmanto, and R. Sarno, “Prediction of Movie Sentiment based on Reviews and Score on Rotten Tomatoes using SentiWordnet,” 2018. doi: 10.1109/ISEMANTIC.2018.8549704.

Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Sentiment Analysis of the Jakarta - Bandung Fast Train Project Using the SVM Method


  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

STMIK Budi Darma
Secretariat: Sisingamangaraja No. 338 Telp 061-7875998

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.