Collaborative Filtering with Dimension Reduction Technique and Clustering for E-Commerce Product

Authors

DOI:

https://doi.org/10.30865/mib.v7i1.5538

Keywords:

Recommender System, Collaborative Filtering, Amazon Review Dataset, K-Means, Singular Value Decomposition

Abstract

The rapid development of internet users over the last decade has led to an increase in the use of electronic commerce (e-commerce). The existence of a recommender system influences the success of e-commerce. Collaborative Filtering (CF) is one of the most frequently used recommender system methods. However, in real cases, sparsity problems generally occur. This is generally caused because only a small number of users give ratings to items. In this study, we propose the combination of clustering and dimension reduction methods on the Amazon Review Data to overcome the sparsity problem. The clustering method with K-Means is used to group users based on item preferences. Meanwhile, we used Singular Value Decomposition (SVD) for dimension reduction to improve the performance of the recommender system in sparse data. The results show that the combination of SVD and K-Means is successful in predicting ratings with an RMSE value of less than 2, significant performance increase compared to previous study. The use of SVD is proven to be able to overcome sparsity, with a decrease in RMSE of 9.372%.

Author Biographies

Daffa Barin Tizard Riyadi, Telkom University, Bandung

School of Computing

Z K A Baizal, Telkom University, Bandung

School of Computing

References

“Citation & Republication — DataReportal – Global Digital Insights.†https://datareportal.com/citation (accessed Jan. 13, 2023).

C. Xu, D. Peak, and V. Prybutok, “A customer value, satisfaction, and loyalty perspective of mobile application recommendations,†Decis Support Syst, vol. 79, pp. 171–183, Nov. 2015, doi: 10.1016/J.DSS.2015.08.008.

W. Lu, S. Chen, K. Li, and L. V. S. Lakshmanan, “Show Me the Money: Dynamic Recommendations for Revenue Maximization,†Proceedings of the VLDB Endowment, vol. 7, no. 14, pp. 1785–1796, Aug. 2014, doi: 10.14778/2733085.2733086.

B. Ramzan et al., “An Intelligent Data Analysis for Recommendation Systems Using Machine Learning,†Sci Program, vol. 2019, 2019, doi: 10.1155/2019/5941096.

P. M. Chu and S. J. Lee, “A novel recommender system for E-commerce,†Proceedings - 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics, CISP-BMEI 2017, vol. 2018-January, pp. 1–5, Feb. 2018, doi: 10.1109/CISP-BMEI.2017.8302310.

H. Zarzour, Z. Al-Sharif, M. Al-Ayyoub, and Y. Jararweh, “A new collaborative filtering recommendation algorithm based on dimensionality reduction and clustering techniques,†2018 9th International Conference on Information and Communication Systems, ICICS 2018, vol. 2018-January, pp. 102–106, May 2018, doi: 10.1109/IACS.2018.8355449.

C. C. Aggarwal, “Data Mining,†2015, doi: 10.1007/978-3-319-14142-8.

H. Liu, J. Wu, T. Liu, D. Tao, and Y. Fu, “Spectral Ensemble Clustering via Weighted K-Means: Theoretical and Practical Evidence,†IEEE Trans Knowl Data Eng, vol. 29, no. 5, pp. 1129–1143, May 2017, doi: 10.1109/TKDE.2017.2650229.

A. A. Fakhri, Z. K. A. Baizal, and E. B. Setiawan, “Restaurant Recommender System Using User-Based Collaborative Filtering Approach: A Case Study at Bandung Raya Region,†J Phys Conf Ser, vol. 1192, no. 1, May 2019, doi: 10.1088/1742-6596/1192/1/012023.

M. S. Ayundhita, Z. K. A. Baizal, and Y. Sibaroni, “Ontology-based conversational recommender system for recommending laptop,†in Journal of Physics: Conference Series, 2019, vol. 1192, no. 1. doi: 10.1088/1742-6596/1192/1/012020.

M. I. Ardimansyah, A. F. Huda, and Z. K. A. Baizal, “Preprocessing matrix factorization for solving data sparsity on memory-based collaborative filtering,†in Proceeding - 2017 3rd International Conference on Science in Information Technology: Theory and Application of IT for Education, Industry and Society in Big Data Era, ICSITech 2017, 2017, vol. 2018-January. doi: 10.1109/ICSITech.2017.8257168.

M. Stratigi, X. Li, K. Stefanidis, and Z. Zhang, “Ratings vs. Reviews in Recommender Systems: A Case Study on the Amazon Movies Dataset,†Communications in Computer and Information Science, vol. 1064, pp. 68–76, 2019, doi: 10.1007/978-3-030-30278-8_9/COVER.

J. Ni, J. Li, and J. McAuley, “Justifying Recommendations using Distantly-Labeled Reviews and Fine-Grained Aspects,†EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference, pp. 188–197, 2019, doi: 10.18653/V1/D19-1018.

N. T. Son, D. H. Dat, N. Q. Trung, and B. N. Anh, “Combination of Dimensionality Reduction and User Clustering for Collaborative-Filtering,†ACM International Conference Proceeding Series, pp. 125–130, Dec. 2017, doi: 10.1145/3168390.3168405.

F. Anowar, S. Sadaoui, and B. Selim, “Conceptual and empirical comparison of dimensionality reduction algorithms (PCA, KPCA, LDA, MDS, SVD, LLE, ISOMAP, LE, ICA, t-SNE),†Comput Sci Rev, vol. 40, p. 100378, May 2021, doi: 10.1016/J.COSREV.2021.100378.

S. M. Atif, S. Qazi, and N. Gillis, “Improved SVD-based initialization for nonnegative matrix factorization using low-rank correction,†Pattern Recognit Lett, vol. 122, pp. 53–59, May 2019, doi: 10.1016/J.PATREC.2019.02.018.

M. Gupta, A. Thakkar, Aashish, V. Gupta, and D. P. S. Rathore, “Movie Recommender System Using Collaborative Filtering,†Proceedings of the International Conference on Electronics and Sustainable Communication Systems, ICESC 2020, pp. 415–420, Jul. 2020, doi: 10.1109/ICESC48915.2020.9155879.

S. Suriati et al., “A Hybrid Approach using Collaborative filtering and Content based Filtering for Recommender System,†J Phys Conf Ser, vol. 1000, no. 1, p. 012101, Apr. 2018, doi: 10.1088/1742-6596/1000/1/012101.

R. Sallam, M. Hussein, H. M.-I. J. of, and undefined 2020, “An enhanced collaborative filtering-based approach for recommender systems,†researchgate.net, 2020, doi: 10.5120/ijca2020920531.

M. C. M. Oo and T. Thein, “An efficient predictive analytics system for high dimensional big data,†Journal of King Saud University - Computer and Information Sciences, vol. 34, no. 1, pp. 1521–1532, Jan. 2022, doi: 10.1016/J.JKSUCI.2019.09.001.

Downloads

Published

2023-01-28