Prediksi Harga Mobil Global Menggunakan Machine Learning dengan Algoritma Naive Bayes
DOI:
https://doi.org/10.30865/jurikom.v12i6.9320Keywords:
Machine Learning, Naive Bayes, Car Price Prediction, Data Mining, Global Car SalesAbstract
Determining car prices is one of the major challenges in the global automotive industry because it is influenced by various factors such as technical specifications, vehicle condition, and market dynamics. This issue becomes more complex as the volume of available data increases, requiring methods capable of performing fast and accurate analysis. This study aims to predict car price levels based on vehicle specifications using a Machine Learning approach, with the Naive Bayes algorithm selected as a solution to simplify the price classification process on large-scale data. The dataset used is the Global Car Sales Analysis from the Kaggle platform, which includes attributes such as Manufacturer, Model, Engine size, Fuel type, Year of manufacture, Mileage, and Price. The research methodology consists of data preprocessing, label encoding for categorical attributes, splitting the dataset into training and testing sets, and applying the Naive Bayes algorithm to classify car prices into three categories: Low, Medium, and High. The results indicate that Naive Bayes is capable of predicting car prices with very strong performance, achieving an accuracy of 96%, precision of 0.97, recall of 0.96, and an F1-score of 0.96. The model performs best on the Low category with an F1-score of 0.98, although performance decreases for the Medium and High categories due to imbalanced class distribution. Further analysis also reveals that Engine size, Year of manufacture, and Mileage are the most influential attributes in determining price. Overall, this study demonstrates that Naive Bayes is an effective method for predicting car prices using global automotive data.
References
[1] M. Z. Ahmad, Muhammad; Farooq, Muhammad Ali; Hussain, M. Z. Hasan, M. Muzzamil, and A. Khalid, “Car Price Prediction using Machine Learning,” 2024 IEEE 9th Int. Conf. Converg. Technol., 2024, [Online]. Available: 10.1109/I2CT61223.2024.10544124.
[2] M. Poorv, Y. K. . Gupta, and A. K. Sharma, “Evaluating machine learning models for used car price estimation: a comparative study,” 2nd Int. Conf. Pervasive Comput. Adv. Appl. (PerCAA 2024), 2024, [Online]. Available: 10.1049/icp.2025.0793.
[3] M. Devanda, H. Kusuma, and S. Hidayat, “Penerapan Model Regresi Linier dalam Prediksi Harga Mobil Bekas di India dan Visualisasi dengan Menggunakan Power Abstrak,” vol. 5, no. 2, pp. 1097–1110, 2024.
[4] E. Gegic, B. Isakovic, D. Keco, Z. Masetic, and J. Kevric, “Car Price Prediction using Machine Learning Techniques,” pp. 113–118, 2019, doi: 10.18421/TEM81-16.
[5] B. E. Putro and D. Indrawati, “Data Mining Analytics Application for Estimating Used Car Price During the Covid-19 Pandemic in Indonesia,” vol. 6869, 2019, doi: 10.23917/jiti.v21i2.18975.
[6] J. Yang, J. Kim, H. Ryu, J. Lee, and C. Park, “Predicting Car Rental Prices : A Comparative Analysis of Machine Learning Models,” Electronics, pp. 1–20, 2024, [Online]. Available: https://doi.org/10.3390/electronics13122345.
[7] V. Nakhipova, Y. Kerimbekov, Z. Umarova, L. Suleimenova, and S. Botayeva, “Use of the Naive Bayes Classifier Algorithm in Machine Learning for Student Performance Prediction,” Int. J. Inf. Educ. Technol., vol. 14, no. 1, 2024, doi: 10.18178/ijiet.2024.14.1.2028.
[8] E. K. Ampomah, G. Nyame, P. C. Addo, and M. Gyan, “Stock Market Prediction with Gaussian Naïve Bayes Machine Learning Algorithm,” Inform., vol. 45, pp. 243–256, 2021.
[9] R. Syahputra, G. J. Yanris, and D. Irmayani, “SVM and Naïve Bayes Algorithm Comparison for User Sentiment Analysis on Twitter,” Sink. J. dan Penelit. Tek. Inform., vol. 7, no. 2, pp. 671–678, 2022.



