Pengklasifikasian Topik Hadits Terjemahan Bahasa Indonesia Menggunakan Latent Semantic Indexing dan Support Vector Machine
DOI:
https://doi.org/10.30865/mib.v2i4.948Abstract
Hadith is used as the source of Islamic law othen than Qur’an, Ijma, Ijtihad and Qiyas, hadith is the second of Islamic law after the Qur’an. This study attempted to build a system than can classify shahih hadith of Bukhari in Indonesian Translation. This topic was chosen to help Muslims who want to understand from each hadith is in the form of informations, prohibitions or suggestion. Support Vector Machine was chosen because it can perform classification by providing good performance for dataset with a large number of features. Latent Semantic Indexing as a feature selection method was chosen because it can reduce features by eliminationg unimportant features (noise term). This study also using Bootstrap Aggregating (Bagging) method to improve accuracy of the classification system. The accuracy results show that by using Latent Semantic Indexing and Bootstrap Aggregating on Support Vector Machine classification single label system is 84% on polynomial kernel and 84.67% on RBF kernelReferences
Achmad, R. & Indriati. Pengklasifikasian Dokumen Berbahasa Indonesia Dengan Pengindeksan Berbasis LSI. Jurnal Teknologi Informasi dan Ilmu Komputer (JTIIK), Oktober, Vol. 2, No. 2, hlm. 87-95. 2015.
Adiwijaya, Aulia, M.N., Mubarok, M.S., Novia, W.U. & Nhita, F. A comparative study of MFCC-KNN and LPC-KNN for hijaiyyah letters pronounciation classification system. In Information and Communication Technology (ICoIC7), 5th International Conference on (pp. 1-5). IEEE. 2017
Adiwijaya, Maharani, M., Dewi, B.K., Yulianto, F.A. & Purnama, B. Digital image compression using graph coloring quantization based on wavelet-SVD. In Journal of Physics: Conference Series (Vol. 423, No. 1, p. 012019). IOP Publishing. 2013.
Adiwijaya. Aplikasi Matriks Ruang Vektor. Graha Ilmu, Yogyakarta. 2014.
Al Mira, K. I., Mubarok, M. S., Nanang, S. H. & Adiwijaya. A Multi-label Classification on Topics of Quranic Verses in English Translation Using Tree Augmented Naïve Bayes. In 6th International Conference on Information and Communication Technology (ICoICT). IEEE. 2018.
Ana, C.C., Arlindo I. & Olivera. Combining LSI with other Classifiers to Improve Accuracy of Single-label Text. In First European Workshop on Latent Semantic Analysis in Technology Enhanced Learning, Netherlands. 2007.
Billy, E. W. A. Bagging Support Vector Machines for leukemia Classification. JURNAL IT: Media Informasi STMIK Handayani Makassar 15. 2014.
Christianini. N. & John, S. T. An Introduction to Support Vector Machines and Other Kernel-based Learning Merhods. Cambridge University Press. 2000.
Deerwester, S., et al. Improving Information Retrieval with Latent Semantic Indexing. Proceedings of the 51st Annual Meeting of the American Society for Information Science 25, pp. 36–40. 1988.
Diani R. Analisis Pengaruh Kernel Support Vector Machine (SVM) pada Klasifikasi Data Microarray untuk Deteksi Kanker. Indonesian Journal on Computing (Indo-JC), 2(1), pp. 109-118. 2017.
Durgesh, K.S. & Lekha, B. Data classification using support vector machine. Journal of Theoretical and Applied Information Technology, 12(1), pp.1-7. 2010.
Hai, J., Xiaoming, N., Hanhua & Zuoning, Y. Efficient Query Routing for Information Retrieval in Semantic Overlays. In Proceedings of the 2006 ACM Symposium on Applied Computing (SAC 2006), Dijon, France, April 23-27, ACM Press, pp. 1669-1673. 2006.
Husna, A. & Adiwijaya. A Clustering Approach for Feature Selection in Microarray Data Classification using Random Forest. Journal of Information Processing System 14(5). 2018.
Marwan, H., Sistem Pakar Menidentifikasi Hadits Menggunakan Menggunakan Metode Forward Chaining, Proc. ISBN. 978-602-17488- 1-7. 2016, paper SEMNASTIKOM 2016, p. 20. 2016.
Prangga, S. Optimasi Parameter pada Support Vector Machine menggunakan Pendekatan Metode Taguchi untuk Data High-Dimensional. Masters thesis. Institut Teknologi Sepuluh Nopember. 2017.
Ramana, B. V., Babu, S. P. & Venkateswarlu, N. B. A Critical Study Of Selected Classification Algorithms For Live Disease Diagnosis. International Journal Of Database Management Systems, Vol. 3(2). 2011.
Reynaldi, A. P., Mubarok, M. S., Nanang, S. H. & Adiwijaya. A Multi-lable Classification on Topics of Quranic Verses in English Translation using Multinomial Naive Bayes. In 6th International Conference on Information and Communication Technology (ICoICT). IEEE. 2018.
Wulandini, F. & Nugroho, A. N. Text Classification Using Support Vector Machine for Web Mining Based Spation Temporal Analuysis of the Spread of Tropical Diseases. International Conference on Rural Information and Communication Technology. 2009.
Yang, Y. & Liu, X. A re-examination of text categorization methods. In Proceedings of the Twenty-Second International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), pages 42-49. 1999.
Ziqiang, W. & Dexian, Z. Feature Selection in Text Classification Via SVM and LSI. Springer Verlag Berlin Heidelberg, LNCS 3971, pp. 1381–1386. 2006.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).