Building Synonym Set for Indonesian WordNet using Commutative Method and Hierarchical Clustering

 (*)Valentino Rossi Fierdaus Mail (Telkom University, Bandung, Indonesia)
 Moch Arif Bijaksana (Telkom University, Bandung, Indonesia)
 Widi Astuti (Telkom University, Bandung, Indonesia)

(*) Corresponding Author

DOI: http://dx.doi.org/10.30865/mib.v4i3.2254

Abstract

WordNet is a compilation of Synonyms Set (synset), which consists of the words that have the same synonymous. The development of Indonesian WordNet has a goal to build an application that can accommodate and exhibit the relation of words. Synonym Set is a set composed of one or more words that have a similar meaning or synonym relation originated from the Indonesian Thesaurus. In previous studies, the establishment of synsets were transmitted with several approaches, one of which was the cluster ring to produce synsets and WSD (Word Sense Disambiguation). In this research, research is held up to discover the semantic similarities between words in the Indonesian Thesaurus automatically, and also to know the performance of the Agglomerative Hierarchical Clustering method for the development of Indonesian synsets. To calculate performance and evaluation, this research is using the F-measure method involving the gold standard

Keywords


WordNet, Synset, Indonesian Thesaurus, Agglomerative Hierarchical Clustering, F-Measure

Full Text:

PDF


Article Metrics

Abstract view : 89 times
PDF - 31 times

References

Gunawan, “Akuisisi Gloss Berbasis Ekstraksi Synonym Set Menggunakan Supervised Learning,” Institut Teknologi Sepuluh November, 2016.

A. Saputra and others, “Building synsets for Indonesian Wordnet with monolingual lexical resources,” in 2010 International Conference on Asian Language Processing, 2010, pp. 297–300.

G. A. Miller, “WordNet: a lexical database for English,” Commun. ACM, vol. 38, no. 11, pp. 39–41, 1995.

C. Fellbaum, “WordNet,” in Theory and applications of ontology: computer applications, Springer, 2010, pp. 231–243.

U. Indonesia, “WordNet Bahasa Indonesia,” 2008. http://bahasa.cs.ui.ac.id/ (accessed Jul. 26, 2019).

H. Hendrik and A. B. Cahyono, “Model WordNet Bahasa Indonesia berbasis Linked Data,” J. Nas. Tek. Elektro dan Teknol. Inf., vol. 6, no. 1, pp. 8–14, 2017.

J. Priyatno, “Clustering Synonym Sets in English WordNet,” Universitas Telkom, 2018.

D. J. Restina, “Pembangunan Synonym Set untuk WordNet Bahasa Indonesia dengan Menggunakan Metode Komutatif,” Indo-JC, vol. 4, no. 2, 2019.

I. P. P. Ananda, “Pembangunan Synsets untuk WordNet Bahasa Indonesia dengan Metode Komutatif,” Universitas Telkom, 2018.

K. Sasirekha and P. Baby, “Agglomerative hierarchical clustering algorithm-a,” Int. J. Sci. Res. Publ., vol. 83, p. 83, 2013.

D. Müllner, “Modern hierarchical, agglomerative clustering algorithms,” no. 1973, pp. 1–29, 2011.

L. D. Anggaraini, “Analisis Pembangunan Word Sense pada WordNet Bahasa Indonesia Menggunakan Metode Hierarchical Clustering,” Bandung, 2019.

T. Redaksi, “Tesaurus Bahasa Indonesia Pusat Bahasa,” Pus. Bahasa, Dep. Pendidik. Nas., 2008.

Y. Nan, K. M. Chai, W. S. Lee, and H. L. Chieu, “Optimizing F-measure: A tale of two approaches,” arXiv Prepr. arXiv1206.4625, 2012.

D. R. Musicant, V. Kumar, A. Ozgur, and others, “Optimizing F-Measure with Support Vector Machines.,” in FLAIRS conference, 2003, pp. 356–360.

Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Building Synonym Set for Indonesian WordNet using Commutative Method and Hierarchical Clustering

Refbacks

  • There are currently no refbacks.


Copyright (c) 2020 JURNAL MEDIA INFORMATIKA BUDIDARMA

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



JURNAL MEDIA INFORMATIKA BUDIDARMA
STMIK Budi Darma
Sekretariat : Jln. Sisingamangaraja No. 338 Telp 061-7875998
email : mib.stmikbd@gmail.com


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.