Building Synonym Set for Indonesian WordNet using Commutative Method and Hierarchical Clustering

Valentino Rossi Fierdaus; Moch Arif Bijaksana; Widi Astuti

doi:10.30865/mib.v4i3.2254

Authors

Valentino Rossi Fierdaus Telkom University, Bandung
Moch Arif Bijaksana Telkom University, Bandung
Widi Astuti Telkom University, Bandung

DOI:

https://doi.org/10.30865/mib.v4i3.2254

Keywords:

WordNet, Synset, Indonesian Thesaurus, Agglomerative Hierarchical Clustering, F-Measure

Abstract

WordNet is a compilation of Synonyms Set (synset), which consists of the words that have the same synonymous. The development of Indonesian WordNet has a goal to build an application that can accommodate and exhibit the relation of words. Synonym Set is a set composed of one or more words that have a similar meaning or synonym relation originated from the Indonesian Thesaurus. In previous studies, the establishment of synsets were transmitted with several approaches, one of which was the cluster ring to produce synsets and WSD (Word Sense Disambiguation). In this research, research is held up to discover the semantic similarities between words in the Indonesian Thesaurus automatically, and also to know the performance of the Agglomerative Hierarchical Clustering method for the development of Indonesian synsets. To calculate performance and evaluation, this research is using the F-measure method involving the gold standard

Author Biographies

Valentino Rossi Fierdaus, Telkom University, Bandung

Faculty of Informatics, Bachelor of Informatics Engineering

Moch Arif Bijaksana, Telkom University, Bandung

Faculty of Informatics, Bachelor of Informatics Engineering

Widi Astuti, Telkom University, Bandung

Faculty of Informatics, Bachelor of Informatics Engineering

References

Gunawan, â€œAkuisisi Gloss Berbasis Ekstraksi Synonym Set Menggunakan Supervised Learning,â€ Institut Teknologi Sepuluh November, 2016.

A. Saputra and others, â€œBuilding synsets for Indonesian Wordnet with monolingual lexical resources,â€ in 2010 International Conference on Asian Language Processing, 2010, pp. 297â€“300.

G. A. Miller, â€œWordNet: a lexical database for English,â€ Commun. ACM, vol. 38, no. 11, pp. 39â€“41, 1995.

C. Fellbaum, â€œWordNet,â€ in Theory and applications of ontology: computer applications, Springer, 2010, pp. 231â€“243.

U. Indonesia, â€œWordNet Bahasa Indonesia,â€ 2008. http://bahasa.cs.ui.ac.id/ (accessed Jul. 26, 2019).

H. Hendrik and A. B. Cahyono, â€œModel WordNet Bahasa Indonesia berbasis Linked Data,â€ J. Nas. Tek. Elektro dan Teknol. Inf., vol. 6, no. 1, pp. 8â€“14, 2017.

J. Priyatno, â€œClustering Synonym Sets in English WordNet,â€ Universitas Telkom, 2018.

D. J. Restina, â€œPembangunan Synonym Set untuk WordNet Bahasa Indonesia dengan Menggunakan Metode Komutatif,â€ Indo-JC, vol. 4, no. 2, 2019.

I. P. P. Ananda, â€œPembangunan Synsets untuk WordNet Bahasa Indonesia dengan Metode Komutatif,â€ Universitas Telkom, 2018.

K. Sasirekha and P. Baby, â€œAgglomerative hierarchical clustering algorithm-a,â€ Int. J. Sci. Res. Publ., vol. 83, p. 83, 2013.

D. MÃ¼llner, â€œModern hierarchical, agglomerative clustering algorithms,â€ no. 1973, pp. 1â€“29, 2011.

L. D. Anggaraini, â€œAnalisis Pembangunan Word Sense pada WordNet Bahasa Indonesia Menggunakan Metode Hierarchical Clustering,â€ Bandung, 2019.

T. Redaksi, â€œTesaurus Bahasa Indonesia Pusat Bahasa,â€ Pus. Bahasa, Dep. Pendidik. Nas., 2008.

Y. Nan, K. M. Chai, W. S. Lee, and H. L. Chieu, â€œOptimizing F-measure: A tale of two approaches,â€ arXiv Prepr. arXiv1206.4625, 2012.

D. R. Musicant, V. Kumar, A. Ozgur, and others, â€œOptimizing F-Measure with Support Vector Machines.,â€ in FLAIRS conference, 2003, pp. 356â€“360.

Building Synonym Set for Indonesian WordNet using Commutative Method and Hierarchical Clustering

Authors

DOI:

Keywords:

Abstract

Author Biographies

Valentino Rossi Fierdaus, Telkom University, Bandung

Moch Arif Bijaksana, Telkom University, Bandung

Widi Astuti, Telkom University, Bandung

References

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Menu Utama

flagcounter

template

statcounter

rji

terindex