Yutthana Pirunsarn. Bilingual machine readable dictionary extraction and concept tree construction for Thai wordnet development. Master's Degree(Computer Science). Mahidol University. Mahidol University Library and Knowledge Center. : Mahidol University, 2009.
Bilingual machine readable dictionary extraction and concept tree construction for Thai wordnet development
Abstract:
This thesis describes an approach that supports the automatic construction and development of Thai WordNet. The approach consists of two parts: the bilingual machine readable dictionaries extraction and the concept tree construction. In the extraction process, different MRDs representing different file formats were taken into account. These MRDs were reformatted, cleaned, integrated and selected to obtain a set of Thai nominal words. In the construction process, these Thai words were analyzed with regards to a primary hypothesis which states that words with the same prefix are likely to be related to others in hierarchical arrangement. The relationships between Thai words were constructed and represented in the form of the concept tree. We also developed a software tool to help evaluate the correctness of the concept tree. The tool allows two groups of users to evaluate the same set of trees. The result of evaluation was analyzed by using the Kappa statistic. The Kappa analysis showed that our approach was promising