Header menu link for other important links
X
Word sense disambiguation in Tamil using indo-wordnet and cross-language semantic similarity
D. Karuppaiah,
Published in Inderscience Publishers
2021
Volume: 8
   
Issue: 1
Pages: 62 - 73
Abstract
Word sense disambiguation is the way to compute the correct sense of a word. It is considered as one of the important subtasks in natural language processing, machine translation and information retrieval. WSD found improving the overall performances of these systems. The job of WSD is to eliminate all senses of a word except the appropriate one as per the given context. The work in Tamil linguistics domain for information retrieval or natural language processing is very less. WSD can be performed in supervised and unsupervised manner. Here, we have proposed an unsupervised approach to disambiguate Tamil words in a given context using the context words and their dictionary gloss definitions. We have proposed two variants of our approach. The first approach uses the number of word overlapping between the glosses of context words whereas the second one uses the similarity between the glosses of context words with that of the ambiguous word. The second one found best among the two. For our approach, we have used Tamil Indo-WordNet, Oxford Tamil Dictionary and English WordNet dictionary glosses. Our method achieves better result in recognising correct senses in Tamil text. Copyright © 2021 Inderscience Enterprises Ltd.
About the journal
JournalInternational Journal of Intelligent Enterprise
PublisherInderscience Publishers
ISSN17453232