Academia.eduAcademia.edu

Outline

Assamese Word Sense Disambiguation using Genetic Algorithm

2020

Abstract

Word sense disambiguation (WSD) is a problem to determine a word according to the context in which it occurs. There are plenty amount of works done in WSD for some languages such as English, but research work on Assamese WSD remains limited. It is a more exigent task because Assamese has an intrinsic complexity in its writing structure and ambiguity, such as syntactic, semantic, and anaphoric ambiguity levels. A novel unsupervised genetic word sense disambiguation algorithm is proposed in this paper. The algorithm first uses WordNet to extract all possible senses for a given ambiguous word, then a genetic algorithm is used taking Wu-Palmer’s similarity measure as the fitness function and calculating the similarity measure for all extracted senses. The winner sense which will have the highest score declared as the winner sense.

FAQs

sparkles

AI

What unique approach does the study employ for Assamese WSD?add

The paper proposes a novel Word Sense Disambiguation system using genetic algorithms for Assamese, achieving 81.25% precision and 74.28% recall.

How does the genetic algorithm enhance sense disambiguation effectiveness?add

The genetic algorithm optimizes sense selection by maximizing overall semantic similarity using Wu-Palmer's similarity measure as a fitness function.

What challenges were identified in disambiguating Assamese words?add

Challenges included the scarcity of synonymous sense definitions in Assamese WordNet, and handling spelling errors adversely affecting performance.

How does the performance of genetic algorithms compare with supervised methods?add

The proposed genetic algorithm outperformed traditional supervised methods, illustrated by a comparison yielding significant improvements in precision metrics.

What limitations affected the experiment's sentence data quality?add

Short sentence lengths and contextual word variations hindered data retrieval and similarity measurement accuracy during experiments.

References (16)

  1. Refrences Deborah Gail Tatar. 2005. Word Sense Dis- ambiguation by Machine Learning Approach. Fundamental Informaticae, pages 433-442.
  2. ChunHui Zhang, Yiming Zhou, and Trevor Martin. 2008. Genetic Word Sense Disam- biguation Algorithm. Second International Symposium on Intelligent Information Tech- nology Application, pages 123-127.
  3. Sruthi Sankar, Reghu Raj, Jayan. 2016.
  4. Unsupervised Approach to Word Sense Disambiguation in Malayalam. In textitpro- ceedings of the International Conference on Emerging Trends in Engineering, Science and Technology, pages 1507 -1513.
  5. Zankhana Vaishnav and Priti Sajja. 2019.
  6. KnowledgeBased Approach for Word Sense Disambiguation Using Genetic Algorithm for Gujarati. Information and Communication Technology for Intelligent Systems, pages 485-45.
  7. Devendra Tayal, Leena Ahuja, Shreya Chhabra. 2015. Word Sense Disambiguation in Hindi Language Using Hyperspace Analogue to Language and Fuzzy C-Means Clustering. In Proceedings of the 12th International Conference on Natural Language Processing, pages 49-58.
  8. Shabnam Sangwan and Paramjit Singh. 2013. Genetic Algorithm Based Hindi word sense disambiguation. International Journal of Computer Science and Mobile Computing , 2(5):139-144.
  9. Kalyani Mamulkar and Lokesh Nandanwar. 2020. Marathi Word Sense Disambiguation Using Genetic Algorithm. In IJACEN , pages 16-18.
  10. Mohana Priya, Krishna Priya, Geetha Harini, Ragavi. 2018. Handling WSD using Fuzzy Hierarchical Clustering of Sentence Level Text. International Journal of Research in Advent Technology, 6(11):3209 -3214.
  11. Alok Ranjan Pal and Diganta Saha. 2019. Word Sense Disambiguation in Bengali lan- guage using unsupervised methodology with modifications. Sådhanå, pages 44:168.
  12. Jumi Sarmah and Shikhar Kumar Sarma. 2016. Decision Tree based Supervised Word Sense Disambiguation for Assamese. International Journal of Computer Applications, 141(1):42- 48.
  13. Pranjal Protim Borah, Gitimoni Talukdar, Arup Baruah. 2014. Assamese Word Sense Disambiguation using Supervised Learning. International Conference on Contemporary Computing and Informatics, pages 946-950.
  14. Purabi Kalita and Anup Kumar Barman. 2015. Implementation of Walker algorithm in Word Sense Disambiguation for Assamese language. In ISACC, pages 136-140.
  15. Thabet Slimani. 2013. Description and Evaluation of Semantic Similarity Measures Approaches. International Journal of Com- puter Applications, 80(10):25-33.
  16. Nomi Baruah, Arjun Gogoi, Shikhar kumar Sarma, Randeep Borah. 2020. Utilizing corpus statistics for Assamese word sense disambiguation. CoCoNET'2020.