IRJET- Identification of Text Similarity Based On Context

2021, IRJET

Abstract

Text similarity computation plays an important role in natural language processing. Similarity calculation for short texts suffers from the small number of word features they contain, so its accuracy is low. A common improvement is therefore to calculate the similarity of short texts using word-level semantic similarity. The word similarity calculation method combines two word semantic similarity measures by some strategy. Instead of doing only a word-for-word comparison, we also need to pay attention to context in order to capture more of the semantics. Calculating the similarity between texts written in English is still one of the most important challenges facing natural language processing. The proposed system finds the similarity between two English texts using three similarity measures: a semantic similarity measure, a cosine similarity measure, and N-grams. In the proposed system we design an English Semantic Net that stores the keywords for a specific field; with this network we can find the semantic similarity between words according to specific equations.
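As an illustration of two of the measures named in the abstract, the following is a minimal sketch that computes cosine similarity over word N-gram counts. It is not the authors' implementation: the function names (`tokenize`, `ngrams`, `cosine_similarity`, `ngram_cosine`), the regular-expression tokenizer, and the use of raw counts as vector weights are all assumptions made for this example. The Semantic Net component is not covered, since the abstract does not give its equations.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens (assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def ngrams(tokens, n):
    """Return the contiguous word n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse count vectors stored as Counters."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        # An empty text has no direction; treat it as dissimilar to everything.
        return 0.0
    return dot / (norm_a * norm_b)


def ngram_cosine(text_a, text_b, n=1):
    """Cosine similarity between the word n-gram count vectors of two texts."""
    vec_a = Counter(ngrams(tokenize(text_a), n))
    vec_b = Counter(ngrams(tokenize(text_b), n))
    return cosine_similarity(vec_a, vec_b)


if __name__ == "__main__":
    s1 = "The cat sat on the mat"
    s2 = "The cat lay on the mat"
    print(ngram_cosine(s1, s2, n=1))  # unigram overlap
    print(ngram_cosine(s1, s2, n=2))  # bigram overlap, more sensitive to word order
```

With `n=1` the score reflects shared vocabulary only; larger `n` adds some sensitivity to local word order, which is one simple way to bring context into the comparison. Neither setting captures synonymy, which is the gap a word-level semantic measure is meant to fill.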
