Information Retrieval on Text using Concept Similarity
Abstract
— Retrieving proper information from internet is a huge task due to the high amount of information available there. Identifying the individual concepts according to the queries is time consuming. To retrieve documents, keyword based retrieval method was used before. Using this type searching, the relationship between associated keywords can't be identified. If the same concept is described by different keywords, inaccurate and improper results will be retrieved. Concept based retrieval methods are the solution for this scenario. This gives the benefit of getting semantic relationships among concepts in finding relevant documents. Irrelevant documents can be eliminated by detecting conceptual mismatches, which is another benefit obtained from this. The main challenges identified are the ambiguity occurring due to multiple nature of words for the same concepts. Semantic analysis can reveal the conceptual relationships among words in a given document. In this paper the potential of concept-based information access via semantic analysis is explored with the help of a lexical database called WordNet. The mechanism is applied in the selected text documents and extracting the Synonym, Hyponym, Hypernym of each word from WordNet. The ranking will be calculated after checking the frequency rate of each word in the input documents and a hierarchy model will be generated according to the ranking.
References (13)
- Hele-Mai Haav, An Application of Inductive Concept Analysis to Construction of Domain-specific Ontologies. Akadeemia tee 21, 12618 Tallinn, Estonia ...
- W.Bruce Croft, What Do People Want from Information Retrieval? www.dlib.org dlib /november95 /11croft.html.
- Urvi Shah, Tim Finin, Anupam Joshi, R. Scott Cost, James Mayfield, Information Retrieval on the Semantic web, http://www.csee.umbc.edu/~finin//papers/cikm02/cikm02.pdf.
- Rifat Ozcan, Y. Alp Aslandogan, Concept-based Information Retrieval Using Ontologies and Latent Semantic Analysis, www.cse.uta.edu/research/pblications/Downloads/CSE-2004- 8.pdf
- Hele-Mai Haav,,Tanel-Lauri Lubi, A Survey of Concept-based Information Retrieval Tools on the Web, 5th East-European Conference, ADBIS 2001 Vilnius, Lithuania: (2001) .
- Ide, N., J.Véronis. Word Sense Disambiguation: The State of the Art. Special issue of Computational linguistics on Word Sense Disambiguation, 24:1, Pages 1-40, 1998.
- Christian Safran, A Concept-Based Information Retrieval Approach for User-oriented Knowledge Transfer, Master's Thesis, 10th December 2005.
- Fensel, D]2001], Ontologies: Silver bullet for knowledge management and electronic commerce. Springer-Verlag, Berlin.
- Asuncion Gomez Perez and V. Richard Benjamins [1999], Overview of Knowledge Sharing and Reuse Components: Ontologies and Problem-Solving Methods. IJCAI-Workshop on Ontologies and Problem-Solving Methods: Lessons Learned and Future Trends.
- R. Bodner and F. Song[1996], -Knowledge-based Approaches to Query Expansion in Information Retrieval,‖ in Proc. of Advances in Artificial Intelligence, pp. 146-158, New York,Springer.
- Lopez, M.F.[1999], "Overview of the methodologies for building ontologies". Proceedings of the IJCAI-99 Workshop on Ontologies and Problem-Solving Methods (KRR5), Stockholm, Sweden, August.
- G.Madhu and Dr.A.Govardhan Dr.T.V.Rajinikanth[2011] , Intelligent Semantic Web Search Engines: A Brief Survey.
- Henrick Bulskov Styltsvig.Ontology based Information Retrieval , http://coitweb.uncc.edu/~ras/RS/Onto-Retrieval.pdf.