
Semantic Similarity Calculation

20 papers
54 followers
About this topic
Semantic similarity calculation is the process of quantifying the degree of similarity in meaning between two or more linguistic entities, such as words, phrases, or texts, using various computational methods and models. This field integrates concepts from linguistics, computer science, and artificial intelligence to enhance natural language processing applications.
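To make the definition above concrete, the toy sketch below (not taken from any of the papers indexed here) scores sentence pairs with plain token overlap. Because the second pair shares meaning but almost no vocabulary, the naive score collapses to zero, which is the gap the knowledge-based and corpus-based methods surveyed below aim to close.

```python
# Toy baseline: Jaccard overlap of token sets. Purely lexical, so paraphrases
# with disjoint vocabulary score near zero, motivating semantic measures.
def jaccard_similarity(text_a: str, text_b: str) -> float:
    tokens_a = set(text_a.lower().split())
    tokens_b = set(text_b.lower().split())
    if not tokens_a and not tokens_b:
        return 1.0
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)

print(jaccard_similarity("a quick glance at the report", "a quick look at the report"))  # ~0.71
print(jaccard_similarity("physicians treat patients", "doctors care for the sick"))      # 0.0
```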

Key research themes

1. How can ontology and lexical taxonomy structures improve semantic similarity and relatedness measurement?

This research area focuses on exploiting structured knowledge bases, such as WordNet and domain-specific ontologies, to calculate semantic similarity and relatedness. These methods leverage hierarchical relationships (hypernymy/hyponymy), synonyms, and sometimes meronymy to compute similarity measures that reflect human-like semantic closeness. The importance lies in achieving interpretable, knowledge-driven similarity metrics that outperform purely corpus-based methods in precision and enable applications such as ontology matching, information retrieval, and word sense disambiguation.

Key finding: Proposed novel edge-counting search algorithms (BDLS and UBFS) incorporating syn/antonym, hyper/hyponym, and hol/meronym links in the WordNet taxonomy with differentiated weights, achieving high correlation (0.921) with human...
Key finding: Presented SemSimp, a parametric semantic similarity method leveraging information content and weighted ontologies derived from both digital resource datasets and ontology structure; extensive evaluation shows it outperforms...
Key finding: Introduced an information content-based approach combining corpus statistics with WordNet's taxonomy for semantic similarity in information retrieval; demonstrated that incorporating the information content of the lowest...
Key finding: Provided a comprehensive review distinguishing knowledge-based and distributional methods for computing semantic similarity of words and word senses; emphasized the importance of knowledge bases like WordNet to represent...
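As a concrete illustration of the taxonomy-based measures this theme describes, the sketch below uses NLTK's WordNet interface to score a word pair with a path-based measure (Wu-Palmer) and an information-content measure (Resnik). NLTK and the Brown-corpus IC file are assumptions of this sketch; it is not a reimplementation of BDLS, UBFS, SemSimp, or any other specific method cited above.

```python
# Minimal sketch of taxonomy-based similarity using NLTK's WordNet interface.
# Assumes the 'wordnet' and 'wordnet_ic' corpora have been downloaded, e.g.
# nltk.download('wordnet'); nltk.download('wordnet_ic').
from nltk.corpus import wordnet as wn
from nltk.corpus import wordnet_ic

# Information-content statistics estimated from the Brown corpus.
brown_ic = wordnet_ic.ic("ic-brown.dat")

def taxonomy_similarity(word1: str, word2: str):
    """Return (Wu-Palmer, Resnik) scores over the best-matching noun-sense pair."""
    best_wup, best_res = 0.0, 0.0
    for s1 in wn.synsets(word1, pos=wn.NOUN):
        for s2 in wn.synsets(word2, pos=wn.NOUN):
            wup = s1.wup_similarity(s2) or 0.0     # depth-based, uses the taxonomy only
            res = s1.res_similarity(s2, brown_ic)  # IC of the lowest common subsumer
            best_wup = max(best_wup, wup)
            best_res = max(best_res, res)
    return best_wup, best_res

print(taxonomy_similarity("car", "automobile"))  # near-synonyms: high scores
print(taxonomy_similarity("car", "banana"))      # unrelated: low scores
```

Edge-counting variants such as NLTK's path_similarity or lch_similarity can be swapped into the same loop to approximate other measures from this family.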

2. What corpus-based and distributional semantic models best capture semantic textual similarity in practical applications?

This line of research investigates methods that use statistical information from large corpora and distributional semantics to compute semantic similarity of words, sentences, or documents. These approaches rely on co-occurrence patterns, word embeddings, and vector space models to model meaning based on context and usage frequencies. They aim to deliver scalable, domain-independent solutions often used in natural language processing tasks such as semantic textual similarity, document clustering, and short text similarity.

Key finding: Surveyed semantic textual similarity approaches spanning topological (WordNet-based), statistical, and string-based methods; proposed a novel sentence similarity method integrating WordNet synsets with uni-gram language...
Key finding: Compared three semantic similarity methods (cosine similarity with tf-idf vectors, cosine similarity with word embeddings, and soft cosine similarity with word embeddings) for short news text; found that cosine similarity using...
Key finding: Applied distributional vector space models including Random Indexing and Latent Semantic Analysis to semantic textual similarity tasks, demonstrating consistent outperformance over baseline metrics; additionally introduced...
Key finding: Evaluated semantic similarity models within constrained and dynamic IoT/MEC/5G environments, showing that a distributional profile-based semantic model achieved competitive results compared to state-of-the-art corpus-based...
Key finding: Developed a semantic relatedness measure leveraging the Web as a knowledge source through search engine frequency data; demonstrated domain-independence and universality by outperforming traditional lexical-resource-bound...
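To ground this theme, here is a minimal baseline of the kind compared in the findings above: cosine similarity over tf-idf vectors. scikit-learn is an assumption of this sketch rather than a toolkit used by the cited papers, and the three sentences are toy data.

```python
# Minimal corpus-based textual similarity baseline: cosine over tf-idf vectors.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

documents = [
    "The central bank raised interest rates again this quarter.",
    "Interest rates were increased by the central bank once more.",
    "The football team won the championship after a late goal.",
]

# Fit a tf-idf model on the toy corpus and vectorize every document.
vectorizer = TfidfVectorizer(lowercase=True, stop_words="english")
tfidf = vectorizer.fit_transform(documents)

# Pairwise cosine similarities; documents 0 and 1 should score highest.
scores = cosine_similarity(tfidf)
print(scores.round(3))
```

Swapping the tf-idf vectors for averaged word embeddings, or using a soft-cosine kernel over pairwise embedding similarities, yields the other two variants mentioned in the comparison above.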

3. Can lexico-syntactic pattern-based and hybrid lexical-corpus methods provide effective semantic similarity without reliance on hand-crafted knowledge bases?

This theme explores semantic similarity measures derived from automatically harvested lexical patterns and statistical co-occurrence, often implemented via pattern extraction or web-based statistics. The goal is to achieve wide coverage and reasonable precision without depending on curated resources like WordNet, which have limited domain coverage. These methods facilitate scalable semantic similarity computation applicable to named entity similarity, relation extraction, and semantic search.

Key finding: Proposed PatternSim, a corpus-based semantic similarity measure that exploits a rich set of lexico-syntactic finite-state transducer patterns to extract semantic relations from large corpora; achieved correlations up to 0.739...
Key finding: Developed an automatic method combining web search engine page counts with a novel pattern extraction and clustering algorithm to compute word semantic similarity; integration with support vector machines optimized the...
Key finding: Employed a supervised regression model combining lexical, syntactic, and semantic metrics such as named entity preservation and predicate-argument alignments to predict sentence-level semantic similarity; demonstrated that...
Key finding: Adapted a textual entailment system to compute graded semantic textual similarity by combining multiple WordNet-based word-to-word similarity measures aggregated at sentence level; results indicate the potential of...
Key finding: Introduced a novel approach leveraging historical digitized book corpora to compute semantic similarity between words by statistically comparing their occurrence patterns over specific historical windows; preliminary findings...
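The sketch below illustrates the two ingredients this theme combines: co-occurrence statistics expressed as page-count measures and a lexico-syntactic pattern for harvesting relations. The page counts are placeholder numbers (no search-engine API is called), and the single regular expression is a toy stand-in for the finite-state-transducer patterns of systems such as PatternSim.

```python
# Sketch of web-count-based similarity measures plus a toy lexico-syntactic pattern.
# All counts below are placeholders, not real search-engine results.
import math
import re

def web_jaccard(count_p: int, count_q: int, count_pq: int) -> float:
    """Jaccard coefficient estimated from page counts for P, Q, and 'P AND Q'."""
    denom = count_p + count_q - count_pq
    return count_pq / denom if denom > 0 else 0.0

def web_pmi(count_p: int, count_q: int, count_pq: int, total_pages: int) -> float:
    """Pointwise mutual information estimated from page counts."""
    if min(count_p, count_q, count_pq) == 0:
        return 0.0
    return math.log2((count_pq / total_pages) /
                     ((count_p / total_pages) * (count_q / total_pages)))

# Placeholder counts for a word pair such as ("car", "automobile").
print(web_jaccard(4_000_000, 1_500_000, 900_000))
print(web_pmi(4_000_000, 1_500_000, 900_000, 10_000_000_000))

# A toy "X such as Y" pattern applied to a text snippet, in the spirit of
# pattern-based relation extraction.
pattern = re.compile(r"(\w+) such as (\w+)")
snippet = "vehicles such as cars are taxed differently from bicycles"
print(pattern.findall(snippet))  # [('vehicles', 'cars')]
```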

All papers in Semantic Similarity Calculation

Semantic similarity measurement between words is a tedious task in web mining, information extraction and natural language processing. The semantic similarity measurement between entities is required in Web mining applications such as...
by Qin Lu
Statistics-based collocation extraction approaches suffer from (1) a low precision rate, because high co-occurrence bi-grams may be syntactically unrelated and are thus not true collocations; and (2) a low recall rate, because some true...
A recommender system is a subclass of information filtering system. It identifies similarity among users or items and can be used as an information filtering tool in online social networks. Collaborative filtering recommendations are based on...
We describe how to benefit from broad cultural trends through the quantitative analysis of a vast digital book collection representing the digested history of humanity. Our research has revealed that appropriately comparing...
This paper identifies the factors that have an impact on mobile recommender systems. Recommender systems have become a technology that has been widely used by various online applications in situations where there is an information...
Semantic relation is an important concept in information science. Nowadays it is widely used in the semantic web. This paper aims to present a measure to automatically determine the semantic relation between words using the web as a knowledge source....
Despite challenges like concept drift or temporal dynamics, recommender systems (RS) have grown in popularity due to their usefulness in meeting customers' needs by helping them find things they might like based on past purchases and interests. Despite...
Measuring similarity between words using a search engine based on page counts alone is a challenging task. Search engines consider a document as a bag of words, ignoring the position of words in a document. In order to measure semantic...
Semantic similarity is a central concept that extends across numerous fields such as artificial intelligence, natural language processing, cognitive science and psychology. Accurate measurement of semantic similarity between words is...
Matrix-factorization (MF) based models have become popular for building collaborative filtering (CF) recommender systems due to their high accuracy and scalability. Most current matrix-factorization models do not have acceptable...
Computing the textual similarity between terms (or short text expressions) that have the same meaning but which are not lexicographically similar is a key challenge in many computer-related fields. The problem is that traditional...
Collaborative filtering is one of the most used approaches for providing recommendations in various online environments. Even though collaborative recommendation methods have been widely utilized due to their simplicity and ease of use,...
Extracting a subset of a given OWL ontology that captures all the ontology's knowledge about a specified set of terms is a well-understood task. This task can be based, for instance, on locality-based modules (LBMs). These come in two...
Description-logic-based languages have become the standard representation scheme for ontologies. They formalize domain knowledge using interrelated concepts contained in terminologies. The manual definition of terminologies is an...
Recommender systems are typically provided as Web 2.0 services and are part of the range of applications that support large-scale social networks, enabling on-line recommendations to be made based on the use of networked...
Recommender systems are efficient and widely used tools that address the information overload problem, providing users with the most appropriate content by considering their personal preferences (mostly ratings). In addition to...
Clustering is one of the successful model-based collaborative filtering techniques that deals with the problem of sparsity and provides quality recommendations. In the proposed work, the fuzzy c-means clustering technique is...
Estimating word relatedness is essential in natural language processing (NLP) and in many other related areas. Corpus-based word relatedness has its advantages over knowledge-based supervised measures. There are many corpus-based...
The information available on the web is increasing daily. Searching the web is difficult because of the huge volume of data, and a drawback of keyword search is that it simply mines data based on the keyword given...
Collaborative filtering (CF) is the personalized recommendation algorithm most widely used in e-commerce. CF still needs to be improved so that it can make adequate recommendations and solve problems such as...
The rapid growth of Internet technologies and the availability of web tools have created an opportunity to develop a robust and user-friendly web service model for medical care, and it demands urgent solutions as the uncertainty of disease spread...
Semantic similarity measures between words play an important role in information retrieval, natural language processing and various tasks on the web. In this paper, we have proposed a Modified Pattern Extraction Algorithm to compute...
The proliferation of the Internet has made people rely on virtual recommendations. Recommender systems help by giving important recommendations. Collaborative filtering is the most successful and widely used approach in designing...
A recommendation system is a subclass of information filtering system. It identifies similarity among users or items and can be used as an information filtering tool in online social networks. Collaborative filtering recommendations are based...
Semantic similarity between words is fundamental to various fields such as Cognitive Science, Artificial Intelligence, Natural Language Processing and Information Retrieval. According to Baeza-Yates and Neto [2], an Information Retrieval...
Semantic similarity plays a significant role in the areas of Web mining, Information Retrieval, NLP and Text mining. Even though it is exploited in various applications, accurately measuring semantic similarity still remains a challenging...
Recommender Systems (RSs) are software tools and techniques that are used to produce recommendations for the users of a certain application in such a way that the recommendations generated are likely to be liked by the users. Popular...
Over the past few decades, various recommendation system paradigms have been developed for both research and industrial purposes to satisfy the needs and preferences of users when they deal with enormous data. The collaborative filtering...
Finding relevant scholarly papers is an important task for researchers. Such a literature search involves identifying drawbacks in existing works and proposing new approaches that address them. However, the growing number of scientific...
Semantic similarity measures play vital roles in information retrieval, natural language processing and paraphrase detection. With the growing number of plagiarism cases in both the commercial and research communities, designing efficient tools and...
In this paper, we propose two exponential similarity measures for collaborative filtering in recommender systems. The proposed similarity measures are used to estimate the distance between two users or items. Furthermore, an algorithm is...
In this paper, we build a hybrid Web-based metric for computing semantic relatedness between words. The method exploits page counts, titles, snippets and URLs returned by a Web search engine. Our technique uses traditional information...