Key research themes
1. How can ontology- and taxonomy-based models enhance semantic similarity judgment in structured lexical databases?
This research theme focuses on leveraging structured lexical knowledge bases such as WordNet and domain-specific ontologies to assess semantic similarity. It examines how taxonomic relationships (e.g., hypernym/hyponym, meronym/holonym), information content, and edge-weighted path lengths within ontological hierarchies can quantify semantic proximity in ways that align closely with human judgment. This matters because it grounds semantic similarity in well-curated resources and formalizes semantic distance metrics, supporting applications in information retrieval, word sense disambiguation, and semantic search wherever suitable lexical resources exist.
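The path-length idea described above can be sketched in a few lines. The toy hypernym hierarchy below is hypothetical (real systems would query a resource such as WordNet), and the similarity function is a simple unweighted variant: one divided by one plus the length of the shortest path through the lowest common ancestor.

```python
# Minimal sketch of path-based similarity over a toy hypernym taxonomy.
# The hierarchy is illustrative only; production systems use WordNet or
# a domain ontology instead of this hand-built child -> parent map.
TAXONOMY = {
    "dog": "canine", "wolf": "canine", "canine": "carnivore",
    "cat": "feline", "feline": "carnivore",
    "carnivore": "mammal", "mammal": "animal", "animal": None,
}

def ancestors(word):
    """Return the hypernym chain [word, parent, ..., root]."""
    chain = []
    while word is not None:
        chain.append(word)
        word = TAXONOMY.get(word)
    return chain

def path_similarity(a, b):
    """1 / (1 + shortest path length through the lowest common ancestor)."""
    depths_a = {node: i for i, node in enumerate(ancestors(a))}
    for depth_b, node in enumerate(ancestors(b)):
        if node in depths_a:  # first shared ancestor is the lowest one
            return 1.0 / (1 + depths_a[node] + depth_b)
    return 0.0  # no shared ancestor in this taxonomy

print(path_similarity("dog", "wolf"))  # siblings under "canine"
print(path_similarity("dog", "cat"))   # meet higher, at "carnivore"
```

Weighting edges (e.g., by depth or by information content) refines this baseline, which is exactly where information-content measures enter the picture.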
2. What role do distributional and corpus-based models play in capturing semantic similarity, particularly for weakly related or dissimilar concepts?
This theme explores statistical and distributional semantics, including semantic networks derived from co-occurrence information, latent semantic analysis, and lexico-syntactic pattern mining, to capture semantic similarity across words and texts. It is particularly concerned with modeling similarity judgments for weakly related or even dissimilar concepts, where knowledge-based resources offer limited coverage. Understanding these patterns is key to modeling human-like similarity judgments, extending semantic coverage beyond taxonomic relations, and supporting tasks such as semantic priming, episodic memory modeling, and robust semantic vector space embeddings.
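A compact illustration of the distributional approach is latent semantic analysis: factor a word-by-document count matrix with a truncated SVD and compare words by cosine similarity in the latent space. The five-word, three-document corpus below is a made-up toy, not real data; the mechanics are the standard LSA recipe.

```python
# Minimal LSA sketch: truncated SVD over a toy word-by-document count
# matrix (counts are illustrative; real LSA uses large corpora and
# typically tf-idf weighting before the SVD).
import numpy as np

words = ["doctor", "nurse", "hospital", "car", "engine"]
counts = np.array([
    [3, 2, 0],   # doctor
    [2, 3, 0],   # nurse
    [2, 2, 1],   # hospital
    [0, 0, 3],   # car
    [0, 1, 3],   # engine
], dtype=float)

# Rank-k SVD projects words into a low-dimensional latent space.
U, S, Vt = np.linalg.svd(counts, full_matrices=False)
k = 2
word_vecs = U[:, :k] * S[:k]

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

idx = {w: i for i, w in enumerate(words)}
def sim(w1, w2):
    return cosine(word_vecs[idx[w1]], word_vecs[idx[w2]])

print(sim("doctor", "nurse"))  # same latent topic, high similarity
print(sim("doctor", "car"))    # different topics, low similarity
```

Because the latent dimensions smooth over sparse counts, pairs with little or no direct co-occurrence can still receive a graded, nonzero similarity, which is what makes such models useful for weakly related concepts.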
3. How can semantic similarity be operationalized and evaluated in applied NLP tasks involving human interpretability and textual entailment?
This theme addresses approaches that combine semantic similarity measures with supervised learning and task-based evaluation to support interpretability and downstream applications such as semantic textual similarity (STS), textual entailment, and semantic search. It covers feature-rich models integrating lexical, syntactic, and semantic similarity metrics. It also discusses the design and assessment of benchmarks that measure how well computational similarity mimics human judgments, including datasets and test tasks focused on human interpretability and graded semantic equivalence.
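Benchmark evaluation in STS-style tasks typically reports the Pearson correlation between system scores and graded human ratings. A self-contained sketch is below; the system and gold scores are invented for illustration (STS uses a 0 to 5 graded scale), not drawn from any real benchmark.

```python
# Minimal sketch of STS-style evaluation: Pearson correlation between
# a system's similarity scores and graded human judgments.
# The score lists are hypothetical, illustrative values only.
import math

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

system = [4.2, 1.1, 3.0, 0.5, 4.8]  # hypothetical model outputs
gold   = [4.5, 0.8, 3.2, 1.0, 5.0]  # hypothetical human ratings (0-5)

r = pearson(system, gold)
print(f"Pearson r = {r:.3f}")
```

In feature-rich systems, each sentence pair would instead be represented by a vector of lexical, syntactic, and semantic similarity features fed to a supervised regressor, with this same correlation used as the evaluation target.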