Academia.edu

Similarity Measures

370 papers
251 followers
About this topic
Similarity measures are quantitative metrics used to assess the degree of similarity between two or more entities, such as objects, datasets, or patterns. These measures are fundamental in various fields, including statistics, machine learning, and information retrieval, facilitating tasks like clustering, classification, and recommendation.

Key research themes

1. How can component-wise and higher-order dissimilarity measures enhance similarity assessments in heterogeneous and complex data spaces?

This research area focuses on devising and analyzing dissimilarity/similarity measures tailored for complex real-world objects represented by heterogeneous, multi-component data. Traditional metric or Euclidean assumptions often fail in such unconventional spaces, where data comprise mixed types (numerical, categorical, time series, graphs). Component-wise dissimilarities allow each heterogeneous component to be compared using a domain-appropriate sub-measure, often combined through weighted convex combinations. Theoretical and experimental studies explore how these weighted measures affect metric properties and Euclidean embeddability. Further, the concept of meta-distances introduces higher-order similarities that consider the relative similarities of objects with respect to the entire dataset, thereby capturing richer relational patterns beyond pairwise comparisons. Such measures prove essential for improving pattern recognition and local classification performance in complex domains.
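As a minimal sketch of the idea (the sub-measures, field names, and weights below are illustrative assumptions, not taken from any of the cited papers), a component-wise dissimilarity over mixed numeric/categorical records can be written as a weighted convex combination of per-component sub-measures:

```python
def numeric_diss(a, b):
    # domain-appropriate sub-measure for a numeric component
    return abs(a - b)

def categorical_diss(a, b):
    # simple overlap sub-measure for a categorical component
    return 0.0 if a == b else 1.0

def combined_diss(x, y, weights):
    # x, y: (numeric, categorical) tuples; weights form a convex combination
    # (non-negative, summing to 1), one weight per component
    subs = [numeric_diss(x[0], y[0]), categorical_diss(x[1], y[1])]
    return sum(w * d for w, d in zip(weights, subs))
```

Note that even when every sub-measure is metric, the weighted combination over heterogeneous components need not yield a Euclidean-embeddable dissimilarity matrix, which is exactly the phenomenon the first key finding below examines.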

Key finding: The paper formalizes component-wise dissimilarity measures to accommodate heterogeneous real-world data described by mixed features and demonstrates that such dissimilarities often produce non-Euclidean matrices, limiting...
Key finding: Introducing meta-distances constructed from primary classical distances by incorporating an adjunct dissimilarity factor encoding higher-order similarity relationships among all objects in a dataset, this study demonstrates...
Key finding: The 'brsim' R package operationalizes the Brainerd-Robinson similarity coefficient designed for compositional data, facilitating significance testing through permutation methods and hierarchical clustering analyses. Its...
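The Brainerd-Robinson coefficient referenced above is simple to state: for two compositional profiles expressed as percentages, it equals 200 minus the sum of absolute percentage differences, so 200 means identical composition and 0 means completely disjoint. The cited 'brsim' package is in R; the following Python sketch only illustrates the coefficient itself, not the package's permutation testing or clustering features:

```python
def brainerd_robinson(a, b):
    # a, b: raw counts per category; convert each profile to percentages,
    # then subtract the summed absolute differences from the maximum of 200
    pa = [100 * x / sum(a) for x in a]
    pb = [100 * x / sum(b) for x in b]
    return 200 - sum(abs(p - q) for p, q in zip(pa, pb))
```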

2. How can semantic and fuzzy similarity measures be parametrically adapted and combined to improve conceptual reasoning and decision-making?

This theme covers theoretical and applied advancements in parametrically flexible similarity measures designed for semantic resources and fuzzy set representations. Semantic similarity methods leverage information content and ontology-based taxonomies with weights informed by either resource frequency or ontology structure, allowing improved assessment of concept relatedness capturing both statistical and domain-specific knowledge. Similarly, in fuzzy logic, combining distance and similarity measures into unified parametric forms addresses challenges in fuzzy set comparison, avoiding ambiguous interpretations when sets are disjoint or partially overlapping. Parametric adjustments and combinations enable tailoring similarity measures to better reflect nuanced semantic or fuzzy relationships, thereby enhancing applications such as semantic retrieval, multi-attribute decision making, and reasoning under uncertainty.
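The idea of fusing similarity and distance into one parametric form can be sketched as follows. The particular sub-measures here (a min/max overlap similarity and a normalized Hamming distance over finite fuzzy sets) and the OWA weights are illustrative assumptions, not the exact formulation of any paper in this theme:

```python
def owa(values, weights):
    # ordered weighted averaging: weights are applied to values sorted descending
    return sum(w * v for w, v in zip(weights, sorted(values, reverse=True)))

def fuzzy_overlap_sim(A, B):
    # min/max overlap similarity between membership vectors of finite fuzzy sets
    return sum(min(a, b) for a, b in zip(A, B)) / sum(max(a, b) for a, b in zip(A, B))

def fuzzy_hamming_dist(A, B):
    # normalized Hamming distance between membership vectors
    return sum(abs(a - b) for a, b in zip(A, B)) / len(A)

def combined_measure(A, B, weights=(0.6, 0.4)):
    # fuse a similarity and the complement of a distance through an OWA operator;
    # adjusting the weights tailors the measure to the application
    return owa([fuzzy_overlap_sim(A, B), 1 - fuzzy_hamming_dist(A, B)], weights)
```

Because the two fused values need not agree when sets partially overlap, the combined form avoids relying on a single measure whose interpretation becomes ambiguous for disjoint or partially overlapping sets.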

Key finding: The paper presents SemSim_p, a parametric semantic similarity method that improves upon its predecessor by adjusting ontology concept weights and normalization factors. Experiments using the ACM Computing Classification...
Key finding: This work formulates a novel combined measure of similarity and distance between fuzzy sets using an ordered weighted averaging (OWA) operator. It overcomes limitations when similarity or distance measures are used...
Key finding: Addressing limitations in existing picture fuzzy set similarity measures, the paper introduces a parametric similarity measure with three adjustable parameters (m1, m2, m3) enabling flexible decision-making styles. Analytical...

3. What novel similarity measures improve performance and interpretability in collaborative filtering and image similarity tasks?

This research cluster investigates new or hybrid similarity metrics tailored to enhance the effectiveness of collaborative filtering (CF) recommender systems and image similarity assessment. For CF, combining classical numerical similarity measures (e.g., cosine, Pearson correlation) with Jaccard similarity—which emphasizes presence/absence of ratings rather than rating magnitude—has been shown to produce superior neighbor identification and recommendation accuracy. In image similarity, beyond traditional pixel-wise metrics (PSNR, SSIM), novel approaches leverage fuzzy set solutions derived via max–min and min–max compositions or convolutional neural networks (CNNs) to capture nuanced perceptual similarities and increase robustness to noise. These advances address key challenges including sparsity, noise sensitivity, and semantic expressiveness, advancing both theory and practical applications in recommendation systems and image quality assessment.
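The hybrid idea described above can be sketched directly: Jaccard captures which items two users both rated, cosine captures how similarly they rated them, and a product fuses the two. The product is one illustrative fusion choice made for this sketch; it is not claimed to be the exact combination used in the cited papers:

```python
import math

def cosine(u, v):
    # cosine similarity over co-rated items; u, v are dicts: item -> rating
    common = set(u) & set(v)
    if not common:
        return 0.0
    num = sum(u[i] * v[i] for i in common)
    den = (math.sqrt(sum(u[i] ** 2 for i in common))
           * math.sqrt(sum(v[i] ** 2 for i in common)))
    return num / den

def jaccard(u, v):
    # presence/absence overlap: ignores rating magnitudes entirely
    return len(set(u) & set(v)) / len(set(u) | set(v))

def hybrid_sim(u, v):
    # fuse structural overlap (Jaccard) with rating agreement (cosine)
    return jaccard(u, v) * cosine(u, v)
```

The multiplicative fusion penalizes neighbor pairs that agree on only a tiny co-rated set, which is precisely the sparsity failure mode of cosine or Pearson used alone.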

Key finding: Experiments on MovieLens and FilmTrust datasets demonstrate that hybrid similarity measures combining Jaccard similarity—which captures rating presence—and numerical measures like cosine and Pearson outperform any single...
Key finding: Extending prior findings, this paper empirically validates that fusing Jaccard similarity with classical numerical similarity metrics yields significant improvements in collaborative filtering performance. Rigorous testing on...
Key finding: The study proposes a novel fuzzy-based image similarity measure that computes similarity using the greatest and smallest fuzzy sets derived as symmetrical solutions of fuzzy relation equations for image blocks. Evaluation on...
Key finding: Introducing a deep learning-based approach, this paper develops a CNN model to assess similarity between UML class diagram images, thereby enabling automatic, objective evaluation of student diagrams in education. The model...
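For reference, PSNR, the traditional pixel-wise baseline that the fuzzy and CNN approaches above aim to improve on, is easy to state: it is the log-scaled ratio of the maximum possible pixel value to the mean squared error between two images. A minimal sketch for grayscale images flattened to equal-length pixel sequences:

```python
import math

def psnr(img1, img2, max_val=255.0):
    # peak signal-to-noise ratio between two equal-size grayscale images,
    # given as flat sequences of pixel intensities in [0, max_val]
    mse = sum((a - b) ** 2 for a, b in zip(img1, img2)) / len(img1)
    if mse == 0:
        return float('inf')  # identical images
    return 10 * math.log10(max_val ** 2 / mse)
```

Being purely pixel-wise, PSNR is blind to structure and perception, which is the main motivation for SSIM and the learned measures discussed in this theme.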

All papers in Similarity Measures

Computer-assisted consensus in medical imaging involves automatic comparison of morphological abnormalities observed by physicians in images. We built an ontology of morphological abnormalities in breast pathology to assist inter-observer...
At present, many experts in the field of information technology have designed and developed algorithms to solve stemming problems, especially in Arabic. However, among the many stemming analyses for Arabic, there is no standardization of a good...
Every textbook is built upon the foundation of key concepts. Books that contain concepts that share some common properties and are semantically related are more lucid and intelligible than those that contain many unrelated concepts. These...
measure between intuitionistic fuzzy sets and its application to
K-means clustering is a method of grouping data by looking for similarities between the attributes of data points; it can cope with high-dimensional data because of the simplicity of its algorithm. The disadvantage of the...
Clustering is one of the most widely used machine learning techniques in data processing. Clustering has a wide range of applications, including market research, pattern recognition, data analysis, and image processing, among others. The...
Memory is a complex phenomenon, and musical memory is especially interesting because it can involve so many facets: a visual image of the score, an aural recollection of the melody, the kinesthetic response of a performer, an analytical...
Clustering is widely used to explore and understand large collections of data. In this thesis, we introduce LIMBO, a scalable hierarchical categorical clustering algorithm based on the Information Bottleneck (IB) framework for quantifying...
Most ontology alignment tools use terminological techniques as the initial step and then apply structural techniques to refine the results. Since each terminological similarity measure considers some features of similarity,...
In this paper, the definition of intuitionistic fuzzy parameterised fuzzy soft sets (ifpfs-sets) is introduced along with their properties. Two operations on ifpfs-sets, namely union and intersection, are introduced. Also, some examples of these...
The goal of image or video quality assessment is to evaluate whether a distorted image or video is of good quality by quantifying the difference between the original and distorted images or videos. In this paper, to assess the visual quality...
Using the example of the 'h-related' publication dataset created for a previous study on the literature of Hirsch-type measures (Zhang et al. in J Informetr 5(3):583-593, 2011) and updated for the present paper, we attempt to study the...
Public policies concerned with the reduction of poverty increasingly rely on identifying the most deprived households with the use of statistical targeting techniques. Targeting methods aim to measure deprivation as accurately as possible...
Estimation of texture similarity is fundamental to many material recognition tasks. This study uses fine-grained human perceptual similarity ground-truth to provide a comprehensive evaluation of 51 texture feature sets. We conduct two...
As each day passes, the world's NT requirements increase due to growing population and technological advancements. Currently, traditional technologies are inadequate to support the requirement. It is vital to investigate...
The proposed approach consists in comparing and then ranking proximity measures in a topological context in order to select the best measure for carrying out a topological correspondence analysis. The similarity measures...
Proceedings, Seminar Nasional PESAT 2005, Auditorium Universitas Gunadarma, Jakarta, 23-24 August 2005, ISSN: 1858-2559. The Importance of the Role of Language in Computer-Based Information Interoperability Due to Semantic Diversity. I Wayan ...
Electronic news has become increasingly popular since the beginning of the internet's growth. Through the internet, electronic news is packaged in such a way that it can deliver up-to-date information to the public. However, this...
Abstract. Authorship analysis has become a decisive tool for the analysis of digital documents in the forensic sciences. We propose an Authorship Verification method based on analyzing the similarities...
Information is rising exponentially over the Internet. The World Wide Web has emerged as a treasure trove of knowledge and provides relevant information pertaining to any exclusive topic as per the individual's demand....
Recommender Systems (RSs) work as a personal agent for individuals who are not able to make decisions from the potentially overwhelming number of alternatives available on the World Wide Web (or simply Web). Neighborhood-based algorithms...
Reliable data and a robust conceptual framework are two necessary preconditions for anti-poverty measures to be effective and achieve their goal of bringing people out of poverty. Both preconditions are far from met in the case of Roma...
This paper is a joint effort between five institutions that introduces several novel similarity measures and combines them to carry out a multimodal segmentation evaluation. The new similarity measures proposed are based on the location...
The spell-checker approach is designed to validate and correct misspelled words by providing a list of alternative words that are more closely related to the erroneous one. Currently, English-language spell checkers are well...
Fuzzy risk analysis is widely used in risk assessment of components by linguistic terms. Fuzzy numbers are used to quantify the associated uncertainty. This study employs fuzzy risk analysis to evaluate processes for implementing...
Very expressive Description Logics in the SH family have worst-case complexity ranging from EXPTIME to double NEXPTIME. In spite of this, they are very popular with modellers and serve as the foundation of the Web Ontology Language (OWL),...
Plagiarism of digital documents is a serious problem in today's era. Plagiarism refers to the use of someone else's data, language, and writing without proper acknowledgment of the original source. Plagiarism can be of different types. This...
Collaborative filtering (CF), one of the most widely employed methodologies for recommender systems, has drawn undeniable attention due to its effectiveness and simplicity. Nevertheless, a few papers have been published on the CF-based...
CBIR (content-based image retrieval) is the process that focuses on providing efficient retrieval of digital images from a huge collection/database of images. Many researchers and PhD scholars are working on this topic, so...
In today's world of the internet, with a whole lot of e-documents, such as HTML pages and digital libraries, occupying a considerable amount of cyberspace, organizing these documents has become a practical need. Clustering is an important...
TOPSIS, developed in 1981 by Hwang and Yoon, is one of the known multi-criteria decision-making (MCDM) methods. In 2015, the group decision-making method based on TOPSIS under a fuzzy soft environment was defined and applied to a...
There are a number of challenging problems in Content Based Image Retrieval (CBIR), particularly concerning the structure of the image and the image database. Separating an image into its constituent parts is a major task in this area. In fact, an image...
This thesis makes an original contribution to knowledge in the field of data objects' comparison, where the objects are described by attributes of fuzzy or heterogeneous (numeric and symbolic) data types.
The minimum backward Fréchet distance (MBFD) problem is a natural optimization problem for the weak Fréchet distance, a variant of the well-known Fréchet distance. In this problem, a threshold ε and two polygonal curves, T1 and T2, are...
Document clustering is an unsupervised machine learning technique that organizes a large collection of documents into smaller, topic-homogeneous, meaningful sub-collections (clusters). Traditional document clustering approaches use...
Research Summary: We propose using text matching to measure the technological similarity between patents. Technology experts from different fields validate the new similarity measure and its improvement on measures based on the United...
Different fields such as linguistics, teaching, and computing have demonstrated special interest in the study of sign languages (SL). However, the processes of teaching and learning these languages become complex, since it is unusual to find...
Face recognition is a complex visual classification task which plays an important role in computer vision, image processing, and pattern recognition. SMWT is proposed to extract the features in images before using the PCA and histogram...
Several methods exist in the classification literature to quantify the similarity between two time series data sets. Applications of these methods range from the traditional Euclidean-type metric to the more advanced Dynamic Time Warping...
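Of the time-series measures this entry mentions, Dynamic Time Warping (DTW) is the standard step up from a Euclidean-type metric because it aligns sequences that are locally shifted or stretched in time. A minimal O(nm) dynamic-programming sketch:

```python
def dtw(s, t):
    # dynamic time warping distance between two numeric sequences:
    # D[i][j] holds the cheapest cumulative alignment cost of s[:i] and t[:j]
    n, m = len(s), len(t)
    INF = float('inf')
    D = [[INF] * (m + 1) for _ in range(n + 1)]
    D[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(s[i - 1] - t[j - 1])
            # extend the best of: insertion, deletion, or match
            D[i][j] = cost + min(D[i - 1][j], D[i][j - 1], D[i - 1][j - 1])
    return D[n][m]
```

Unlike a Euclidean metric, DTW can score a sequence and its locally stretched copy as identical, which is why it dominates in time-series classification benchmarks.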
In this paper, we propose a new approach for checking spelling errors committed in the Arabic language. This approach is almost independent of the dictionary used, since we introduced the concept of morphological analysis into the...
Introduced is a new algorithm for the classification of numerical data using the theory of fuzzy soft sets, named the Fuzzy Soft Set Classifier (FSSC). The algorithm uses the fuzzy approach in the pre-processing stage to obtain features, and...