Text Similarity Functions Research Papers

Soft Cardinality+ ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

2024, Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012)

This paper presents a novel approach for building adaptive similarity functions based on cardinality using machine learning. Unlike current approaches that build feature sets using similarity scores, we have developed these feature sets... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY-CORE: Improving Text Overlap with Distributional Measures for Semantic Textual Similarity

by Sergio Jimenez

2024

Soft cardinality has been shown to be a very strong text-overlapping baseline for the task of measuring semantic textual similarity (STS), obtaining 3 rd place in SemEval-2012. At *SEM-2013 shared task, beside the plain textoverlapping... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: hierarchical text overlap for student response analysis

by Sergio Jimenez

2024

In this paper we describe our system used to participate in the Student-Response-Analysis task-7 at SemEval 2013. This system is based on text overlap through the soft cardinality and a new mechanism for weight propagation. Although there... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT

by Sergio Jimenez

2024

In this paper we describe our system submit- ted for evaluation in the CLTE-SemEval-2013 task, which achieved the best results in two of the four data sets, and finished third in av- erage. This system consists of a SVM clas- sifier with... more

descriptionView Paper arrow_downwardDownload

Soft Cardinality+ ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

by Sergio Jimenez

2024, Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012)

This paper presents a novel approach for building adaptive similarity functions based on cardinality using machine learning. Unlike current approaches that build feature sets using similarity scores, we have developed these feature sets... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT

by claudia becerra

2024

In this paper we describe our system submit- ted for evaluation in the CLTE-SemEval-2013 task, which achieved the best results in two of the four data sets, and finished third in av- erage. This system consists of a SVM clas- sifier with... more

descriptionView Paper arrow_downwardDownload

Unsupervised method for the authorship identification task

by Darnes Vilariño Ayala

2023

This paper presents an approach for tackling the authorship identification task. The approach is based on comparing the similarity between a given unknown document against the known documents using a number of different phrase-level and... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: hierarchical text overlap for student response analysis

by Sergio Jiménez

2023

In this paper we describe our system used to participate in the Student-Response-Analysis task-7 at SemEval 2013. This system is based on text overlap through the soft cardinality and a new mechanism for weight propagation. Although there... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT

by Sergio Jiménez

2023

In this paper we describe our system submit- ted for evaluation in the CLTE-SemEval-2013 task, which achieved the best results in two of the four data sets, and finished third in av- erage. This system consists of a SVM clas- sifier with... more

descriptionView Paper arrow_downwardDownload

Baselines for Natural Language Processing Tasks Based on Soft Cardinality Spectra

by Sergio Jiménez

2023, Appl. Comput. Math

Abstract. Soft-cardinality spectra (SC spectra) is a new method of approximation for text strings in linear time, which divides text strings into character q-grams of different sizes. The method allows simultaneous use of weighting at... more

descriptionView Paper arrow_downwardDownload

Soft Cardinality+ ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

by Sergio Jiménez

2023, Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012)

This paper presents a novel approach for building adaptive similarity functions based on cardinality using machine learning. Unlike current approaches that build feature sets using similarity scores, we have developed these feature sets... more

descriptionView Paper arrow_downwardDownload

Soft cardinality: A parameterized similarity function for text comparison

by Sergio Jiménez

2023, In Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012), in conjunction with the First Joint Conference on Lexical and Computational Semantics (* SEM 2012), Montreal, Canada

We present an approach for the construction of text similarity functions using a parameterized resemblance coefficient in combination with a softened cardinality function called soft cardinality. Our approach provides a consistent and... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY-CORE: Improving Text Overlap with Distributional Measures for Semantic Textual Similarity

by Claudia Elizabeth Saldias Becerra

2023

The soft cardinality proved to be a very strong text-overlapping baseline for the task of semantic-textual-similarity (STS) obtaining the third place in SemEval-2012. This year, besides to the plain text-overlapping approach, two... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: hierarchical text overlap for student response analysis

by Claudia Elizabeth Saldias Becerra

2023

In this paper we describe our system used to participate in the Student-Response-Analysis task-7 at SemEval 2013. This system is based on text overlap through the soft cardinality and a new mechanism for weight propagation. Although there... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT

by Claudia Elizabeth Saldias Becerra

2023

In this paper we describe our system submit- ted for evaluation in the CLTE-SemEval-2013 task, which achieved the best results in two of the four data sets, and finished third in av- erage. This system consists of a SVM clas- sifier with... more

descriptionView Paper arrow_downwardDownload

Soft Cardinality+ ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

by Claudia Elizabeth Saldias Becerra

2023, Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012)

This paper presents a novel approach for building adaptive similarity functions based on cardinality using machine learning. Unlike current approaches that build feature sets using similarity scores, we have developed these feature sets... more

descriptionView Paper arrow_downwardDownload

Soft Cardinality+ ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

by Claudia Alfaro Becerra

2023, Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012)

This paper presents a novel approach for building adaptive similarity functions based on cardinality using machine learning. Unlike current approaches that build feature sets using similarity scores, we have developed these feature sets... more

descriptionView Paper arrow_downwardDownload

Unsupervised method for the authorship identification task

by Esteban Castillo

2023

This paper presents an approach for tackling the authorship identification task. The approach is based on comparing the similarity between a given unknown document against the known documents using a number of different phrase-level and... more

descriptionView Paper arrow_downwardDownload

Unsupervised method for the authorship identification task

by Esteban Castillo

2022

This paper presents an approach for tackling the authorship identification task. The approach is based on comparing the similarity between a given unknown document against the known documents using a number of different phrase-level and... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY-CORE: Improving Text Overlap with Distributional Measures for Semantic Textual Similarity

by Claudia Alheli Torrez Becerra

2022

Soft cardinality has been shown to be a very strong text-overlapping baseline for the task of measuring semantic textual similarity (STS), obtaining 3 rd place in SemEval-2012. At *SEM-2013 shared task, beside the plain textoverlapping... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: hierarchical text overlap for student response analysis

by Claudia Alheli Torrez Becerra

2022

In this paper we describe our system used to participate in the Student-Response-Analysis task-7 at SemEval 2013. This system is based on text overlap through the soft cardinality and a new mechanism for weight propagation. Although there... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT

by Claudia Alheli Torrez Becerra

2022

In this paper we describe our system submit- ted for evaluation in the CLTE-SemEval-2013 task, which achieved the best results in two of the four data sets, and finished third in av- erage. This system consists of a SVM clas- sifier with... more

descriptionView Paper arrow_downwardDownload

Soft Cardinality+ ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

by Claudia Alheli Torrez Becerra

2022, Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012)

This paper presents a novel approach for building adaptive similarity functions based on cardinality using machine learning. Unlike current approaches that build feature sets using similarity scores, we have developed these feature sets... more

descriptionView Paper arrow_downwardDownload

Soft cardinality: A parameterized similarity function for text comparison

by Claudia Alheli Torrez Becerra

2022, In Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012), in conjunction with the First Joint Conference on Lexical and Computational Semantics (* SEM 2012), Montreal, Canada

We present an approach for the construction of text similarity functions using a parameterized resemblance coefficient in combination with a softened cardinality function called soft cardinality. Our approach provides a consistent and... more

descriptionView Paper arrow_downwardDownload

Measuring Semantic Textual Similarity of Sentences Using Modified Information Content and Lexical Taxonomy

by GOUTAM MAJUMDER

2022

In this paper, we present a survey and comparative studies on semantic textual similarity methods, those are based on WordNet taxonomy. We also proposed a new method for measuring semantic similarity between sentences. This proposed... more

descriptionView Paper arrow_downwardDownload

Narrative Similarity as Common Summary

by Elektra Kypridemou

2022

The ability to identify similarities between narratives has been argued to be central in human interactions. Previous work that sought to formalize this task has hypothesized that narrative similarity can be equated to the existence of a... more

descriptionView Paper arrow_downwardDownload

Baselines for Natural Language Processing Tasks Based on Soft Cardinality Spectra

by Sergio Jimenez

2022, Appl. Comput. Math

Abstract. Soft-cardinality spectra (SC spectra) is a new method of approximation for text strings in linear time, which divides text strings into character q-grams of different sizes. The method allows simultaneous use of weighting at... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT

by Claudia Becerra

2022

In this paper we describe our system submitted for evaluation in the CLTE-SemEval-2013 task, which achieved the best results in two of the four data sets, and finished third in average. This system consists of a SVM classifier with... more

descriptionView Paper arrow_downwardDownload

Uma proposta de recuperação de imagens mamográficas baseada em conteúdo

by Engenharia eletrica

2022

Resumo-O presente trabalho apresenta o desenvolvimento de um sistema computacional para recuperação de imagens baseada em conteúdo, denominado SRIM-Sistema de Recuperação de Imagens Mamográficas. O SRIM tem como objetivo permitir a... more

descriptionView Paper arrow_downwardDownload

Soft Cardinality+ ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

by sergio jimenez

2022, Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012)

This paper presents a novel approach for building adaptive similarity functions based on cardinality using machine learning. Unlike current approaches that build feature sets using similarity scores, we have developed these feature sets... more

descriptionView Paper arrow_downwardDownload

Soft cardinality: A parameterized similarity function for text comparison

by sergio jimenez

2022, In Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012), in conjunction with the First Joint Conference on Lexical and Computational Semantics (* SEM 2012), Montreal, Canada

We present an approach for the construction of text similarity functions using a parameterized resemblance coefficient in combination with a softened cardinality function called soft cardinality. Our approach provides a consistent and... more

descriptionView Paper arrow_downwardDownload

Soft cardinality: A parameterized similarity function for text comparison

by Sergio Sarmiento Jimenez

2021, In Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012), in conjunction with the First Joint Conference on Lexical and Computational Semantics (* SEM 2012), Montreal, Canada

We present an approach for the construction of text similarity functions using a parameterized resemblance coefficient in combination with a softened cardinality function called soft cardinality. Our approach provides a consistent and... more

descriptionView Paper arrow_downwardDownload

Using CBAR Concepts to Automate Test Oracles for TTS systems

by Fatima Santos Nunes

2021

This research aims to explore CBAR concepts to implement test oracles to support testing activities of TTS Systems, helping the human in quality evaluations. In an automated software testing environment, Test Oracles represent the... more

descriptionView Paper arrow_downwardDownload

Estrutura para Utilização de Recuperação de Imagens Baseada em Conteúdo em Oráculos de Teste de Software com Saída Gráfica

by Fatima Santos Nunes

2021

This paper presented a prototype of a system whose goal is to highlight the opportunity to explore computer vision applied in the Content-based Image Retrievial (CBIR), in order to testing oracles for software that generate graphical... more

descriptionView Paper arrow_downwardDownload

Avaliaçao de Funçoes de Similaridade em um Framework de Teste para Programas com Saıdas Gráficas

by Fatima Santos Nunes

2021

No contexto de teste de software, um desafio a ser vencidoé o teste de programas com saídas gráficas. A Recuperação de Imagens Baseada em Conteúdo (CBIR) constitui uma abordagem factível para esses testes, mas seus resultados podem variar... more

descriptionView Paper arrow_downwardDownload

Avaliando Diferentes Implementações Do Descritor De Cor Dominante

by Caio Benedito

2021, Revista Mundi Engenharia, Tecnologia e Gestão (ISSN: 2525-4782)

Resumo: A recuperação de imagens por conteúdo, tem atraído bastante atenção, principalmente, em grandes conjuntos de imagens onde solicitar dos usuários rótulos para cada uma das imagens se torna um processo custoso e mais suscetível a... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT

by Claudia Becerra

2021, Second Joint Conference on Lexical and Computational Semantics (*SEM), Volume 2: Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013)

In this paper we describe our system submitted for evaluation in the CLTE-SemEval-2013 task, which achieved the best results in two of the four data sets, and finished third in average. This system consists of a SVM classifier with... more

descriptionView Paper arrow_downwardDownload

UNAL-NLP: Combining Soft Cardinality Features for Semantic Textual Similarity, Relatedness and Entailment

by Julia Baquero

2021, Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014)

This paper describes our participation in the SemEval-2014 tasks 1, 3 and 10. We used an uniform approach for addressing all the tasks using the soft cardinality for extracting features from text pairs, and machine learning for predicting... more

descriptionView Paper arrow_downwardDownload

Text comparison using soft cardinality

by Sergio Jimenez

2021, String Processing and Information Retrieval

The classical set theory provides a method for comparing objects using cardinality and intersection, in combination with well-known resemblance coefficients such as Dice, Jaccard, and cosine. However, set operations are intrinsically... more

descriptionView Paper arrow_downwardDownload

Data clustering using efficient similarity measures

by Desmond B A L A Bisandu

2019, Journal of Statistics and Management Systems

The need for appropriate applications of the various similarity measures for clustering has arisen over the years as data massively keep on increasing. The issue of deciding which similarity measure is the best and on what kind of dataset... more

m = Number of terms / characters, n = Size of the data set/ document Comparison of the various similarity measures Table 1

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY-CORE: Improving Text Overlap with Distributional Measures for Semantic Textual Similarity

by CLAUDIA RIOS BECERRA

2017

Soft cardinality has been shown to be a very strong text-overlapping baseline for the task of measuring semantic textual similarity (STS), obtaining 3 rd place in SemEval-2012. At *SEM-2013 shared task, beside the plain textoverlapping... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: hierarchical text overlap for student response analysis

by CLAUDIA RIOS BECERRA

2017

In this paper we describe our system used to participate in the Student-Response-Analysis task-7 at SemEval 2013. This system is based on text overlap through the soft cardinality and a new mechanism for weight propagation. Although there... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT

by CLAUDIA RIOS BECERRA

2017

In this paper we describe our system submit- ted for evaluation in the CLTE-SemEval-2013 task, which achieved the best results in two of the four data sets, and finished third in av- erage. This system consists of a SVM clas- sifier with... more

descriptionView Paper arrow_downwardDownload

Soft Cardinality+ ML: Learning Adaptive Similarity Functions for Cross-lingual Textual Entailment

by CLAUDIA RIOS BECERRA

2017, Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012)

This paper presents a novel approach for building adaptive similarity functions based on cardinality using machine learning. Unlike current approaches that build feature sets using similarity scores, we have developed these feature sets... more

descriptionView Paper arrow_downwardDownload

Soft cardinality: A parameterized similarity function for text comparison

by CLAUDIA RIOS BECERRA

2017, In Proceedings of the 6th International Workshop on Semantic Evaluation (SemEval 2012), in conjunction with the First Joint Conference on Lexical and Computational Semantics (* SEM 2012), Montreal, Canada

We present an approach for the construction of text similarity functions using a parameterized resemblance coefficient in combination with a softened cardinality function called soft cardinality. Our approach provides a consistent and... more

descriptionView Paper arrow_downwardDownload

SC spectra: A linear-time soft cardinality approximation for text comparison

by Alexander Gelbukh

2016

Soft cardinality (SC) is a softened version of the classical cardinality of set theory. However, given its prohibitive cost of computing (exponential order), an approximation that is quadratic in the number of terms in the text has been... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: hierarchical text overlap for student response analysis

by Alexander Gelbukh

2016

In this paper we describe our system used to participate in the Student-Response-Analysis task-7 at SemEval 2013. This system is based on text overlap through the soft cardinality and a new mechanism for weight propagation. Although there... more

descriptionView Paper arrow_downwardDownload

SC Spectra: A Linear-Time Soft Cardinality Approximation for Text Comparison

by Alexander Gelbukh

2016, Lecture Notes in Computer Science

Soft cardinality (SC) is a softened version of the classical cardinality of set theory. However, given its prohibitive cost of computing (exponential order), an approximation that is quadratic in the number of terms in the text has been... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY: Learning to Identify Directional Cross-Lingual Entailment from Cardinalities and SMT

by Alexander Gelbukh

2016

In this paper we describe our system submit- ted for evaluation in the CLTE-SemEval-2013 task, which achieved the best results in two of the four data sets, and finished third in av- erage. This system consists of a SVM clas- sifier with... more

descriptionView Paper arrow_downwardDownload

SOFTCARDINALITY-CORE: Improving Text Overlap with Distributional Measures for Semantic Textual Simil

by Claudia Becerra

2016

Soft cardinality has been shown to be a very strong text-overlapping baseline for the task of measuring semantic textual similarity (STS), obtaining 3 rd place in SemEval-2012. At *SEM-2013 shared task, beside the plain textoverlapping... more

descriptionView Paper arrow_downwardDownload

Text Similarity Functions

Key research themes

1. What are the primary methodological categories for text similarity functions and how do their strengths and weaknesses compare?

2. How can advanced linguistic and semantic resources enhance text similarity detection beyond surface-level measures?

3. What are effective parameterized and empirical similarity functions for text comparison and how can parameters be optimized?

All papers in Text Similarity Functions

Text Similarity Functions

Key research themes

1. What are the primary methodological categories for text similarity functions and how do their strengths and weaknesses compare?

2. How can advanced linguistic and semantic resources enhance text similarity detection beyond surface-level measures?

3. What are effective parameterized and empirical similarity functions for text comparison and how can parameters be optimized?

Related Topics

All papers in Text Similarity Functions