Multiword Expressions Research Papers

Espressioni idiomatiche gestuo-cinesiche. Risultati di un' analisi contrastiva ITA-FRA-ENG

2025, Le espressioni idiomatiche gestuo-cinesiche tra corpo e cultura: un'analisi contrastiva italiano-inglese e italiano francese

Il lavoro espone i risultati di due indagini contrastive italiano-inglese e italiano-francese riguardanti le espressioni idiomatiche (EI) gestuo-cinesiche, cioè caratterizzate dal fare riferimento a gesti e ad altri comportamenti cinesici... more

descriptionView Paper arrow_downwardDownload

A Matrix-Based Heuristic Algorithm for Extracting Multiword Expressions from a Corpus

by Orhan Bilgin

2025

This paper describes an algorithm for automatically extracting multiword expressions (MWEs) from a corpus. The algorithm is nodebased, i.e. extracts MWEs that contain the item specified by the user, using a fixed window-size around the... more

descriptionView Paper arrow_downwardDownload

XI Congresso Internazionale di Fraseologia e Paremiologia

by Marco Luchi

2025

In this paper, we will focus on the structural, cognitive, and cross-cultural properties of proverbs. The first part examines the structural characteristics of proverbial sentences, while the second part explores the cognitive framework... more

In this paper, we will focus on the structural, cognitive, and cross-cultural properties of proverbs. The first part examines the structural characteristics of proverbial sentences, while the second part explores the cognitive framework that underlies their structure. The data includes 300 proverbs from three different languages (English, Croatian and Italian). This kind of analysis stands at the crossroads between cognitive and structural linguistics. The cognitive aspect includes two key theories: conceptual metaphor theory and relevance theory. Our primary assumption is that the basic features of proverbs-phonetic, syntactic, and semanticarise from an adjustment of cognitive patterns to the demands of language use. The high frequency of many communicative situations, coupled with the natural desire for brevity (cognitive economy), influences the structure of proverbs. A significant portion of the proverbs in our dataset (62%) rely on metaphors. This analysis leads to an underlying motivation for the figurative meanings conveyed by proverbs. By mapping concrete experiences (source domains) onto abstract concepts (target domains), metaphors enhance our understanding of abstract ideas and universal truths. While current research sheds light on several key aspects, it also highlights important conceptual and linguistic gaps-particularly regarding the effects of associative word networks and the roles of binarism and symmetry in the memorization and retrieval of proverbs. Our study also emphasizes the structural distinctions between literal and metaphorical proverbs. To conclude, our data analysis confirms that the syntactic, semantic, and phonological markers in proverbs are interdependent. There is a high degree of correlation between these features, leading to a potentially universal pattern utilized across different languages. Furthermore, the principles of memorability and cognitive economy are confirmed to play pivotal roles in the structure of proverbs. This study aims to address theoretical challenges and deepen understanding of the universal cognitive features inherent in proverbs, encouraging further exploration of conceptual power of proverbs across diverse cultures. Danica Skara is retired professor emeritus at the University of Split, Croatia. She recieved her MA from the University of Zagreb; her PhD from the University of Zadar, Croatia. Her area of specialization includes linguistics and related disciplines: semantics, cognitive linguistics, psycholinguistics, paremiology. Apart from a variety of articles (43) on the subjects, she has published 5 books and participated in the work of 45 conferences. She was the University's Vice-Chancellor (2003. In 2000 she founded a doctoral programme in linguistics at the University of Zadar and acted as the director of the programme. She was a Head of postgraduate programme: European studies. In 2003 she was awarded a Fulbrigt Fellowship at the Cornell University (USA). She acted as a plenary speaker/coorganizer of several international conferences; she was a visiting scholar at

descriptionView Paper arrow_downwardDownload

Breaking the mold: WhatsApp-enhanced intentional vocabulary learning

by Ines Boufahja

2025

This comparative study examines the impact of WhatsApp-based instruction versus text-based instruction on intentional media-related vocabulary learning and learners' perceptions. Specifically, it investigates the effects of WhatsApp... more

This comparative study examines the impact of WhatsApp-based instruction versus text-based instruction on intentional media-related vocabulary learning and learners' perceptions. Specifically, it investigates the effects of WhatsApp functionalities (e.g. multimodal input, media sharing and group chat) on deliberate learning, word retention and collaboration among final-year undergraduate students. Within a mixedmethods framework, a convenient sample of 70 students from a language institute in Tunisia was assigned to experimental group (EG) and control group (CG). While vocabulary achievement was assessed through a pretest-posttest design, qualitative data was gleaned through a semi-structured interview run with 10 informants from the Mobile Assisted Vocabulary Learning (MAVL) group. The eight-session MAVL intervention included the intentional instruction of commonly used idioms in media context through the different affordances of WhatsApp. Findings from paired t-tests and independent t-tests, processed with SPSS, revealed higher lexical performance among the WhatsApp group. Drawing on reflexive thematic analysis (RTA), qualitative data displayed the participants' favorable attitudes toward the app's potential to improve word recall and cultivate a spirit of collaboration. The theoretical and pedagogical implications of these findings offer clarity for educators in their ongoing search for cutting-edge vocabulary teaching approaches. This research demonstrates that using the in-built functionalities of WhatsApp (e.g., multiple input types, chat groups) in intentional vocabulary learning significantly enhances students' word knowledge and word retention and increases their collaboration. The findings provide valuable guidance for EFL teachers, curriculum developers, and policymakers on how to effectively integrate MAVL into the EFL classroom. The study's results are important because they show a practical and engaging way to improve student learning vocabulary which is often an underexposed skill in the EFL classroom. By harnessing modern technology, educators can help more students develop not only lexical knowledge but also 21st century skills such as collaoration.

descriptionView Paper arrow_downwardDownload

Differences between spoken and written English: The case of the predicative prepositional phrases in the ICE-GB (abstract)

by Antonio Vicente Casas Pedrosa

2025, AELINCO 2015. Book of Abstracts. 7th Conference on Corpus Linguistics

This paper is aimed at describing the main differences between spoken and written English. More specifically, attention is paid to the different examples which are classified as predicative Prepositional Phrases (PPs) in the International... more

This paper is aimed at describing the main differences between spoken and written English. More specifically, attention is paid to the different examples which are classified as predicative Prepositional Phrases (PPs) in the International Corpus of English-Great Britain (ICE-GB) and their frequency in spoken and written texts. These units can be defined as those phrases which are introduced by a preposition and
followed by a Noun Phrase (NP) acting as its complement. Furthermore, they perform the function of Subject Complement (Cs) at clause level. Such is the case of “She first fell in love with Will when she was eighteen, and she adores him still” (ICE‐GB:W2F‐019#47:1). Although in terms of frequency this is not the syntactic function PPs more often perform, they are taken into account because of their complexity and
due to the lack of detailed analyses. In most cases they are described as isolated examples and this phenomenon is not considered to be a very productive one. After introducing some basic notions, these structures are analyzed focusing on their presence in both spoken and oral texts within the ICE-GB. This is a one-million-word corpus which is both morphologically tagged and syntactically parsed. Moreover, it was compiled in the nineties and consists of both spoken (60%) and written material (40%).
The ICECUP (ICE Corpus Utility Program) software retrieved 3307 examples from 3223 sentences. These instances were then filtered since some of them were later classified as “noise” (in some cases the PPs were performing other functions either at phrase or at clause level and in others the element acting as the complement of the preposition was not a NP). For these reasons the final subcorpus consists of 1332 examples.
67.49% of these instances (899) are found in oral texts whereas 32.51% of them (433) belong to written texts. All these examples have been classified into different groups and subgroups corresponding to the different text categories available in this corpus (Nelson, Wallis and Aarts, 2002: 307-8). The results are presented in charts by means of both figures and percentages and different conclusions are later drawn based on the analysis of these charts.
Thus, for example, it can be noticed that, although it was expected that the amount of structures under study would be higher in spoken than in written texts because of the structure of the corpus itself, the relative frequency (which takes into account the relationship between the number of examples and the number of words) proves so, too: 0.1410% in spoken texts as opposed to 0.1022% in written texts, with an average of 0.1255% in the whole corpus. Moreover, there are more examples in dialogues (581) than in monologues (318) and in printed texts (332) than in non-printed ones (101).
This information proves especially relevant for non-native speakers of English since it allows them to become aware of the differences between speaking and writing. According to the evidence, some units are used more often in spoken language than in written English. Therefore, when producing any kind of text, students will feel more confident for they will be able to choose the appropriate structures bearing in mind these issues.

descriptionView Paper arrow_downwardDownload

A Corpus-Based Study on the Syntactic Behaviour of German Particle Verbs

by Stefan Bot

2025

Particle Verbs (PVs) are a very frequent and productive word class in German. They can occur in different syntactic paradigms. In verb-first and verb-second clauses which do not contain auxiliary verbs they occur syntactically separated.... more

descriptionView Paper arrow_downwardDownload

Features of Compositionality in English and German Noun-Noun-Compounds

by Stefan Bot

2025

Noun-noun compounds are complex words with two simplex nouns as constituents. In English and German, the first constituent represents the modifier of the compound, and the second constituent represents the head. A compound may have... more

descriptionView Paper arrow_downwardDownload

Fixed Similes: Measuring aspects of the relation between MWE idiomatic semantics and syntactic flexibility

by Stella Markantonatou

2025

We shed light on aspects of the relation between the semantics and the syntactic flexibility of multiword expressions by investigating fixed adjective similes (FS), a predicative multiword expression class not studied in this respect... more

descriptionView Paper arrow_downwardDownload

The role of multiword sequences in fluent speech* The case of listener-based judgment in L2 argumentative speech

by Kotaro Takizawa

2025, Studies in Second Language Acquisition

This study explored how second language (L2) speakers' use of multiword sequences in speech predicted perceived fluency ratings while controlling for their utterance fluency. A total of 102 Japanese speakers of English delivered an... more

descriptionView Paper arrow_downwardDownload

Extracting Verbal Multiword Data from Rich Treebank Annotation

by Jan Hajič

2025

The PARSEME Shared Task on automatic identification of verbal multiword expressions aims at identifying such expressions in running texts. Typology of verbal multiword expressions, very detailed annotation guidelines and gold-standard... more

descriptionView Paper arrow_downwardDownload

Combinazioni di parole che costituiscono entrata. Fenomeni, rappresentazione lessicografica e aspetti lessicologici

by Valentina PIUNNO

2025, Studi e Saggi Linguistici

Although the interest of literature in word combinations has significantly increased over the last decades, the full classification of their types and comprehensive collection of their forms is far from complete and flawless. This paper... more

descriptionView Paper arrow_downwardDownload

Support verbs that are not verbs

by Eric G . C . Laporte

2025, Language Sciences

In support verb constructions (SVC), as 'have poise', the support verb is explicitly assumed to be a verb, here 'have'. However, during the last 50 years, the notion of SVC has been extended to a large range of new cases. With this new... more

descriptionView Paper arrow_downwardDownload

IL CONTRIBUTO DEL LATINO ALL’INTERROGATIVE-INDEFINITE PUZZLE: FORME E FUNZIONI DEL RADICALE INDEFINITO/ INTERROGATIVO

by Francesca Pagliara

2025, CLASSICAL LANGUAGES AND LINGUISTICS LENGUAS CLÁSICAS Y LINGÜÍSTICA

This paper aims to analyze the structure and meanings of the Latin indefinite pronouns that can be traced back to the Indo-European root *kw-e-/*kw-i. All of them are morphologically derivational forms: this property is supported by... more

descriptionView Paper arrow_downwardDownload

INVESTIGATING SEMANTIC ERRORS IN ENGLISH TO INDONESIAN TRANSLATIONS: A CASE STUDY OF DEEPL TRANSLATOR

by Sahmiral Amri Rajagukguk and

2025, How to Cite (APA7): Guk Guk, S. A. R., Pratiwi, A. S., & Batubara, A. A. H. (2025). Investigating Semantic Errors in English to Indonesian Translations: A Case Study of DeepL Translator. LINGUISTICA, 14(2). https://doi.org/10.24114/jalu.v14i2.65047

This study focuses on investigating semantic errors in English to Indonesian translation using DeepL Translate, with the aim of evaluating the extent of semantic accuracy of this translation tool. This study uses a qualitative approach... more

descriptionView Paper arrow_downwardDownload

Multiword Expression Identification with Tree Substitution Grammars: A Parsing tour de force with French

by Christopher D Manning

2025, HAL (Le Centre pour la Communication Scientifique Directe)

HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or... more

descriptionView Paper arrow_downwardDownload

Where users search for Italian meanings online: An eye-tracking study. ‘Digital native’ dictionaries and ‘combinations of words’

by Annalisa Greco and

2025, Lexicography and Semantics: Book of Abstracts of the XXI EURALEX International Congress, 8–12 October 2024, Cavtat, Croatia

The objective of this study is to investigate how learners of Italian as a second or foreign language search for new meanings in online Italian dictionaries. Using eye-tracking technology, we carried out experiments inviting users to do... more

descriptionView Paper arrow_downwardDownload

VMWE identification with models trained on GUD (a UDv.2 treebank of Standard Modern Greek)

by Stavros Bompolas

2025, Proceedings of the 21st Workshop on Multiword Expressions (MWE 2025)

UD_Greek-GUD (GUD) is the most recent Universal Dependencies (UD) treebank for Standard Modern Greek (SMG) and the first SMG UD treebank to annotate Verbal Multiword Expressions (VMWEs). GUD contains material from fiction texts and... more

descriptionView Paper arrow_downwardDownload

Hungarian Corpus of Light Verb Constructions

by Janos Csirik

2025, International Conference on Computational Linguistics

The precise identification of light verb constructions is crucial for the successful functioning of several NLP applications. In order to facilitate the development of an algorithm that is capable of recognizing them, a manually annotated... more

descriptionView Paper arrow_downwardDownload

Szeged Corpus 2.5: Morphological Modifications in a Manually POS-tagged Hungarian Corpus

by Janos Csirik

2025

The Szeged Corpus is the largest manually annotated database containing the possible morphological analyses and lemmas for each word form. In this work, we present its latest version, Szeged Corpus 2.5, in which the new harmonized... more

descriptionView Paper arrow_downwardDownload

Hungarian corpus of light verb constructions

by Janos Csirik

2025

The precise identification of light verb constructions is crucial for the successful functioning of several NLP applications. In order to facilitate the development of an algorithm that is capable of recognizing them, a manually annotated... more

descriptionView Paper arrow_downwardDownload

Representation and parsing of multiword expressions: Current trends

by Helge Dyvik

2025

In this introductory chapter, we first present the topic and context of this volume. We then summarize its contributions, which have been collected through an open call for submissions and a peer-reviewing process.

descriptionView Paper arrow_downwardDownload

Quantitative determinants of prefabs: A corpus-based, experimental study of multiword units in the lexicon

by Clay Beckner

2025

for my studies, since I could not pursue empirical research without their help. I am grateful to all of my committee members, Jill Morford, Joan Bybee, Bill Croft, and Andy Wedel, for their mentorship. My research is better due to the... more

descriptionView Paper arrow_downwardDownload

О ВЛИЯНИИ ГЕНЕТИЧЕСКОГО РОДСТВА ЯЗЫКОВ НА КАЧЕСТВО МЕЖЪЯЗЫКОВОГО ПЕРЕНОСА

by Alexandra Baiuk

2025

Аннотация: Данная статья посвящена анализу результатов zero-shot межъязыкового переноса автоматической лингвистической разметки в стандарте CoBaLD с русского языка на близкородственные и неродственные языки. Исследование показывает, что... more

descriptionView Paper arrow_downwardDownload

IDIOME IM DAF-UNTERRICHT Ergebnisse einer Frequenzuntersuchung für das Projekt PhraseoLab

by Anna Sulikowska

2025

Korpusgesteuerte und korpusbasierte Untersuchungen führen überzeugend vor Augen, dass die Sprache zu einem viel stärkeren Grad aus konventionalisierten Mehrworteinheiten besteht als früher angenommen wurde. Daraus ergibt sich das... more

descriptionView Paper arrow_downwardDownload

The Effects of a Usage-driven Feedback Approach on Students' Use of Functional Lexical Chunks

by Kenn Arcenal

2025, Korea Journal of English Language and Linguistics

There has been a consensus among language researchers regarding the apparent advantages of learning lexical chunks. Conventional pedagogies (e.g., memorizing, drilling, input flooding, typographic enhancement) have been utilized in... more

descriptionView Paper arrow_downwardDownload

Constructing an Old English WordNet: The Case of Guilt

by Javier E . Díaz-Vera and

2025, La Memoria Digitale: Forme del Testo e Organizzazione della Conoscenza

In this paper, we look at the manual construction of a lexicon of emotion terms in Old English organised as a wordnet lexicon and based on a pre-existing dataset which categorises emotion terms on the basis of cognitive criteria. This is... more

descriptionView Paper arrow_downwardDownload

Representation and parsing of multiword expressions: Current trends

by Nasredine Semmar

2025

In this introductory chapter, we first present the topic and context of this volume. We then summarize its contributions, which have been collected through an open call for submissions and a peer-reviewing process.

descriptionView Paper arrow_downwardDownload

Automatic Construction of a MultiWord Expressions Bilingual Lexicon: A Statistical Machine Translation Evaluation Perspective

by Nasredine Semmar

2025

Identifying and translating MultiWord Expressions (MWES) in a text represent a key issue for numerous applications of Natural Language Processing (NLP), especially for Machine Translation (MT). In this paper, we present a method aiming to... more

descriptionView Paper arrow_downwardDownload

Identifying bilingual Multi-Word Expressions for Statistical Machine Translation

by Nasredine Semmar

2025

MultiWord Expressions (MWEs) repesent a key issue for numerous applications in Natural Language Processing (NLP) especially for Machine Translation (MT). In this paper, we describe a strategy for detecting translation pairs of MWEs in a... more

descriptionView Paper arrow_downwardDownload

A New Approach for Idiom Identification Using Meanings and the Web

by Rakesh Verma

2025

There is a great deal of knowledge available on the Web, which represents a great opportunity for automatic, intelligent text processing and understanding, but the major problems are finding the legitimate sources of information and the... more

descriptionView Paper arrow_downwardDownload

Leaving No Stone Unturned When Identifying and Classifying Verbal Multiword Expressions in the Romanian Wordnet

by Maria Mitrofan

2025

We present here the enhancement of the Romanian wordnet with a new type of information, very useful in language processing, namely types of verbal multi-word expressions. All verb literals made of two or more words are attached a label... more

descriptionView Paper arrow_downwardDownload

Предложните изрази в българския език. (Деадвербиални комитативни предложни изрази)

by Laska Laskova

2024, Предложните изрази в българския език

Рецензенти проф. д-р Йовка Тишева доц. д-р Атанас Атанасов СЪДЪРЖАНИЕ Използвани знаци и съкращения / 9 Въведение / 11 Глава I Българската лингвистика за предложните изрази / 15 84 Приема се, че е възможно съществуването и на хибридни... more

descriptionView Paper arrow_downwardDownload

MWE-Finder: Querying for multiword expressions in large Dutch text corpora

by Martin Kroon

2024, Multiword expressions in lexical resources: Linguistic, lexicographic, and computational perspectives

We present MWE-Finder, an application that enables a user to search for multiword expressions (MWEs) in large Dutch text corpora. Components of many MWEs in Dutch can occur in multiple forms, need not be adjacent, and can occur in... more

descriptionView Paper arrow_downwardDownload

MWE-Finder: a Demonstration

by Martin Kroon

2024, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

This paper introduces and demonstrates MWE-Finder, an application to search for flexible multiword expressions (MWEs) in Dutch text corpora, starting from an example. If the example is in canonical form, the application automatically... more

descriptionView Paper arrow_downwardDownload

A Canonical Form for Flexible Multiword Expressions

by Martin Kroon

2024, Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

This paper proposes a canonical form for Multiword Expressions (MWEs), in particular for the Dutch language. The canonical form can be enriched with all kinds of annotations that can be used to describe the properties of the MWE and its... more

descriptionView Paper arrow_downwardDownload

MWE-Finder: An evaluation through three case studies

by Martin Kroon

2024, Selected papers from the CLARIN Annual Conference 2023

In this paper we showcase and evaluate MWE-Finder, a system that allows users to search for occurrences of an MWE in a large Dutch text corpus. To this end, we conduct three small case studies, and discuss the results in detail. We make... more

descriptionView Paper arrow_downwardDownload

Looking Behind the Scenes of Syntactic Dependency Corpus Annotation: Towards a Motivated Annotation Schema of Surface-Syntax in Spanish

by LEO WANNER

2024

Over the last decade, the prominence of statistical NLP applications that use syntactic rather than only word-based shallow clues increased very significantly. This prominence triggered the creation of large scale treebanks, i.e., corpora... more

descriptionView Paper arrow_downwardDownload

Writing assistants and automatic lexical error correction: word combinatorics

by LEO WANNER

2024

Genuine lexical writing assistants that attempt to detect lexical errors such as miscollocations are traditionally less common in Computer Assisted Language Learning than spell and grammar checkers. However, there is empirical evidence of... more

descriptionView Paper arrow_downwardDownload

Writing assistants and automatic lexical error correction: word combinatorics

by LEO WANNER

2024

Genuine lexical writing assistants that attempt to detect lexical errors such as miscollocations are traditionally less common in Computer Assisted Language Learning than spell and grammar checkers. However, there is empirical evidence of... more

The red line indicates the percentage of support verbs (SVs) that are available for a given base (noun) in the set of suggestions offered to the user, depending on the size of the set. It shows that nearly all SVs are contained in the first eight correction suggestions. The blue line indicates the percentage of SVs in the set of correction suggestions, again depending on the size of the set. It shows that in our experiment on French the first suggested collocate was indeed a support verb for 73% of the bases considered. metrics.? According to the MRR of the top five suggestions for the 673 nominal bases we analyzed, Z-score and the product of this association measure with frequency lead to the best results: 0.87 and 0.88 of MRR. Both measures are superior to simple frequency, which seems to be used, e.g., by the MUST collocation checker !° for ranking their correction suggestions, because they give less weight to very frequent verbs (avoir, étre, faire). They are comparable to the performance of the ranking metrics used in the Just The Word (jtw) collocation checker.!!

Figure 2: Interactive Language Toolbox: automatic collocation error detection and correction reprendre reprendre reprendre reprendre reprendre reprendre reprendre des forces ses forces quelques forces les forces mes forces leurs forces vos forces

Figure 5 shows the correction suggestions provided for the erroneous collocatior tomar [un] paseo, lit. take [a] walk’.

Esta colocaci6n es incorrecta, te mostramos algunas sugerencias de correccién:

descriptionView Paper arrow_downwardDownload

GUD: a new Modern Greek treebank enriched with VMWE annotations

by Stavros Bompolas

2024, 3rd UniDive Workshop in Budapest

We report on UD_Greek-GUD (henceforth GUD), the most recent Universal Dependencies (UD) treebank of Standard Modern Greek (SMG). GUD adheres to UD.v2 (de Marneffe et al., 2021) and is the first SMG UD treebank to annotate Verbal Multiword... more

descriptionView Paper arrow_downwardDownload

Is Old French tougher to parse?

by Sophie Prévost

2024

Medieval French is known to be relatively hard to parse, with several possible sources of confusion for automatic parsers, among which its flexible word order and its graphical and syntactic variation, both synchronically and... more

descriptionView Paper arrow_downwardDownload

Language resources for Italian: towards the development of a corpus of annotated Italian multiword expressions

by Ruslan Mitkov

2024, Accademia University Press eBooks

descriptionView Paper arrow_downwardDownload

Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions

by Ruslan Mitkov

2024, arXiv (Cornell University)

We introduce a new method to tag Multiword Expressions (MWEs) using a linguistically interpretable language-independent deep learning architecture. We specifically target discontinuity, an under-explored aspect that poses a significant... more

descriptionView Paper arrow_downwardDownload

Evaluation of machine translation systems and related procedures

by Musatafa Albadr

2024, ARPN journal of engineering and applied sciences

Currently, the high volume of international information exchange involves a wide range of localities. As each locality comes with its own distinctive dialect, the need for an effective means of language translation is becoming more and... more

descriptionView Paper arrow_downwardDownload

Automatically Assessing Whether a Text Is Cliched, with Applications to Literary Analysis

by Graeme Hirst

2024, North American Chapter of the Association for Computational Linguistics

Clichés, as trite expressions, are predominantly multiword expressions, but not all MWEs are clichés. We conduct a preliminary examination of the problem of determining how clichéd a text is, taken as a whole, by comparing it to a... more

descriptionView Paper arrow_downwardDownload

Terms Specification and Extraction within a Linguistic-based Intranet Service

by Elisabeth Maier

2024

This paper describes the adaptation and extension of an existing morphological system and its integration into an intranet service of a large international bank. The system includes a tool for the analysis and extraction of simple and... more

descriptionView Paper arrow_downwardDownload

Publishing a Quality Context-aware Annotated Corpus and Lexicon for Harassment Research

by Krishnaprasad Thirunarayan

2024, ArXiv

Having a quality annotated corpus is essential especially for applied research. Despite the recent focus of Web science community on researching about cyberbullying, the community dose not still have standard benchmarks. In this paper, we... more

descriptionView Paper arrow_downwardDownload

Agere e i nomi dell’azione scenica

by Francesca Pagliara

2024, Verbi supporto, Fenomeni e teorie

This paper describes the combinatorial properties of agere in association with nouns that designate a scenic activity. I therefore examine combinations such as fabulam, tragoediam, comoediam, partes, gestum, personam agere. The aim of the... more

descriptionView Paper arrow_downwardDownload

TEACHING PERSIAN COMPLEX PREDICATES FROM A PEDAGOGICAL CONSTRUCTION GRAMMAR STANCE

by Maryam Pakzadian

2024

This paper addresses Persian Complex Predicates (CPs) from an Applied/Pedagogical Construction Grammar (PCxG) stance. PCxG is an approach to foreign language pedagogy that emphasises the importance of constructions (form-meaning... more

descriptionView Paper arrow_downwardDownload

Language resources for Italian: towards the development of a corpus of annotated Italian multiword expressions

by Manuela Cherchi

2024, Accademia University Press eBooks

descriptionView Paper arrow_downwardDownload

Multiword Expressions

Related Topics