While a wealth of resources is available for teaching research writing of traditional IMRD research papers, instructors have little to draw on when working with graduate students in mathematics. The present study offers insight into... more
This paper presents an exploratory study on large-scale detection of idiomatic expressions in essays written by non-native speakers of English. We describe a computational search procedure for automatic detection of idiom-candidate... more
The paper addresses the issue of the syntax-lexicon distinction from a Construction Grammar perspective. In the constructionist framework, the CONSTRUCTION is the basic unit of linguistic analysis and there is no clear-cut division... more
The need to integrate songs into English Language Teaching (ELT) has been recognized on numerous occasions. Song lyrics host multi-word units which learners can reuse as building blocks in their English, thereby reducing language... more
This paper introduces PersPred, the first manually elaborated syntactic and semantic database for Persian Complex Predicates (CPs). Beside their theoretical interest, Persian CPs constitute an important challenge in Persian lexicography... more
Questo articolo è una rielaborazione di alcune parti della mia tesi dottorale (Masini 2007), diretta da Raffaele Simone. Mi è sembrato doveroso contribuire al volume in suo onore con uno scritto che riguardasse i temi che a suo tempo ci... more
Le espressioni multiparola costituiscono una parte fondamentale del nostro lessico mentale, nonché l’oggetto di studio privilegiato di discipline applicative come la linguistica computazionale, la fraseologia e la lessicografia.... more
The third volume of the “Intercontinental Dialogue on Phraseology” series is a continuation of the project, which was inspired by the symposium "Phraseology 2009" organised by Professor Katsumasa Yagi at Kwansei Gakuin University in... more
L’Università di Roma “La Sapienza” di Roma dal 2016 ha promosso il progetto CREAMY (Italo Calvino REpository for Analysis of Multilingual phraseologY) con lo scopo di analizzare tutte le traduzioni prodotte delle unità fraseologiche... more
Multi-word expressions are linguistic objects formed by two or more words that behave like a ‘unit’ by displaying formal and/or functional idiosyncratic properties with respect to free word combinations. They include an extremely varied... more
In contemporary linguistics the definition of those entities which are referred to as multiword expressions (MWEs) remains controversial. It is intuitively clear that some words, when appearing together, have some “special bond” in terms... more
This paper investigates the presence and use of basic color terms within Italian multiword expressions, in particular phrasal lexemes (e.g. libro bianco ‘white book’, vedere nero ‘be pessimistic’). The starting hypothesis is that most... more
Since the publication of the famous work by on universality and evolution of basic colour terms, several studies on colours have tried to support or reject the hypothesis that language is shaped by universals of perception. In fact,... more
Despite the growing interest of lexicological and lexicographical studies in frozen word combinations, some multiword lexemes are still partially unexplored. This paper focuses on a specific type of multiword lexical units: prepositional... more
In this thesis we describe and evaluate a tool for automatic generation of translations for multiword English terms into Spanish from a monolingual specialized Spanish corpus, compiled by means of web crawling. The resulting translations... more
In Construction Grammar, grammar is conceived as an inventory of form-function-meaning complexes of varying degrees of internal complexity and lexical fixity. These complexes range from single lexemes like the verb demur to multiword... more
This book chapter discusses the opportunities offered by internet television, its pitfalls and the strategies that teachers and learners may use to maximise its benefits to L2 vocabulary acquisition.
To some extent, we seem to use language in chunks—multiple words that are co-selected and used as gestalt units. By some estimates, these chunks constitute more than 50% of a given text (Erman & Warren, 2000). The extent to which our... more
Investigating the validity of internet television as a resource for acquiring L2 formulaic sequences
This article investigates the validity of the suggestion that internet television can facilitate EFL learners’ acquisition of formulaic sequences (FSs). A quantitative study was conducted to examine the extent to which FS use on internet... more
Multiword units (MWUs) is a term used in the current study to broadly cover what second language acquisition (SLA) researchers refer to as collocations, conventional expressions, chunks, idioms, formulaic sequences, or other such terms,... more
This investigation proposes an empirically-based description of a particular type of multiword units which have been observed during the lexicographic design of a database of Italian word combinations (CombiNet dictionary): obligatorily... more
In this work, we present the construction of multilingual parallel corpora with annotation of multiword expressions (MWEs). MWEs include verbal MWEs (vMWEs) defined in the PARSEME shared task that have a verb as the head of the studied... more
Traducción de Expresiones Multipalabra del castellano al euskera: análisis lingüístico y experimento de detección Uxoa Iñurrieta IXA NLP group, University of the Basque Country usoa.inurrieta@ehu.eus
This paper describes two machine translation tasks that require language expertise: (1) paraphrasing as a technique to prepare texts for translation and a method for linguistic quality assurance, and (2) the evaluation of translation... more
This paper provides a compositional, lexically based analysis of the infinitival, verb-headed idiom exemplified by the sentences What does this have to do with me? and It may have had something to do with money. Using conventions of... more
The present thesis aims to provide an insight from a lexicographic perspective into the problematic field of research concerned with multiword expressions, criteria according to which they are identified and classified, the identification... more
vi letting me be me and loving my sometimes dislocated and dramatic sense of humour. Whatever future choice we are presented with, I am reluctant to go there without the kind of mutual understanding, respect and plain old fun I think of... more
Phraseology is central to specialized language. In scientific and technical communication , multiword terms (MWTs) (e.g. volatile organic compound) are the most frequent type of phraseological units. Rendering them into another language... more
Conclusioni e lavori futuri A Part of speech Tagset -PAIS À B Dati sul pattern NA C Dati sul pattern AN D Dati sul pattern NPN E Dati sul pattern NPdN F Dati sul pattern NPVinf G Dati sul pattern NN H Dati sul pattern NCN I Dati sul... more
Cet article a pour but de proposer une méthodologie de description des séquences verbales figées (SVF) en utilisant des traitements automatiques afin de simplifier la tâche. La finalité étant de proposer un moteur de reconnaissance... more
Nominal compounds (NCs) are lexical units that consist of two or more elements that exist on their own, function as a noun and have a special added meaning. Here, we present the results of our experiments on how the growth of Wikipedia... more
Expressions with an aspectual variant of a light verb, e.g. take on debt vs. have debt, are frequent in texts but often difficult to classify between verbal idioms, light verb constructions or compositional phrases. We investigate the... more
This paper takes into account a particular set of Italian Complex Prepositions signaling the agent of an action (e.g. da parte di 'by', a nome di 'by'). Italian exhibits several types of complex prepositions revealing a different... more
Maurice Gross (1934-2001) was both a great linguist and a pioneer in natural language processing. This article is written in homage to his memory, taking a scientific point of view.
Este artigo tem o objetivo de discutir sobre o aspecto verbal nas Construções com Verbo-Suporte (CVS) e com extensões de verbo-suporte, focalizando a função semântica que os verbos podem ter quando exprimem aspecto. Para isso, analisamos... more
This paper describes the ongoing process of integrating approximately 3,000 predicate nouns into Port4NooJ, the Portuguese module for NooJ. The integration of these resources enables us to further extend the paraphrastic capabilities of... more
To apply the same approaches to analysing spoken and written formulaic language is problematic; to do so masks the fact that the contextual meaning of spoken formulaic language is encoded, to a large extent, in its prosody. In The Prosody... more
This research proposes a theoretical and descriptive study of a set of Italian Multiword Adverbs extracted from a corpus; the corpus-based investigation has been driven simultaneously considering the interaction of distributional and... more
____________________________________________________________________________________________________________ Cet article présente une méthodologie permettant d'extraire et de décrire des locutions verbales vis-à-vis de leur comportement... more
Semantic prosody is typically referred to as an evaluative function of certain words or multi-word items appearing within collocates of positive or negative meaning. The present study deals with the semantic prosody (context properties)... more
This paper presents our work on the refinement and improvement of the Serbian language part of Hurtlex, a multilingual lexicon of words to hurt. We pay special attention to adding Multi-word expressions that can be seen as abusive, as... more
A problem that crops up repeatedly in shallow and deep syntactic parsing approaches to South Asian languages like Urdu/Hindi is the proper treatment of complex predications. Problems for the NLP of complex predications are posed by their... more
In this work we present an empirical analysis performed on Italian nominal multiword expressions (MWEs) of the form [noun + adjective] that aims at studying quantitatively their syntactic and semantic features in order to improve their... more