KNOW2: Language understanding technologies for multilingual domain-oriented information access
Procesamiento de Lenguaje Natural, Sep 30, 2010
Abstract: The goal of the project is to explore integrated environments allowing the cost-effecti... more Abstract: The goal of the project is to explore integrated environments allowing the cost-effective deployment of vertical information access portals for specific domains. The project started in January 2010, and will last three years. Keywords: Natural Language Processing, Syntactic Analysis, Semantic Interpretation, Knowledge Acquisition, Information Extraction, Information Retrieval
Uploads
Papers by Jordi Turmo
----
In this work we focus on the automatic acquisition of verbal classifications for Spanish. To do so, we perform a series of experiments with 20 verbal senses that belong to the Sensem corpus. We use di↵erent kinds of features that include diverse linguistic information and an agglomerative hierarchical clustering method to generate a number of classifications. We compare each of these automatic classifications with a semi-automatically created gold standard, which is built on the basis of linguistic constructions proposed by theoretical linguistics. This comparison allows us to investigate which features are adequate to build a verb classification coherent with linguistic constructions theory and which are the similarities and di↵erences between an automatic verbal classification and a verb classification based on the theory of linguistic constructions.