Local Grammar Graphs

description12 papers

group1 follower

lightbulbAbout this topic

Local Grammar Graphs are structured representations that capture the syntactic and semantic relationships within a specific linguistic context. They facilitate the analysis of language by modeling the rules and patterns governing the formation of phrases and sentences, enabling a detailed examination of grammatical structures in localized settings.

lightbulbAbout this topic

Key research themes

1. How can local grammar graphs (LGGs) be utilized for practical natural language understanding and data generation in domain-specific systems?

This theme focuses on the development and application of local grammar graphs as robust linguistic resources that capture lexico-syntactic patterns for diverse, domain-specific natural language understanding (NLU) tasks. The importance lies in their ability to generate large-scale, high-quality labeled datasets automatically, which address the scarcity and privacy concerns of authentic user data, and facilitate training effective machine learning models for conversational AI in complex domains like law, finance, and customer service.

Towards a theory of syntactic workspaces: neighbourhoods and distances in a lexicalised grammar

by Diego Gabriel Krivochen

2023, The Linguistic Review

Key finding: Introduces a graph-theoretic formalization for syntactic workspaces that models local regions of syntactic operations as directed graphs. This foundational approach informs how syntactic structure manipulation can be... Read more

articleView Paper downloadDownload

Generating Training Datasets for Legal Chatbots in Korean

by Changhoe Hwang and

2023, International conference on Law and Society

Key finding: Demonstrates the use of local grammar graphs to capture and generalize legal vocabulary and local syntax, enabling the generation of 700 million labeled utterances for training a DIET classifier in Korean legal chatbot NLU.... Read more

articleView Paper downloadDownload

Building Korean linguistic resource for NLU data generation of banking app CS dialog system

by Eric G . C . Laporte

2023, COLING Workshop on Pattern-based Approaches to NLP in the Age of Deep Learning (Pan-DL)

Key finding: Develops a modular linguistic resource named FIAD based on local grammar graphs capturing three core linguistic components (TOPIC, EVENT, DISCOURSE MARKER) derived from banking app review corpora. FIAD enables generation of... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. How do graph-theoretic and topological frameworks advance linguistic theory by modeling syntax and grammar structures as graphs?

This research area addresses theoretical syntactic modeling by leveraging graph theory, topology, and formal grammar graphs to represent syntactic dependencies, workspace operations, and morphological processes. It is significant because it offers precise mathematical characterizations of syntactic derivations and grammar structures, allows new computational interpretations of movement and locality, supports morphological-syntactic integration, and offers a unifying formalism beyond string-based representations.

Towards a theory of syntactic workspaces: neighbourhoods and distances in a lexicalised grammar

by Diego Gabriel Krivochen

2023, The Linguistic Review

Key finding: Proposes a topological and graph-theoretic formalization of the syntactic workspace within minimalist syntax as local directed graphs. This approach explicitizes how syntactic operations affect local regions of derivations,... Read more

articleView Paper downloadDownload

Graphs of Generative Grammars

by Benedek Nagy

2021

Key finding: Introduces the concept of the graph of a generative grammar (Γ), a specialized and-or graph that extends classical dependency graphs and finite automata to represent production rules of arbitrary generative grammars,... Read more

articleView Paper downloadDownload

Graph representation of context-free grammars

by Alex Shkotin

2013, Computing Research Repository

Key finding: Presents a systematic method to transform context-free grammars into Directed Marked Graphs (DMGs) annotated with typed nodes representing AND/OR nonterminals and terminals. This representation facilitates grammatical... Read more

articleView Paper downloadDownload

Graph Grammars and Operations on Graphs

by Jan Joris Vereijken

2024

Key finding: Develops a formal framework to interpret string languages as graph languages using typed i/o-hypergraphs and sequential graph composition respecting input/output interfaces. This formalism relates existing string grammar... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. What computational approaches enable unsupervised or weakly supervised learning of construction grammars integrating multi-level, multi-length linguistic patterns?

This theme explores algorithms and computational modeling for induction of construction grammars from corpus data without requiring strong innate linguistic constraints. Emphasis is placed on learning flexible units that generalize across mixed representations ranging from item-specific to schematic forms, including recursive and discontinuous structures. Understanding these learning mechanisms is critical for data-driven grammar acquisition, linguistic typology, and modeling language evolution.

Computational Learning of Construction Grammars

by Jonathan Dunn

2016, Language & Cognition

Key finding: Presents an algorithm inducing construction grammars from large corpora by identifying minimal sets of multi-length, multi-level schematic and item-specific constructions, including recursive and discontinuous patterns, based... Read more

articleView Paper downloadDownload

Chromatic transitions in the emergence of syntax networks

by Ricard SOLÉ

2022, Royal Society Open Science

Key finding: Employs graph-theoretic concepts, specifically the chromatic number (minimal coloring), to analyze syntactic network evolution during child language acquisition and detect phase transitions from simple two-word structures to... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Local Grammar Graphs

Một Vài So Sánh Về Ngữ Nghĩa Từ Đi Trong Tiếng Việt Với Từ가다 Trong Tiếng Hàn

by Phương Phương

2023, Tạp chí Khoa học

Trong ngôn ngữ, chuyển nghĩa là một trong những cách vừa tiện lợi vừa tiết kiệm để phát triển nghĩa của từ. Kết quả của hiện tượng chuyển nghĩa sẽ tạo ra từ đa nghĩa. Từ nghĩa gốc ban đầu của một từ, người ta sẽ dựa vào những mối liên hệ... more

descriptionView Paper arrow_downwardDownload

DECO-MWE: Building a Linguistic Resource of Korean Multiword Expressions for Feature-Based Sentiment Analysis

by Changhoe Hwang

2023

This paper aims to construct a linguistic resource of Korean Multiword Expressions for Feature-Based Sentiment Analysis (FBSA): DECO-MWE. Dealing with multiword expressions (MWEs) has been a critical issue in FBSA since many constructs... more

descriptionView Paper arrow_downwardDownload

Building Korean linguistic resource for NLU data generation of banking app CS dialog system

by Jeongwoo Yoon

2023, HAL (Le Centre pour la Communication Scientifique Directe)

Natural language understanding (NLU) is integral to task-oriented dialog systems, but demands a considerable amount of annotated training data to increase the coverage of diverse utterances. In this study, we report the construction of a... more

descriptionView Paper arrow_downwardDownload

DECO-MWE: Building a Linguistic Resource of Korean Multiword Expressions for Feature-Based Sentiment Analysis

by Eric G . C . Laporte

2023

descriptionView Paper arrow_downwardDownload

Generating Training Datasets for Legal Chatbots in Korean

by Changhoe Hwang and

2023, International conference on Law and Society

Chatbots are robots that can communicate with humans using text or voice signals. Legal chatbots improve access to justice, since legal representation and legal advice by lawyers come with a high cost that excludes disadvantaged and... more

descriptionView Paper arrow_downwardDownload

English Lexical Loanwords in Indonesian: Exploring in Tourism Magazine

by Tatu Rohbiah

2023, Journal of English Language Teaching and Literature (JELTL)

The aim of this research is to know English lexical loanwords into Indonesian languages in tourism magazine. In this research, the writer uses descriptive qualitative method where she describes the corpus of English lexical loanwords... more

descriptionView Paper arrow_downwardDownload

Interpersonal Relations in Biographical Dictionaries. A Case Study

by Matthias Reinert

2023

Adopting the concept of “Local Grammars” (M. Gross), which were successfully applied in practice by (Geierhos, 2010) to biographical information extraction in English our project aims to detect, encode, and finally visualize relations... more

descriptionView Paper arrow_downwardDownload

Building Korean linguistic resource for NLU data generation of banking app CS dialog system

by Eric G . C . Laporte

2023, COLING Workshop on Pattern-based Approaches to NLP in the Age of Deep Learning (Pan-DL)

descriptionView Paper arrow_downwardDownload

SSP-based construction of evaluation-annotated data for fine-grained aspect-based sentiment analysis

by Eric G . C . Laporte

2023, COLING Workshop on Pattern-based Approaches to NLP in the Age of Deep Learning (Pan-DL)

We report the construction of a Korean evaluation-annotated corpus, hereafter called 'Evaluation Annotated Dataset (EVAD)', and its use in Aspect-Based Sentiment Analysis (ABSA) extended in order to cover e-commerce reviews containing... more

descriptionView Paper arrow_downwardDownload

A semi-automatic method for constructing MUSE sentiment-annotated corpora

by Eric G . C . Laporte

2022, International Conference on Asian Linguistics (ICAL)

This study describes a methodology we adopted for constructing Multilingual Sentiment-Annotated Corpora (named MUSE), that consist of two types of annotated corpora: Sentence-based Sentiment-Annotated Corpora (MUSE-SESAC) and Token-based... more

descriptionView Paper arrow_downwardDownload

Finite-State Descriptions of Various Levels of Linguistic Phenomena

by Max Silberztein

2022

Finite State Automata (FSA) and their variants are natural tools adapted to the description of various linguistic phenomena which must be dealt with at various points in different types of automatic processing of texts written in Natural... more

descriptionView Paper arrow_downwardDownload

Variable Unification in NooJ v3

by Max Silberztein

2022

NooJ's linguistic engine integrates all its parsers (from the lexical to the syntactic level) with its morphological and paraphrase generators. In particular, both NooJ's syntactic parser and NooJ's transformational generator... more

descriptionView Paper arrow_downwardDownload

Can we parse without tagging?

by Cedrick Fairon

2022, Proceedings of the Language & …

Syntactic parsing is a major area of NLP which has been widely studied with the help of many approaches. Usually, parsers take in input tagged texts, that is to say texts whose lexical units have been annotated with informations such as... more

descriptionView Paper arrow_downwardDownload

Arabic named entity extraction: A local grammar-based approach

by Hayssam Traboulsi

2022, 2009 International Multiconference on Computer Science and Information Technology

descriptionView Paper arrow_downwardDownload

PEAS, the first instantiation of a comparative framework for evaluating parsers of French

by A. Vilnat

2022, Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - EACL '03

This paper presents PEAS, the first comparative evaluation framework for parsers of French whose annotation formalism allows the annotation of both constituents and functional relations. A test corpus containing an assortment of different... more

descriptionView Paper arrow_downwardDownload

Easy, evaluation of parsers of french: what are the results

by A. Vilnat

2022, Proceedings of the 6th …

This paper presents EASY, which has been the first campaign evaluating syntactic parsers on all the common syntactic phenomena and a large set of dependency relations. During this campaign, an annotation scheme has been elaborated with... more

descriptionView Paper arrow_downwardDownload

Large scale production of syntactic annotations to move forward

by A. Vilnat

2022, Coling 2008: Proceedings of the workshop on Cross-Framework and Cross-Domain Parser Evaluation - CrossParser '08

This article presents the methodology of the PASSAGE project, aiming at syntactically annotating large corpora by composing annotations. It introduces the annotation format and the syntactic annotation specifications. It describes an... more

descriptionView Paper arrow_downwardDownload

Generating a Resource for Products and Brandnames Recognition. Application to the Cosmetic Domain

by Luca Dini

2022

Named Entity Recognition task needs high-quality and large-scale resources. In this paper, we present RENCO, a based-rules system focused on the recognition of entities in the Cosmetic domain (brandnames, product names, â¦). RENCO has... more

descriptionView Paper arrow_downwardDownload

Interpersonal Relations in Biographical Dictionaries. A Case Study

by Matthias Reinert

2022

Figure 1: Example of a simple bootstrap graph detecting place names

Figure 3: Masking pre-tagged text and entities, using TOKEN-loops with ![ ]|-negative context mentioned in the text along with profession or position in life, their birth and death dates and references to the printed volumes. All in all the core data base consists of 92.000 in- dividuals and several hundred families. Almost each entry has been aligned with or added to the bibliographic author- ity file Gemeinsame Normdatei (GND).

Figure 4: A local grammar describing the post positioned arguments of studieren/to study, boxes after prepositions branch into subgraphs

Figure 7: The relations of Philipp Jonkheer von Siebold and three relatives, manually expanded.°

The matches and errors were counted as follows: Table 1: Assertion of errors to precision and recall

descriptionView Paper arrow_downwardDownload

A Rough Set Formalization of Quantitative Evaluation with Ambiguity

by Patrick Paroubek

2022

In this paper, we present the founding elements of a formal model of the evaluation paradigm in natural language processing. We propose an abstract model of objective quantitative evaluation based on rough sets, as well as the notion of... more

descriptionView Paper arrow_downwardDownload

Arabic Named Entity Extraction: A Local Grammar-Based Approach

by Hayssam Traboulsi

2022

descriptionView Paper arrow_downwardDownload

Constraint-based parsing as an efficient solution: Results from the parsing evaluation campaign easy

by Philippe Blache

2021

This paper describes the unfolding of the EASy evaluation campaign for french parsers as well as the techniques employed for the participation of laboratory LPL to this campaign. Three symbolic parsers based on a same resource and a same... more

The EASY project (see http://www.elda.org/easy) aims at the evaluation of parsers for French. It proposes an evalu- ation methodology making it possible to compare syntactic analyzers and, as a side effect, produce a large validated linguistic resource by combining automatically the results of the campaign. Figure 1: EASy campaign and three LPL parsers based on PGs

Deep parsing is globally better than shallow parsing, even for this evaluation framework which needs flat non hierar- chical structures. Several conclusions come with this data:

Figure 4: Strict and fuzzy scores for LPL-3 Figure 3: Strict and fuzzy scores for LPL-2

Figure 2: Strict and fuzzy scores for LPL-1

—H- general —O— literature -O— email —A-medical —<- oral —-G— questions)

The parsers show different results corresponding with what we could foresee: the shallow parser (LPL2) is globally less effi cient than the two others. Mean f-scores for LPL1, LPL2 and LPL3 are respectively 84.8, 79.3 and 81 showing a clear correlation with the techniques and the strategies. More precisely, differences in parsing and determinization techniques can significantly explain the different scores. Even if grammar and lexicon were the same for the three parsers, their impact should not be forgotten: for exam- ple, deep parsing techniques can overcome tagging errors, which is not the case for the shallow parser. In our experi- ments, the pos-tagger performance was less than 90%: im- proving the pos-tagger will obviously improve the parsers. One interesting result is that there is a good stability for the three parsers of the results from one corpus type to another: only 5 points separate the literary and general corpora from the oral and mail ones. This is a clear indication of the ro- bustness of the approach. With thace recnlte we wranld lke tn make cnme ramarkc

descriptionView Paper arrow_downwardDownload

A French Corpus Annotated for Multiword Nouns Éric Laporte

by Stavroula Voyatzi

2021

This paper presents a French corpus annotated for multiword nouns. This corpus is designed for investigation in information retrieval and extraction, as well as in deep and shallow syntactic parsing. We delimit which kind of multiword... more

descriptionView Paper arrow_downwardDownload

A French Corpus Annotated for Multiword Expressions with Adverbial Function

by Stavroula Voyatzi

2021

This paper presents a French corpus annotated for multiword expressions (MWEs) with adverbial function. This corpus is designed for investigation on information retrieval and extraction, as well as on deep and shallow syntactic parsing.... more

descriptionView Paper arrow_downwardDownload

A French Corpus Annotated for Multiword Nouns

by Stavroula Voyatzi

2021

descriptionView Paper arrow_downwardDownload

A Computational Lexicon of Portuguese for Automatic Text Parsing

by Elisabete Ranchhod

2021

Using standard methods and formats established at LADL, and adopted by several European research teams to construct largecoverage electronic dictionaries and grammars, we elaborated for Portuguese a set of lexlcal resources, that were... more

descriptionView Paper arrow_downwardDownload

Disambiguation of Proper Names Using Finite-State Local Grammars

by Elisabete Ranchhod

2021

Like common noun phrases, proper names contain ambiguous conjoined phrases that make their delimitation and classification difficult in text. This paper presents a finite-state approach to the disambiguation of Portuguese candidate proper... more

descriptionView Paper arrow_downwardDownload

Can we parse without tagging?

by Cedrick Fairon

2021, Proceedings of the Language & …

descriptionView Paper arrow_downwardDownload

Shape Analysis as an Aid for Grammar Induction

by Ife Adebara

2021

Visual shapes inherent in di↵erent aspects of language processing have been manifesting themselves as important not only for enhancing that process itself, but also for helping solve open problems in ways that are more economical and more... more

descriptionView Paper arrow_downwardDownload

DECO-MWE: Building a Linguistic Resource of Korean Multiword Expressions for Feature-Based Sentiment Analysis

by Eric G . C . Laporte

2021, LREC Workshop on Asian Language Resources

descriptionView Paper arrow_downwardDownload

Open source multi-platform NooJ for NLP

by Marko Tadić

2021

The purpose of this demo is to introduce the linguistic development tool NooJ. The tool has been in development for a number of years and it has a solid community of computational linguists developing grammars in two dozen languages... more

descriptionView Paper arrow_downwardDownload

PASSAGE: from French Parser Evaluation to Large Sized Treebank

by Patrick Paroubek

2021, Lrec

In this paper we present the PASSAGE project which aims at building automatically a French Treebank of large size by combining the output of several parsers, using the EASY annotation scheme. We present also the results of the of the... more

descriptionView Paper arrow_downwardDownload

The ongoing evaluation campaign of syntactic parsing of french: Easy

by Patrick Paroubek

2021, Proceedings of the …

This paper presents EASY (Evaluation of Analyzers of SYntax), an ongoing evaluation campaign of syntactic parsing of French, a subproject of EVALDA in the French TECHNOLANGUE program. After presenting the elaboration of the annotation... more

descriptionView Paper arrow_downwardDownload

A Local Grammar of French determiners for deep syntactic parsing

by Eric G . C . Laporte

2016, Bases de données lexicales : construction et applications

Existing syntactic grammars of natural languages, even with a far from complete coverage, are complex objects. Assessments of the quality of parts of such grammars are useful for the validation of their construction. We extended a grammar... more

descriptionView Paper arrow_downwardDownload

Disambiguation of Proper Names Using Finite-State Local Grammars

by Samuel Eleuterio and

2016

descriptionView Paper arrow_downwardDownload

A Computational Lexicon of Portuguese for Automatic Text Parsing

by Elisabete Ranchhod

2016, Proceedings of SIGLEX99: …

descriptionView Paper arrow_downwardDownload

Classification of non-analyzable word types in web documents to implement an effective Korean e-learning system

by Eric G . C . Laporte

2016, Proceedings of the International Conference 'Doing Research in Applied Linguistics'

E-learning systems should deliver contents which reflect various phenomena of the language as it is used. E-learning systems that would include real-world Korean expressions such as those in web documents, mobile text messages, or twitter... more

descriptionView Paper arrow_downwardDownload

How to find the right path?(On the morphological disambiguation of sentence in Serbian)

by Cvetana Krstev

2015

descriptionView Paper arrow_downwardDownload

Evaluation and interoperable annotations: the point of view of a parser developer

by Gil Francopoulo

2015

The present paper is written within the framework of the French ANR-Passage project that gathers ten parser developers. The main motivations of the project are to evaluate parsers for French, to test their ac- curacy and robustness on... more

descriptionView Paper arrow_downwardDownload

ON DESIGNING INTERLANGUAGE CORPUS OF INDONESIAN STUDENTS WITH ERROR ANALYSIS ANNOTATION

by pri hantoro

2015

This paper proposes a model for the design of interlanguage corpus with error analysis annotation. The data is obtained from ICNALE (Ishikawa, 2013) corpus, a corpus of English learners in Asia. I focus the extraction on the Indonesian... more

descriptionView Paper arrow_downwardDownload

MACHINE READABLE GRAMMAR FOR OPTIMIZING AUTOMATIC RETRIEVAL: A Comparison of Regular Expression and Local Grammar Graph

by pri hantoro

2015

Machine Readable Grammar (MRG) is aimed at supporting the computer to perform Natural Language Processing (NLP) tasks. As for this paper, it discusses the one of the essences of MRG, which is to perform automatic retrieval in a text... more

descriptionView Paper arrow_downwardDownload

ANNOTATION MODEL FOR LOANWORDS IN INDONESIAN CORPUS

by pri hantoro

2015

There is a considerable number for loanwords in Indonesian language as it has been, or even continuously, in contact with other languages. The contact takes place via different media; one of them is via machine readable medium. As the... more

descriptionView Paper arrow_downwardDownload

Large scale production of syntactic annotations to move forward

by Gil Francopoulo and

2015, Coling 2008: Proceedings of the workshop on Cross-Framework and Cross-Domain Parser Evaluation - CrossParser '08

descriptionView Paper arrow_downwardDownload

The PASSAGE syntactic representation

by Gil Francopoulo

2015

We present the PASSAGE syntactic representation based on syntactic relations, initially developed for French in the scope of national evaluation campaigns. After a brief presentation of the non-nested chunks and syntactic relations of... more

descriptionView Paper arrow_downwardDownload

Building a lexicon of French deverbal nouns from a semantically annotated corpus

by Antonio Balvet and

2015

The ongoing project Nomage aims at describing the aspectual properties of deverbal nouns in an empirical way. It is centered on the development of two resources: a semantically annotated corpus of deverbal nouns, and an electronic... more

descriptionView Paper arrow_downwardDownload

Arabic Named Entity Extraction: A Local Grammar-Based Approach

by Андрей Федоровский

2014

The local grammar approach was first used to discuss recursive phrases that are commonly found in specialist literature like biochemistry and then extended to extract time, date and address expressions from letters. It has recently been... more

descriptionView Paper arrow_downwardDownload

A French Corpus Annotated for Multiword Expressions with Adverbial Function

by Stavroula Voyatzi and

2013, Language Resources and Evaluation Conference (LREC). Linguistic Annotation Workshop, Marrakech : Morocco

descriptionView Paper arrow_downwardDownload

A French Corpus Annotated for Multiword Nouns

by Stavroula Voyatzi and

2013, Language Resources and Evaluation Conference. Workshop Towards a Shared Task on Multiword Expressions, Marrakech : Morocco

descriptionView Paper arrow_downwardDownload

Evaluation of a Grammar of French Determiners

by Eric G . C . Laporte

2013, Computing Research Repository

descriptionView Paper arrow_downwardDownload

Extension of a grammar of French determiners

by Eric G . C . Laporte

2013, 26th International Conference on Lexis and Grammar, Bonifacio : France

Assessments of the quality of parts of syntactic grammars of natural languages are useful for the validation of their construction. We extended a grammar of French determiners that takes the form of a recursive transition network and... more

descriptionView Paper arrow_downwardDownload

Local Grammar Graphs

Key research themes

1. How can local grammar graphs (LGGs) be utilized for practical natural language understanding and data generation in domain-specific systems?

2. How do graph-theoretic and topological frameworks advance linguistic theory by modeling syntax and grammar structures as graphs?

3. What computational approaches enable unsupervised or weakly supervised learning of construction grammars integrating multi-level, multi-length linguistic patterns?

Related Topics

All papers in Local Grammar Graphs