The evaluation metric in generative grammar
2011
Abstract
The subject which I would like to treat in this paper is the evaluation metric in generative grammar. Why? Arguably, the evaluation metric is both the most novel and the most important concept in the development of generative grammar by Noam Chomsky. And yet it is at the same time one of the least recognized and surely most misunderstood of the core concepts of generative grammar. So there you are: the evaluation metric is critically important, it is arguably novel, it is misunderstood, and at some times and in some places it has even been reviled. What better reasons could there be for spending our time today talking about it? I would like, first, to explain the idea of the evaluation metric in early generative grammar; this will mean exploring the separate ideas of (1) a prior over the set of grammars and (2) a measure of goodness of fit to the data. Second, I will very briefly trace how those two ideas have been developed in the world of machine learning over the last few decades.
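The two ingredients named in the abstract, a prior over grammars and a measure of goodness of fit, can be given a concrete minimum-description-length (MDL) reading. The sketch below is a toy illustration, not anything from the paper: the rules, probabilities, and corpus are invented. The prior becomes the length of the grammar itself, the fit term becomes the number of bits the grammar needs to encode the data, and the evaluation metric prefers the grammar minimizing their sum.

```python
import math

# Hypothetical MDL-style sketch of an evaluation metric: score a grammar
# by the length of the grammar itself (the prior) plus the cost of
# encoding the data under it (goodness of fit). All rules and
# probabilities below are invented toy values.

def grammar_length_bits(rules):
    """Prior term: description length of the grammar, at 8 bits per char."""
    return 8 * sum(len(rule) for rule in rules)

def data_cost_bits(tokens, probs):
    """Fit term: negative log2-probability the grammar assigns the data."""
    return -sum(math.log2(probs[t]) for t in tokens)

def mdl_score(rules, tokens, probs):
    """Lower is better: total bits to describe the grammar plus the data."""
    return grammar_length_bits(rules) + data_cost_bits(tokens, probs)

# Two toy grammars sharing one probability model over the same corpus;
# the verbose grammar fits the data equally well, so it loses purely
# on the prior term.
corpus = ["the", "cat", "the", "dog"] * 25
probs = {"the": 0.5, "cat": 0.25, "dog": 0.25}
rules_compact = ["S -> Det N"]
rules_verbose = ["S -> Det N", "Det -> the", "N -> cat", "N -> dog"]

print(mdl_score(rules_compact, corpus, probs))  # 230.0
print(mdl_score(rules_verbose, corpus, probs))  # 438.0
```

In Bayesian terms, the grammar-length term plays the role of a (negative log) prior and the data-cost term the role of a (negative log) likelihood, which is exactly the pairing the abstract traces into machine learning.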
Related papers
PLOS ONE, 2018
Probabilistic proposals of Languages of Thought (LoTs) can explain learning across different domains as statistical inference over a compositionally structured hypothesis space. While frameworks may differ on how an LoT may be implemented computationally, they all share the property that they are built from a set of atomic symbols and rules by which these symbols can be combined. In this work we propose an extra validation step for the set of atomic productions defined by the experimenter. It starts by expanding the defined LoT grammar for the cognitive domain with a broader set of arbitrary productions, and then uses Bayesian inference over the experimental data to prune the productions. The result allows the researcher to validate that the resulting grammar still matches the intuitive grammar chosen for the domain. We then test this method on the language of geometry, a specific LoT model for geometrical sequence learning. Finally, although the geometrical LoT is not a universal (i.e. Turing-complete) language, we show an empirical relation between a sequence's probability and its complexity that is consistent with the theoretical relationship for universal languages described by Levin's Coding Theorem.
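The expand-then-prune step described above can be caricatured in a few lines. Everything in this sketch is invented for illustration: the productions, the sequences, the fixed prior cost, and the occurrence count standing in for likelihood gain. The paper's actual method performs full Bayesian inference over the grammar rather than this threshold heuristic.

```python
# Hypothetical caricature of the pruning step: keep a candidate production
# only if its contribution to explaining the data outweighs a fixed prior
# cost. The counting stand-in for likelihood gain is a deliberate
# simplification of the Bayesian inference used in the paper.

PRIOR_COST = 1.5  # assumed penalty, in "bits", per extra production

def support(production, sequences):
    """Crude stand-in for likelihood gain: count sequences using it."""
    return sum(1.0 for seq in sequences if production in seq)

def prune(candidates, sequences):
    """Drop productions whose support does not pay for their prior cost."""
    return [p for p in candidates if support(p, sequences) > PRIOR_COST]

sequences = ["rot rot sym", "rot sym rot", "rot rot rot"]
candidates = ["rot", "sym", "mirror"]  # "mirror" never occurs in the data
print(prune(candidates, sequences))    # ['rot', 'sym']
```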
Journal of Japanese Linguistics, 2010
This paper explores how the hypothetico-deductive method can be applied to research concerned with the properties of the language faculty. The paper first discusses how we can try to identify informant judgments that are likely a reflection of properties of the Computational System (or properties of the language faculty that are directly related to the Computational System), proposes a method of hypothesis testing in line with the hypothetico-deductive method, and provides an illustration by examining the predictions made under the lexical hypothesis that otagai in Japanese is a local anaphor.
Bucharest Working Papers in Linguistics XVII/2, 2015
Natural Language Engineering, 2003
Chapter 2 discussed the problem of quantifying the degree of resemblance between separate analyses of a given stretch of wording. We mentioned in 2.4 that the metric we use, the leaf-ancestor metric, is not the one standardly used in our discipline but, we believe, is considerably superior to the standard metric. In that chapter, the purpose for which we needed a tree-comparison metric was investigation of how far a set of parsing guidelines succeeded in predicting a unique analysis for a given language sample, so that separate human analysts dealing with the same samples would be constrained to come up with the same analyses.
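The leaf-ancestor idea can be sketched compactly. The version below is a simplified illustration, not the exact metric from the book: for each leaf, compare its chain of ancestor labels in the two parses by edit distance, then average the per-leaf similarities. It assumes both parses cover the same token sequence, and represents trees as nested tuples whose leaves are one-element tuples.

```python
# Simplified sketch of a leaf-ancestor tree-comparison metric: per-leaf
# similarity of ancestor-label chains, averaged over the sentence.

def lineages(tree, path=()):
    """Yield (word, ancestor-label path) for each leaf of a tuple tree."""
    label, *children = tree
    if not children:
        yield label, path
    else:
        for child in children:
            yield from lineages(child, path + (label,))

def edit_distance(a, b):
    """Plain Levenshtein distance between two label sequences."""
    m, n = len(a), len(b)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i
    for j in range(n + 1):
        d[0][j] = j
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            d[i][j] = min(d[i - 1][j] + 1,
                          d[i][j - 1] + 1,
                          d[i - 1][j - 1] + (a[i - 1] != b[j - 1]))
    return d[m][n]

def leaf_ancestor_similarity(t1, t2):
    """Average per-leaf similarity between two parses of one sentence."""
    sims = []
    for (_, p1), (_, p2) in zip(lineages(t1), lineages(t2)):
        sims.append(1 - edit_distance(p1, p2) / max(len(p1) + len(p2), 1))
    return sum(sims) / len(sims)

# Two parses of "the cat sleeps" that disagree only on whether the verb
# sits under a VP node.
t1 = ("S", ("NP", ("the",), ("cat",)), ("VP", ("sleeps",)))
t2 = ("S", ("NP", ("the",), ("cat",)), ("sleeps",))
print(leaf_ancestor_similarity(t1, t1))            # identical parses: 1.0
print(round(leaf_ancestor_similarity(t1, t2), 3))  # 0.889
```

Because the score is built leaf by leaf, a single misattached constituent only lowers the similarity of the leaves it dominates, which is the property that makes the metric attractive for locating where two analyses disagree.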
This monograph (written in Spanish) is a critical history of transformational generative models of syntax, from Chomsky’s seminal Logical Structure of Linguistic Theory (1955) to current advances in Minimalism. The primary objective of the book is to explicitly and clearly explain the changes that Generative syntax has undergone in this period, and how these changes relate not only to the internal dynamics of the Generative field (for example, disagreements among different authors as to how to analyse a particular phenomenon), but also the relationship between the transformational generative enterprise and other theories of natural and formal languages (including American and European structuralist formalisms, for natural languages, and Post-Turing formalisms, for formal languages). The book presents the evolution of Generative Grammar as a continuous process, emphasizing the main theoretical developments and proceeding to the specifics while maintaining a sense of historical unity and continuity.
Computer Speech & Language, 2013
LTAG is a rich formalism for performing NLP tasks such as semantic interpretation, parsing, machine translation and information retrieval. Depending on the specific NLP task, different kinds of LTAGs for a language may be developed. Each of these LTAGs is enriched with specific features, such as semantic representations and statistical information, that make it suitable for use in that task. The distribution of these capabilities among separate LTAGs makes it difficult to benefit from all of them in NLP applications.
My gratitude to the supervisor of this research paper, Pr. Mounia Amrani, is beyond words.
2019
This report presents measurements of the quality of various dictionaries learned from two different language-learning pipelines: one is the ULL/Kolonin variant, the other is the Linas variant. The ULL/Kolonin dictionaries suggest two major breakthroughs: they suggest how the learning pipeline should be tuned, and they indicate that the pipeline is relatively insensitive to early stages of processing. The Linas-variant results indicate that the sheer quantity of training data has a strong impact on the quality of the learned grammars. This report briefly reviews the dictionaries, the algorithms used to obtain them, and the measurement results. It is assumed that the reader has a general familiarity with the project.
1 Baseline Measurements
This section provides an introduction and a baseline for the algorithm, the measurements, and the results.
The Association for Computational Linguistics, 2016
Despite the growing number of Computational Construction Grammar implementations, the field is still lacking evaluation methods to compare grammar fragments across different platforms. Moreover, the hand-crafted nature of most grammars requires profiling tools to understand the complex interactions between constructions of different types. This paper presents a number of evaluation measures, partially based on existing measures in the field of semantic parsing, that are especially relevant for reversible grammar formalisms. The measures are tested on a grammar fragment for European Portuguese clitic placement that is currently under development.
