A Theory of Stochastic Grammars
https://doi.org/10.1007/3-540-45154-4_9…
14 pages
Abstract
A novel theoretical framework for describing stochastic grammars is proposed based on a small set of basic random variables that generate tree structures and relate them to surface strings. A number of prominent statistical language models are formulated as stochastic processes over these basic random variables.
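As a hedged illustration of the general idea (a toy PCFG stand-in, not the paper's actual set of random variables): a stochastic grammar can be viewed as a process whose random choices build a tree, and whose yield relates the tree to a surface string.

    import random

    def sample_tree(rules, symbol):
        """Sample a derivation tree top-down. `rules[A]` is a list of
        (probability, right_hand_side) pairs; symbols absent from `rules`
        are terminals. Each rule choice is one basic random variable."""
        if symbol not in rules:
            return symbol  # terminal leaf
        probs, rhss = zip(*[(p, rhs) for p, rhs in rules[symbol]])
        rhs = random.choices(rhss, weights=probs, k=1)[0]
        return (symbol, [sample_tree(rules, s) for s in rhs])

    def yield_of(tree):
        """Relate the tree to its surface string (left-to-right leaves)."""
        if isinstance(tree, str):
            return [tree]
        return [w for child in tree[1] for w in yield_of(child)]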
Related papers
Proceedings of the Sixth Conference of the European Chapter of the Association for Computational Linguistics, 1993
In Data Oriented Parsing (DOP), an annotated corpus is used as a stochastic grammar. An input string is parsed by combining subtrees from the corpus. As a consequence, one parse tree can usually be generated by several derivations that involve different subtrees. This leads to a statistical model in which the probability of a parse is equal to the sum of the probabilities of all its derivations. An informal introduction to DOP is given in earlier work, while (Bod, 1992a) provides a formalization of the theory. In this paper we compare DOP with other stochastic grammars in the context of Formal Language Theory. It is proved that it is not possible to create for every DOP model a strongly equivalent stochastic CFG that also assigns the same probabilities to the parses. We show that the maximum probability parse can be estimated in polynomial time by applying Monte Carlo techniques. The model was tested on a set of hand-parsed strings from the Air Travel Information System (ATIS) spoken language corpus. Preliminary experiments yield 96% test set parsing accuracy.
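A minimal sketch of the Monte Carlo idea (the toy data structures are hypothetical, not the paper's implementation): sample many derivations of the input in proportion to their probability, and estimate each parse tree's probability by the fraction of sampled derivations that yield it.

    import random
    from collections import Counter

    def sample_derivation(derivations):
        """Sample one derivation proportionally to its probability.
        `derivations` maps a derivation id to (parse_tree, probability)."""
        ids = list(derivations)
        weights = [derivations[d][1] for d in ids]
        return random.choices(ids, weights=weights, k=1)[0]

    def most_probable_parse(derivations, samples=10000):
        """Estimate the maximum-probability parse by Monte Carlo sampling.
        Several derivations may yield the same tree; a tree's probability
        is the sum over its derivations, so the most frequently sampled
        tree is an estimate of the most probable one."""
        counts = Counter(derivations[sample_derivation(derivations)][0]
                         for _ in range(samples))
        return counts.most_common(1)[0][0]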
1999
Log-linear models provide a statistically sound framework for Stochastic "Unification-Based" Grammars (SUBGs) and stochastic versions of other kinds of grammars. We describe two computationally tractable ways of estimating the parameters of such grammars from a training corpus of syntactic analyses, and apply these to estimate a stochastic version of Lexical-Functional Grammar.
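As a hedged illustration (toy feature functions, not the estimators described above): a log-linear model assigns each candidate analysis of a string a probability proportional to the exponential of a weighted feature sum.

    import math

    def loglinear_probs(candidates, features, weights):
        """Log-linear distribution over the candidate analyses of one string.
        `features(y)` returns a dict of feature counts for analysis y;
        `weights` maps feature names to real-valued parameters."""
        scores = [sum(weights.get(f, 0.0) * v
                      for f, v in features(y).items())
                  for y in candidates]
        z = sum(math.exp(s) for s in scores)  # partition function
        return [math.exp(s) / z for s in scores]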
ACM Transactions on Asian Language Information Processing, 2004
In this paper, a hybrid language model is defined as a combination of a word-based n-gram, which is used to capture the local relations between words, and a category-based stochastic context-free grammar (SCFG) with a word distribution into categories, which is defined to represent the long-term relations between these categories. The problem of unsupervised learning of an SCFG in General Format and in Chomsky Normal Form by means of estimation algorithms is studied. Moreover, a bracketed version of the classical estimation algorithm based on the Earley algorithm is proposed. This paper also explores the use of SCFGs obtained from a treebank corpus as initial models for the estimation algorithms. Experiments on the UPenn Treebank corpus are reported; they were carried out in terms of test set perplexity and of the word error rate in a speech recognition experiment.
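A minimal sketch of such a hybrid (the linear interpolation weight and the component models are assumptions, not the paper's exact combination): the word n-gram supplies local probabilities, while the category-level SCFG with a word-given-category distribution supplies structural ones.

    def hybrid_word_prob(word, history, ngram, scfg_cat, word_given_cat, lam=0.5):
        """Interpolate a word n-gram with a category-based SCFG.
        `ngram(word, history)`: local n-gram probability of the next word.
        `scfg_cat(cat, history)`: SCFG-based probability that the next
        category is `cat` given the parsed history.
        `word_given_cat[cat][word]`: word distribution within each category."""
        p_local = ngram(word, history)
        p_struct = sum(scfg_cat(c, history) * dist.get(word, 0.0)
                       for c, dist in word_given_cat.items())
        return lam * p_local + (1 - lam) * p_struct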
Proceedings of the 15th Conference on Computational Linguistics, 1994
A new type of stochastic grammar is introduced for investigation: weakly restricted stochastic grammars. In this paper we concentrate on the consistency problem. To find conditions for stochastic grammars to be consistent, the theory of multitype Galton-Watson branching processes and generating functions is of central importance. The unrestricted stochastic grammar formalism generates the same class of languages as the weakly restricted formalism. The inside-outside algorithm is adapted for use with weakly restricted grammars.
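To make the branching-process connection concrete, here is a sketch (toy grammar encoding assumed) of the standard consistency test: build the first-moment matrix M, where M[i][j] is the expected number of occurrences of nonterminal j produced by one rewrite of nonterminal i, and check that its spectral radius lies strictly below 1 (the subcritical regime of the Galton-Watson process).

    import numpy as np

    def is_consistent(rules, nonterminals):
        """Consistency test for an SCFG via its multitype branching process.
        `rules[A]` is a list of (probability, right_hand_side) pairs, where
        the right-hand side is a list of symbols. Derivations terminate
        with probability 1 when the spectral radius of the first-moment
        matrix is strictly less than 1."""
        idx = {a: i for i, a in enumerate(nonterminals)}
        m = np.zeros((len(nonterminals), len(nonterminals)))
        for a, alternatives in rules.items():
            for p, rhs in alternatives:
                for sym in rhs:
                    if sym in idx:
                        m[idx[a], idx[sym]] += p
        return max(abs(np.linalg.eigvals(m))) < 1.0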
Interspeech 2011, 2011
A new method for inferring specific stochastic grammars is presented. The process, called Hybrid Model Learner (HML), applies the entropy rate to guide an agglomeration process of type ab -> c. Each rule derived from the input sequence is associated with a certain entropy-rate difference. A grammar automatically inferred from an example sequence can be used to detect and recognize similar structures in unknown sequences. Two important schools of thought, structuralism and 'stochasticism', are discussed, including how the two have met and are influencing current statistical learning methods. It is argued that syntactic methods may provide universal tools to model and describe structures from the very elementary level of signals up to the highest one, that of language.
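A rough sketch of the agglomeration step (the entropy estimate and greedy criterion here are simplified assumptions, not HML itself): replace the symbol pair whose merger most reduces an entropy-rate estimate with a fresh symbol, recording the replacement as a rule ab -> c.

    from collections import Counter
    import math

    def entropy_rate(seq):
        """Crude per-symbol entropy estimate from bigram statistics."""
        bigrams = Counter(zip(seq, seq[1:]))
        total = sum(bigrams.values())
        if total == 0:
            return 0.0
        return -sum(c / total * math.log2(c / total) for c in bigrams.values())

    def replace_pair(seq, pair, c):
        """Rewrite every non-overlapping occurrence of `pair` as symbol c."""
        out, i = [], 0
        while i < len(seq):
            if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
                out.append(c); i += 2
            else:
                out.append(seq[i]); i += 1
        return out

    def agglomerate_once(seq, new_symbol):
        """Merge the bigram whose replacement lowers the entropy rate most.
        Returns the rewritten sequence and the induced rule (a, b) -> c,
        or the sequence unchanged when no merger helps."""
        best_pair, best_rate = None, entropy_rate(seq)
        for pair in set(zip(seq, seq[1:])):
            rate = entropy_rate(replace_pair(seq, pair, new_symbol))
            if rate < best_rate:
                best_pair, best_rate = pair, rate
        if best_pair is None:
            return seq, None
        return replace_pair(seq, best_pair, new_symbol), (best_pair, new_symbol)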
Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994
We present an algorithm for computing n-gram probabilities from stochastic context-free grammars, a procedure that can alleviate some of the standard problems associated with n-grams (estimation from sparse data, lack of linguistic structure, among others). The method operates via the computation of substring expectations, which in turn is accomplished by solving systems of linear equations derived from the grammar. The procedure is fully implemented and has proved viable and useful in practice.
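The full method computes substring expectations; as a smaller self-contained illustration of the linear-equation step (the grammar encoding is an assumption), the expected number of expansions of each nonterminal per derivation already satisfies such a system and can be solved in closed form.

    import numpy as np

    def expected_expansions(rules, nonterminals, start):
        """Expected number of expansions of each nonterminal per derivation.
        With M[i][j] the expected count of nonterminal j on the right-hand
        sides chosen when expanding i, the expectations c satisfy
        c = e_start + M^T c, i.e. c = (I - M^T)^{-1} e_start, which is
        finite exactly when the grammar is consistent (subcritical).
        `rules[A]` is a list of (probability, right_hand_side) pairs."""
        idx = {a: i for i, a in enumerate(nonterminals)}
        m = np.zeros((len(nonterminals), len(nonterminals)))
        for a, alternatives in rules.items():
            for p, rhs in alternatives:
                for sym in rhs:
                    if sym in idx:
                        m[idx[a], idx[sym]] += p
        e = np.zeros(len(nonterminals))
        e[idx[start]] = 1.0
        return np.linalg.solve(np.eye(len(nonterminals)) - m.T, e)

Substring (and hence n-gram) expectations are obtained from linear systems of the same shape, with right-hand sides derived from the rule probabilities.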
Lecture Notes in Computer Science, 2003
The use of the Inside-Outside (IO) algorithm for estimating the probability distributions of Stochastic Context-Free Grammars is characterized by the use of all derivations in the learning process. However, its application in real Language Modeling tasks is restricted by the time complexity per iteration and the large number of iterations needed to converge. Alternatively, several estimation algorithms that consider only a certain subset of derivations in the estimation process have been proposed elsewhere. This set of derivations can be chosen according to structural criteria, or by selecting the k-best derivations. These alternatives are studied in this paper and tested on the Wall Street Journal corpus as processed in the Penn Treebank project.
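To contrast with full Inside-Outside, here is a sketch of the cheaper best-derivation (Viterbi-style) re-estimation loop; the parser and counting helpers are assumed for illustration, not specified by the paper.

    def viterbi_reestimate(grammar, corpus, best_parse, rule_counts, iters=10):
        """Re-estimate SCFG rule probabilities from best derivations only.
        Unlike Inside-Outside, which sums expected counts over all
        derivations, each sentence contributes counts from its single most
        probable derivation, which is much cheaper per iteration.
        `grammar` maps rules (lhs, rhs) to probabilities;
        `best_parse(grammar, sentence)` returns the Viterbi derivation;
        `rule_counts(derivation)` returns {(lhs, rhs): count} for it."""
        for _ in range(iters):
            counts = {}
            for sentence in corpus:
                for rule, c in rule_counts(best_parse(grammar, sentence)).items():
                    counts[rule] = counts.get(rule, 0) + c
            # normalize counts per left-hand side to get new probabilities
            totals = {}
            for (lhs, rhs), c in counts.items():
                totals[lhs] = totals.get(lhs, 0) + c
            grammar = {(lhs, rhs): c / totals[lhs]
                       for (lhs, rhs), c in counts.items()}
        return grammar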
Annals of Mathematics and Artificial Intelligence, 2006
Information and Control, 1969
Devices for the generation of languages, corresponding to probabilistic recognition devices or probabilistic automata, are introduced, and the resulting families of languages are investigated. Comparisons are made with some other recently introduced grammars, where restrictions are imposed not only on the form of the rewriting rules but also on their use. A uniform representation for such grammars is provided by the notion of a grammar with a prescribed control language for the derivations.
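A brief sketch of the control-language idea (the toy encoding with one-character symbols and a regular control language is an assumption for illustration): each rule carries a label, and a derivation counts only if the string of labels of the rules it applies belongs to the prescribed control language.

    import re

    def controlled_derivations(rules, control, start, max_steps=8):
        """Enumerate terminal strings of a labeled rewriting system whose
        label sequences match a regular control language (given as a regex).
        `rules` is a list of (label, lhs_symbol, rhs_string) triples;
        sentential forms are strings; nonterminals are uppercase letters."""
        results = []
        def expand(form, labels):
            if len(labels) > max_steps:
                return
            if form.islower():  # no nonterminals left: check the control word
                if re.fullmatch(control, "".join(labels)):
                    results.append(form)
                return
            for label, lhs, rhs in rules:
                i = form.find(lhs)  # leftmost occurrence only, for brevity
                if i >= 0:
                    expand(form[:i] + rhs + form[i + 1:], labels + [label])
        expand(start, [])
        return results

Restricting the label strings (for example, forcing two rules to alternate) constrains the derivations beyond what the rewriting rules alone allow, which is how control languages increase generative power.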
Computational Models of Speech Pattern Processing, 1999