Effective parsing with generalised phrase structure grammar
1985, Proceedings of the Second Conference of the European Chapter of the Association for Computational Linguistics
https://doi.org/10.3115/976931.976939
5 pages
Abstract
Generalised phrase structure grammars (GPSGs) appear to offer a means by which the syntactic properties of natural languages may be described very concisely. The main reason is that the GPSG framework allows a variety of meta-grammatical rules to be stated which generate new rules from old ones, so that rules with a wide variety of realisations can be specified via a very small number of explicit statements. Unfortunately, analysing a piece of text in terms of such rules is a very awkward task, since even a small set of GPSG statements will generate a large number of underlying rules.
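To make the rule blow-up concrete, here is a minimal sketch (the rule format and the simplified passive metarule are illustrative assumptions, not the paper's notation) of how a single metarule multiplies a few explicit statements into many underlying rules:

# A rule is a (lhs, rhs) pair; the toy metarule below is hypothetical.
base_rules = [
    ("VP", ("V", "NP")),
    ("VP", ("V", "NP", "PP")),
    ("VP", ("V", "NP", "S")),
]

def passive_metarule(rule):
    # Toy passive metarule: VP -> V NP ... yields VP[pas] -> V ... (PP[by]).
    lhs, rhs = rule
    if lhs == "VP" and len(rhs) >= 2 and rhs[:2] == ("V", "NP"):
        rest = rhs[2:]
        yield ("VP[pas]", ("V",) + rest)                 # agentless passive
        yield ("VP[pas]", ("V",) + rest + ("PP[by]",))   # with a by-phrase

expanded = list(base_rules)
for r in base_rules:
    expanded.extend(passive_metarule(r))

print(len(base_rules), "explicit rules ->", len(expanded), "underlying rules")
# 3 explicit rules -> 9 underlying rules

With several interacting metarules the expansion compounds, which is exactly why parsing directly with the expanded rule set becomes awkward.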
Related papers
Proceedings of the 19th Annual Meeting of the Association for Computational Linguistics, 1981
SYNTAGMA is a rule-based parsing system structured on two levels: a general grammar and language-specific grammars. The general grammar is implemented in the program; language-specific grammars are resources conceived as text files which contain a lexical database with meaning-related grammatical features, a description of constituent structures, a database of meaning-specific syntactic constraints, and a semantic network. Since its theoretical background is principally Tesnière's Éléments de syntaxe, SYNTAGMA's grammar emphasizes the role of argument structure (valency) in constraint satisfaction, and also allows horizontal bounds, for instance in the treatment of coordination. Notions such as traces and empty categories are derived from Generative Grammar, and some solutions are close to Government & Binding Theory, although they are the result of independent research. These properties allow SYNTAGMA to manage complex syntactic configurations and well-known weak points in parsing engineering. An important resource is the semantic network, which SYNTAGMA uses in disambiguation tasks. In contrast to statistical and data-driven parsers, the system's behavior can be controlled and fine-tuned, since gaps, traces and long-distance relations are structurally set, and its constituent generation process is not a linear left-to-right shift-and-reduce but a bottom-up, rule-driven procedure.
2004
The paper outlines a hybrid architecture for a partial parser based on regular grammars over XML documents. The parser is used to support the annotation process in the BulTreeBank project, so it annotates only the 'sure' cases. To maximize the number of analyzed phrases, the parser applies a set of grammars in a dynamic fashion. Each grammar determines not only the constituent structure (plus some syntactic dependencies internal to the structure) but also a description of the local and global context of the recognized phrase. The grammars available to the parser are arranged in a network. The order in which the grammars are applied depends on the initial ordering in the network and the descriptions associated with the grammars; thus the traversal is not deterministic. Additionally, the application of the grammars can be interleaved with the application of other XML tools such as remove, insert and transform operations. This architecture provides a flexible means for g...
This paper describes work on the linguistic analysis of texts within a project devoted to knowledge acquisition from text. We focus on syntactic processing and present some key elements of the project's parser that allow it to deal successfully with technical texts. The parser is fully implemented and tested on a variety of real texts; improvements and enhancements are in progress. Because our knowledge acquisition method assumes no a priori model of the domain of the source text, the parser relies as much as possible on lexical and syntactic clues. That is why it strives for full syntactic analysis rather than some form of text skimming. We present a practical approach to four acknowledged hard problems that to date have no generally accepted answers: phrase attachment; time constraints for problematic input (how to avoid long and unproductive computation); parsing conjoined structures (how to preserve broad coverage without losing control of the parsing process); and the treatment of fragmentary input or fragments that are a by-product of a fallback parsing strategy. We review recent related work and conclude by listing several future work items.
2010
In the context of natural language processing, the term parsing refers to the process of automatically analyzing a given sentence, viewed as a sequence of words, in order to determine its possible underlying syntactic structures. Parsing requires a mathematical model of the syntax of the language of interest. In this chapter, these mathematical models are assumed to be formal grammars.
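As a concrete illustration of this view of parsing (the tiny grammar and recogniser below are assumptions for the example, not taken from the chapter), a CYK recogniser decides whether a context-free grammar in Chomsky normal form derives a given sentence:

from itertools import product

# Hypothetical toy grammar in Chomsky normal form.
unary = {"she": {"NP"}, "saw": {"V"}, "stars": {"NP"}}
binary = {("V", "NP"): {"VP"}, ("NP", "VP"): {"S"}}

def cyk(words):
    n = len(words)
    # table[i][j] holds the categories spanning words[i:j]
    table = [[set() for _ in range(n + 1)] for _ in range(n + 1)]
    for i, w in enumerate(words):
        table[i][i + 1] = set(unary.get(w, set()))
    for span in range(2, n + 1):
        for i in range(n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for a, b in product(table[i][k], table[k][j]):
                    table[i][j] |= binary.get((a, b), set())
    return "S" in table[0][n]

print(cyk("she saw stars".split()))  # True: an underlying structure exists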
TAL (Traitement Automatique des Langues), 2005
We investigated the efficacy of beam search and deep parsing techniques in probabilistic HPSG parsing. We first tested beam thresholding and iterative parsing. Next, we tested three techniques originally developed for deep parsing: quick check, large constituent inhibition, and hybrid parsing with a CFG chunk parser. Quick check, iterative parsing and hybrid parsing contributed greatly to total parsing performance. The accuracy and average parsing time for the Penn Treebank were 87.2% and 355 ms. Finally, we tested the robustness and scalability of HPSG parsing on the MEDLINE corpus, consisting of around 1.4 billion words. The entire corpus was parsed in 9 days with 340 CPUs.
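A minimal sketch of the beam-thresholding idea (a reconstruction under simple assumptions, not the paper's implementation): within each chart cell, analyses whose log probability falls too far below the cell's best are pruned before they can propagate to larger constituents:

import math

def beam_threshold(cell, beam=math.log(1e-3)):
    # cell: dict mapping an analysis to its log probability.
    if not cell:
        return cell
    best = max(cell.values())
    return {a: lp for a, lp in cell.items() if lp >= best + beam}

cell = {"NP -> DT NN": -1.2, "NP -> JJ NN": -4.0, "FRAG -> NN": -15.0}
print(beam_threshold(cell))  # the implausible FRAG analysis is pruned

Iterative parsing, in this setting, typically re-runs the parse with a wider beam whenever pruning leaves no complete analysis.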
This paper presents a strategy for syntactic analysis based on the combination of two different parsing techniques: lexical syntactic tagging and phrase structure syntactic parsing. The basic proposal is to take advantage of the good results of lexical syntactic tagging to improve the overall performance of unification-based parsing. The syntactic functions attached to every word by the lexical syntactic tagger are used as head features in the unification-based grammar, and form the basis for the grammar rules.
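A minimal sketch of this combination (the flat feature representation is a hypothetical simplification): the syntactic-function tag assigned to a word becomes a head feature that a unification-based rule must reconcile with the features of its arguments:

def unify(f, g):
    # Unify two flat feature dicts; None means unspecified.
    out = dict(f)
    for k, v in g.items():
        if k in out and out[k] is not None and v is not None and out[k] != v:
            return None          # feature clash: the rule does not apply
        out[k] = out.get(k) if v is None else v
    return out

subj = {"fun": "subject", "num": "sg"}   # from the lexical syntactic tagger
verb = {"fun": None, "num": "sg"}        # head features required by the verb
print(unify(subj, verb))                 # {'fun': 'subject', 'num': 'sg'}
print(unify(subj, {"num": "pl"}))        # None: agreement failure blocks the rule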
Proceedings of the 21st Annual Meeting of the Association for Computational Linguistics, 1983
A central goal of linguistic theory is to explain why natural languages are the way they are. It has often been supposed that computational considerations ought to play a role in this characterization, but rigorous arguments along these lines have been difficult to come by. In this paper we show how a key "axiom" of certain theories of grammar, Subjacency, can be explained by appealing to general restrictions on on-line parsing plus natural constraints on the rule-writing vocabulary of grammars. The explanation avoids the problems with Marcus' [1980] attempt to account for the same constraint. The argument is robust with respect to machine implementation, and thus avoids the problems that often arise when making detailed claims about parsing efficiency. It has the added virtue of unifying in the functional domain of parsing certain grammatically disparate phenomena, as well as making a strong claim about the way in which the grammar is actually embedded into an on-line sentence processor.
Proceedings of the 13th Conference on Computational Linguistics, 1990
In this paper, we propose an optimized strategy, called Bottom-Up Filtering, for parsing GPSGs. This strategy is based on a particular, high-level interpretation of GPSGs. It permits a significant reduction of the non-determinism inherent in the rule selection process.
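A minimal sketch of a bottom-up filter in this spirit (a generic left-corner reconstruction, not the paper's exact strategy): a rule for a category is activated only if the next input category is a possible left corner of the rule's first right-hand-side symbol, which prunes hopeless rules before they feed the non-deterministic selection process:

# Hypothetical toy grammar; the rule format is illustrative.
rules = {
    "S":  [("NP", "VP")],
    "NP": [("Det", "N"), ("NP", "PP")],
    "VP": [("V", "NP")],
    "PP": [("P", "NP")],
}

def left_corners(grammar):
    # Reflexive-transitive closure of the left-corner relation.
    lc = {a: {a} for a in grammar}
    changed = True
    while changed:
        changed = False
        for a, rhss in grammar.items():
            for rhs in rhss:
                new = lc.get(rhs[0], {rhs[0]}) - lc[a]
                if new:
                    lc[a] |= new
                    changed = True
    return lc

LC = left_corners(rules)

def candidate_rules(category, next_cat):
    # Keep only rules whose first RHS symbol can start with next_cat.
    return [rhs for rhs in rules[category]
            if next_cat in LC.get(rhs[0], {rhs[0]})]

print(candidate_rules("S", "Det"))  # [('NP', 'VP')]: a determiner can start an S
print(candidate_rules("S", "V"))    # []: no S rule survives the filter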
2001
2 Context-Free Grammars
2.1 Languages
2.2 Grammars
2.2.1 Notational conventions
2.3 The language of a grammar
2.3.1 Some basic languages
2.4 Parse trees
2.4.1 From context-free grammars to datatypes
2.5 Grammar transformations
2.6 Concrete and abstract syntax
2.7 Constructions on grammars
2.7.1 SL: an example
2.8 Parsing
2.9 Exercises
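A minimal sketch of the outline's "from context-free grammars to datatypes" step (the expression grammar is a hypothetical example, and the chapter's own code is presumably in a functional language; this sketch uses Python dataclasses): each nonterminal becomes a type and each production a constructor, so parse trees are ordinary values of the datatype:

from dataclasses import dataclass
from typing import Union

# Grammar: E -> E '+' T | T ;  T -> digit
@dataclass
class Add:              # production E -> E '+' T
    left: "Expr"
    right: "Term"

@dataclass
class Digit:            # production T -> digit
    value: int

Term = Digit
Expr = Union[Add, Term]

def evaluate(e: Expr) -> int:
    # A fold over the parse tree (the abstract syntax).
    if isinstance(e, Add):
        return evaluate(e.left) + evaluate(e.right)
    return e.value

tree = Add(Add(Digit(1), Digit(2)), Digit(3))  # parse tree of "1+2+3"
print(evaluate(tree))  # 6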

References (6)
- Becker, J.D., The Phrasal Lexicon. TINLAP, 1975.
- Gazdar, G., Klein, E., Pullum, G.K. & Sag, I.A., Generalised Phrase Structure Grammar. Blackwell, Oxford (in press, 1985).
- Marcus, M., A Theory of Natural Language Processing. PhD thesis, MIT, 1980.
- Shieber, S.M., Direct Parsing of ID/LP Grammars. Linguistics & Philosophy 7/2, 1984.
- Thorne, J.P., Bratley, P. & Dewar, H., The Syntactic Analysis of English by Machine. In Machine Intelligence 3, ed. Michie, Edinburgh UP, 1968.
- Thompson, H., Handling Metarules in a Parser for GPSG. DAI Research Paper 175, University of Edinburgh, 1982.