Computational Learning of Construction Grammars
2017, Language & Cognition
https://doi.org/10.1017/LANGCOG.2016.7Abstract
This paper presents an algorithm for learning the construction grammar of a language from a large corpus. This grammar induction algorithm has two goals: First, to show that construction grammars are learnable without highly specified innate structure; Second, to develop a model of which units do or do not constitute constructions in a given dataset. The basic task of construction grammar induction is to identify the minimum set of constructions that represents the language in question with maximum descriptive adequacy. These constructions must (1) generalize across an unspecified number of units while (2) containing mixed levels of representation internally (e.g., both item-specific and schematized representations) and (3) allowing for unfilled and partially filled slots. Additionally, these constructions may (4) contain recursive structure within a given slot that needs to be reduced in order to produce a sufficiently schematic representation. In other words, these constructions are multi-length, multi-level, possibly discontinuous co-occurrences which generalize across internal recursive structures. These co-occurrences are modeled using frequency and the ΔP measure of association, expanded in novel ways to cover multi-unit sequences. This work provides important new evidence for the learnability of construction grammars as well as a tool for the automated corpus analysis of constructions.
References (65)
- Baroni, M., Bernardini, S., Ferraresi, A., & Zanchetta, E. (2009). "The WaCky Wide Web: A Collection of Very Large Linguistically Processed Web-crawled Corpora." Language Resources and Evaluation, 43: 209-226.
- Blunsom, P. & Cohn, T. (2010). "Unsupervised induction of tree substitution grammars for dependency parsing." In Proceedings of the Conference on Empirical Methods in Natural Language Processing, 1,204-1,213.
- Bod, R. (2006). "Exemplar-based syntax: How to get productivity from examples. The Linguistic Review, 22: 291-320.
- Briscoe, T. (2000). "Grammatical Acquisition: Inductive bias and coevolution of language and the language acquisition device." Language, 76(2): 245-296.
- Bryant, J. (2004). "Scalable construction-based parsing and semantic analysis." In Proceedings of the Workshop on Scalable Natural Language Understanding (HLT-NAACL): 33-40.
- Bybee, J. (2006). "From usage to grammar: The mind's response to repetition." Language, 82(4): 711-733.
- Bybee, J. (2010). Language, Usage, and Cognition. Cambridge, UK: Cambridge University Press.
- Chang, N.; De Beule, J.; & Micelli, V. (2012). "Computational construction grammar: Comparing ECG and FCG." In Steels, L. (ed.), Computational Issues in Fluid Construction Grammar. Berlin: Springer. 259-288.
- Chomsky, N. (1965). Aspects of the Theory of Syntax. Cambridge, MA: MIT Press.
- Chomsky, N. (1975). Logical Structure of Linguistic Theory. Philadelphia: Springer.
- Clark, A. (2001). "Unsupervised induction of stochastic context-free grammars using distributional clustering." In Proceedings of the 5 th Conference on Natural Language Learning.
- da Silva, J. & Lopez, G. (1999). "A local maxima method and a fair dispersion normalization for extracting multi-word units from corpora." In Proceedings of the 6 th Meeting on the Mathematics of Language, 369-381.
- Daudaravičius, V. & Marcinkevičienė, R. (2004). "Gravity counts for the boundaries of collocations." International Journal of Corpus Linguistics, 9(2): 321-348.
- Davies, M. (2010). "The Corpus of Contemporary American English as the first reliable monitor corpus of English." Literary and Linguistic Computing, 25(4): 447-464.
- Dennis, S. (2005). "An exemplar-based approach to unsupervised parsing." In Proceedings of the 27 th Annual Conference of the Cognitive Science Society: 583-588.
- Dunn, J. (2015). "Review of The Semantic Representation of Natural Language." Studies in Language, 39(2): 492-500.
- Fillmore, C. (1988). "The Mechanisms of 'Construction Grammar.'" In Proceedings of the Fourteenth Annual Meeting of the Berkeley Linguistics Society. 35-55.
- Firth, J. (1957). Papers in Linguistics, 1934-1951. Oxford, Oxford University Press.
- Forsberg, M.; Johansson, R.; Bäckström, L.; Borin, L.; Lyngfelt, B.; Olofsson, J.; & Prentice, J. (2014). "From construction candidates to constructicon entries: An experiment using semi- automatic methods for identifying constructions in corpora." Constructions and Frames, 6(1): 114-135.
- Goldberg, A. (2006). Constructions at work: The nature of generalization in language. Oxford: Oxford University Press.
- Goldberg, A. (2009). "The nature of generalization in language." Cognitive Linguistics, 20(1): 93-127.
- Goldberg, A.; Casenhiser, D.; & Sethuraman, N. (2004). "Learning argument structure generalizations." Cognitive Linguistics, 15(3): 289-316.
- Goldsmith, J. (2001). "Unsupervised learning of the morphology of a natural language." Computational Linguistics, 27(2): 153-198.
- Goldsmith, J. (2006). "An algorithm for the unsupervised learning of morphology." Natural Language Engineering, 12(4): 353-371.
- Gries, S. (2008). "Dispersions and adjusted frequencies in corpora." International Journal of Corpus Linguistics, 13(4): 403-437.
- Gries, S. (2012). "Frequencies, probabilities, and association measures in usage-/ exemplar-based linguistics: Some necessary clarifications." Studies in Language, 11(3): 477-510.
- Gries, S. (2013). "50-something years of work on collocations: What is or should be next." International Journal of Corpus Linguistics, 18(1): 137-165.
- Gries, S. & Mukherjee, J. (2010). "Lexical gravity across varieties of English: An ICE-based study of n- grams in Asian Englishes." International Journal of Corpus Linguistics, 15(4): 520-548.
- Gries, S. & Stefanowitsch, A. (2004a). "Extending collostructional analysis: A corpus-based perspective on 'alternations'." International Journal of Corpus Linguistics, 9(1): 97-129.
- Gries, S. & Stefanowitsch, A. (2004b). "Co-varying lexemes in the into-causative." In Achard, M. & Kemmer, S. (eds.), Language, culture, and mind. Stanford: Stanford: CSLI. 225-236.
- Headden, W.; Johnson, M.; & McClosky, D. (2009). "Improving unsupervised dependency parsing with richer contexts and smoothing." In Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics, 101-109.
- Heinz, J.; de la Higuera, C.; van Zaanen, M. (2016). Grammatical inference for computational linguistics. San Rafael, CA: Morgan & Claypool Publishers.
- Hilpert, M. (2008). "New evidence against the modularity of grammar: Constructions, collocations, and speech perception." Cognitive Linguistics, 19(3): 483-503.
- Hopper, P. (1987). "Emergent grammar." Proceedings of the 13 th Annual Meeting of the Berkeley Linguistics Society, 139-157.
- Istvan, N. & Vincze, V. (2014). "VPCTagger: Detecting Verb-Particle constructions with syntax-based methods." In Proceedings of the 10 th Workshop on Multiword Expressions, 17-25.
- Jelinek, F. (1990). "Self-organizing language modeling for speech recognition." In A. Waibel & K. Lee (eds.), Readings in Speech Recognition. San Mateo, CA: Morgan Kaufmann. 450-506.
- Katzir, R. (2014). "A cognitively plausible model for grammar induction." Journal of Language Modelling, 2(2): 213-248.
- Kay, P. & Fillmore, C. (1999). "Grammatical constructions and linguistic generalizations: The What's X Doing Y? construction. "Language, 75(1): 1-33.
- Klein, D. & Manning, C. (2002). "A generative constituent-context model for improved grammar induction." In Proceedings of the 40 th Annual Meeting of the Association for Computational Linguistics: 128-135.
- Langacker, R. (1987). Foundations of Cognitive Grammar. Stanford: Stanford University Press.
- Langacker, R. (2006). "On the continuous debate about discreteness." Cognitive Linguistics, 17(1): 107-151.
- Langacker, R. (2008). Cognitive Grammar: A basic introduction. Oxford: Oxford University Press.
- Levison, M.; Lessard, G.; Thomas, C.; Donald, M. (2013). The Semantic Representation of Natural Language. New York: Bloomsbury Publishing.
- Lidz, J. & Williams, A. (2009). "Constructions on holiday." Cognitive Linguistics, 20(1): 177-189.
- Mareček, D. & Straka, M. (2013). "Stop-probability estimates computed on a large corpus improve unsupervised dependency parsing." In Proceedings of the 51 st Annual Meeting of the Association for Computational Linguistics, 281-290.
- Nirenburg, S. & Raskin, V. (2004). Ontological Semantics. Cambridge, MA: MIT Press.
- Nivre, J.; Hall, J; Nilsson, J.; Chanev, A.; Eryigit, G.; Kubler, S.; Marinov, S.; & Marsi, E. (2007). "MaltParser: A language-independent system for data-driven dependency parsing." Natural Language Engineering, 13(2): 95-135.
- O'Donnell, M. & Ellis, N. (2010). "Towards an inventory of English verb argument constructions." In Proceedings of the Workshop on Extracting and Using Constructions in Computational Linguistics (NAACL-HTL): 9-16.
- Piao, S.; Bianchi, F.; Dayrell, C.; D'Egidio, A.; & Rayson, P. (2015). "Development of the multilingual semantic annotation system." In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics -Human Language Technologies, 1268-1274.
- Schmid, H. (1994). "Probabilistic part-of-speech tagging using decision trees." In Proceedings of the International Conference on New Methods in Language Processing.
- Solan, Z.; Horn, D.; Ruppin, E.; Edelman, S. (2005). "Unsupervised learning of natural languages." Proceedings of the National Academy of Sciences, 102(33): 11,629-11,634.
- Spitkovsky, V.; Alshawi, H.; & Jurafsky, D. (2013). "Breaking out of local optima with count transforms and model recombination: A study in grammar induction." In Proceedings of 2013 Conference on Empirical Methods in Natural Language Processing, 1983-1995.
- Steels, L. (2004). "Constructivist development of grounded construction grammar." In Proceedings of the 42 nd Meeting of the Association for Computational Linguistics: 9-16.
- Steels, L. (2012). "Design methods for fluid construction grammar." In Steels, L. (ed), Computational Issues in Fluid Construction Grammar. Berlin: Springer. 3-36.
- Stefanowitsch, A. & Gries, S. (2003). "Collostructions: Investigating the interaction between words and constructions." International Journal of Corpus Linguistics, 8(2): 209-243.
- Stefanowitsch, A. & Gries, S. (2005). "Covarying lexemes." Corpus Linguistics and Linguistic Theory, 1(1): 1-43.
- Tomasello, M. (2003). Constructing a language. Cambridge, MA: Harvard University Press. Computational Learning of Grammars, 32
- Tsao, N. & Wible, D. (2013). "Word similarity using constructions as contextual features." In Proceedings of the Joint Symposium on Semantic Processing: Textual Inference and Structures in Corpora. 51-59.
- van de Cruys, T. (2011). "Two multivariate generalizations of pointwise mutual information." In Proceedings of the Workshop on Distributional Semantics and Compositionality, 16-20.
- van Zaanen, M. (2000). "ABL: Alignment-based learning." In Proceedings of the 18 th International Conference on Computational Linguistics, 961-967.
- Vincze, V.; Zsibrita, J.; & Istvan, N. (2013). "Dependency parsing for identifying Hungarian light-verb constructions." In Proceedings of the International Joint Conference on Natural Language Processing, 207-215.
- Wei, N. & Li, J. (2013). "A new computing method for extracting contiguous phraseological sequences from academic text corpora." International Journal of Corpus Linguistics, 18(4): 506-535.
- Wible, D. & Taso, N. (2010). "StringNet as a computational resource for discovering and investigating linguistic constructions." In Proceedings of the Workshop on Extracting and Using Constructions in Computational Linguistics (NAACL-HTL): 25-31.
- Zadrozny, W.; Szummer, M.; Jarecki, S.; Johnson, D.; & Morhenstern, L. (1994). "NL understanding with a grammar of constructions." In Proceedings of the International Conference on Computational Linguistics: 1,289-1,293.
- Zuidema, W. (2006). "What are the productive units of natural language grammar? A DOP approach to the automatic identification of constructions." In Proceedings of the 10 th Conference on Computational Natural Language Learning: 29-36.