A Local Grammar of French determiners for deep syntactic parsing
Bases de données lexicales : construction et applications
Abstract
Existing syntactic grammars of natural languages, even with a far from complete coverage, are complex objects. Assessments of the quality of parts of such grammars are useful for the validation of their construction. We extended a grammar of French determiners that takes the form of a recursive transition network. The result of the application of this local grammar gives deeper syntactic information than chunking or information available in treebanks. We evaluated its quality by comparison with a corpus independently annotated with information on determiners. We obtained 86% precision and 92% recall on text not tagged for parts of speech.
References (23)
- Abeillé, A. & Barrier, N. 2004. "Enriching a French Treebank", Lino et al. (eds.), Proceedings of the International Conference on Language Resources and Evaluation (LREC), Lisbon.
- Blanc, O. & Constant, M. 2005. "Lexicalisation of grammars with parameterized graphs", Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP), Borovets (Bulgaria), p. 117-121.
- Buvet, P.-A. 1994. "Détermination : les noms", Lingvisticae Investigationes 18:121-150.
- Buvet, P.-A. & Lim, J. 1996. "Les déterminants nominaux aspectuels", Lingvisticae Investigationes 20:271-285.
- Constant, M. 2000. "Description d'expressions numériques en français", Revue Informatique et Statistique dans les Sciences Humaines 36:119-135.
- Courtois, B. 1990. "Un système de dictionnaires électroniques pour les mots simples du français", Langue française 87:11-22.
- Danlos, L. 2005. "Automatic Recognition of French Expletive Pronoun Occurrences", Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP), Companion Volume, p. 73-78, Jeju, Korea.
- Fairon, C., Paumier, S. & Watrin, P. 2005. "Can we parse without tagging? ", Proceedings of the Language & Technology Conference: Human Language Technologies, Poznan, Poland, p. 473- 477.
- Gross, M. 1998-1999. "Lemmatization of Compound Tenses in English", Lingvisticae Investigationes 22:71-122, Amsterdam/Philadelphia: Benjamins.
- Gross, M. 2000. "A Bootstrap Method for Constructing Local Grammars", Bokan, N. (ed.), Proceedings of the Symposium on Contemporary Mathematics, University of Belgrad, Serbia, p. 229-250.
- Gross, M. 2001. "Grammaires locales de déterminants nominaux", Détermination et formalisation, LIS 23, Amsterdam/Philadelphia: Benjamins, p.177-193.
- Laporte, E., Ranchhod, E. & A. Yannacopoulou 2006. "Syntactic variation of support verb constructions", Proceedings of the Lexis and Grammar Conference (LGC), Palermo, Italy.
- Mason, O. 2004. "Automatic Processing of Local Grammar Patterns", Proceedings of the Annual Colloquium for the UK Special Interest Group for Computational Linguistics, Birmingham, p.166-171.
- Nam, J. & Choi, K. 1997. "A Local-Grammar-based Approach to Recognizing of Proper Names in Korean Texts". Zhou & Church (eds.), Proceedings of the Workshop on Very Large Corpora, ACL/Tsing-hua University/Hong-Kong University of Science and Technology, p. 273-288.
- Nenadic, G. 2000. "Local Grammars and Parsing Coordination of Nouns in Serbo-Croatian", Proceedings of Text, Speech and Dialogue (TSD), LNAI 1902, Springer, p. 57-62.
- Paroubek, P., Robba, I., Vilnat, A. & Ayache, Ch. 2006. "Data, Annotations and Measures in EASY, the Evaluation Campaign for Parsers of French", Proceedings of the International Conference on Language Resources and Evaluation (LREC), Genoa, Italy.
- Paumier, S. 2006. The Unitex Manual. http://igm.univ-mlv.fr/~unitex/.
- Poibeau, Th. 2006. "Dealing with Metonymic Readings of Named Entities", Proceedings of the Annual Conference of the Cognitive Science Society (COGSCI), Vancouver, Canada.
- Ranchhod, E., Carvalho, P., Mota, C. & Barreiro, A. 2004. "Portuguese Large-scale Language Resources for NLP Applications", Lino et al. (eds.), Proceedings of the International Conference on Language Resources and Evaluation (LREC), Lisbon, p.1755-1758.
- Saetre, R. 2004. "GeneTUC -BioMolecular Information Retrieval", Computer Science Graduate Student Conference (CSGSC), Trondheim, Norway.
- Senellart, J., Plitt, M., Bailly, Ch. & Cardoso, F. 2001. "Resource alignment and implicit transfer", Machine translation in the information age, MT Summit, p. 317-323.
- Silberztein, M. 2003. "Finite-State Description of the French Determiner System", Journal of French Language Studies 13(2):221-246, Cambridge University Press.
- Venkova, T. 2000. "A local grammar disambiguator of compound conjunctions as a pre- processor for deep analysers", Proceedings of the Workshop on Linguistic Theory and Grammar Implementation, European Summer School in Logic, Language and Information (ESSLLI), Birmingham.