Academia.eduAcademia.edu

Outline

A Local Grammar of French determiners for deep syntactic parsing

Bases de données lexicales : construction et applications

Abstract

Existing syntactic grammars of natural languages, even with a far from complete coverage, are complex objects. Assessments of the quality of parts of such grammars are useful for the validation of their construction. We extended a grammar of French determiners that takes the form of a recursive transition network. The result of the application of this local grammar gives deeper syntactic information than chunking or information available in treebanks. We evaluated its quality by comparison with a corpus independently annotated with information on determiners. We obtained 86% precision and 92% recall on text not tagged for parts of speech.

References (23)

  1. Abeillé, A. & Barrier, N. 2004. "Enriching a French Treebank", Lino et al. (eds.), Proceedings of the International Conference on Language Resources and Evaluation (LREC), Lisbon.
  2. Blanc, O. & Constant, M. 2005. "Lexicalisation of grammars with parameterized graphs", Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP), Borovets (Bulgaria), p. 117-121.
  3. Buvet, P.-A. 1994. "Détermination : les noms", Lingvisticae Investigationes 18:121-150.
  4. Buvet, P.-A. & Lim, J. 1996. "Les déterminants nominaux aspectuels", Lingvisticae Investigationes 20:271-285.
  5. Constant, M. 2000. "Description d'expressions numériques en français", Revue Informatique et Statistique dans les Sciences Humaines 36:119-135.
  6. Courtois, B. 1990. "Un système de dictionnaires électroniques pour les mots simples du français", Langue française 87:11-22.
  7. Danlos, L. 2005. "Automatic Recognition of French Expletive Pronoun Occurrences", Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP), Companion Volume, p. 73-78, Jeju, Korea.
  8. Fairon, C., Paumier, S. & Watrin, P. 2005. "Can we parse without tagging? ", Proceedings of the Language & Technology Conference: Human Language Technologies, Poznan, Poland, p. 473- 477.
  9. Gross, M. 1998-1999. "Lemmatization of Compound Tenses in English", Lingvisticae Investigationes 22:71-122, Amsterdam/Philadelphia: Benjamins.
  10. Gross, M. 2000. "A Bootstrap Method for Constructing Local Grammars", Bokan, N. (ed.), Proceedings of the Symposium on Contemporary Mathematics, University of Belgrad, Serbia, p. 229-250.
  11. Gross, M. 2001. "Grammaires locales de déterminants nominaux", Détermination et formalisation, LIS 23, Amsterdam/Philadelphia: Benjamins, p.177-193.
  12. Laporte, E., Ranchhod, E. & A. Yannacopoulou 2006. "Syntactic variation of support verb constructions", Proceedings of the Lexis and Grammar Conference (LGC), Palermo, Italy.
  13. Mason, O. 2004. "Automatic Processing of Local Grammar Patterns", Proceedings of the Annual Colloquium for the UK Special Interest Group for Computational Linguistics, Birmingham, p.166-171.
  14. Nam, J. & Choi, K. 1997. "A Local-Grammar-based Approach to Recognizing of Proper Names in Korean Texts". Zhou & Church (eds.), Proceedings of the Workshop on Very Large Corpora, ACL/Tsing-hua University/Hong-Kong University of Science and Technology, p. 273-288.
  15. Nenadic, G. 2000. "Local Grammars and Parsing Coordination of Nouns in Serbo-Croatian", Proceedings of Text, Speech and Dialogue (TSD), LNAI 1902, Springer, p. 57-62.
  16. Paroubek, P., Robba, I., Vilnat, A. & Ayache, Ch. 2006. "Data, Annotations and Measures in EASY, the Evaluation Campaign for Parsers of French", Proceedings of the International Conference on Language Resources and Evaluation (LREC), Genoa, Italy.
  17. Paumier, S. 2006. The Unitex Manual. http://igm.univ-mlv.fr/~unitex/.
  18. Poibeau, Th. 2006. "Dealing with Metonymic Readings of Named Entities", Proceedings of the Annual Conference of the Cognitive Science Society (COGSCI), Vancouver, Canada.
  19. Ranchhod, E., Carvalho, P., Mota, C. & Barreiro, A. 2004. "Portuguese Large-scale Language Resources for NLP Applications", Lino et al. (eds.), Proceedings of the International Conference on Language Resources and Evaluation (LREC), Lisbon, p.1755-1758.
  20. Saetre, R. 2004. "GeneTUC -BioMolecular Information Retrieval", Computer Science Graduate Student Conference (CSGSC), Trondheim, Norway.
  21. Senellart, J., Plitt, M., Bailly, Ch. & Cardoso, F. 2001. "Resource alignment and implicit transfer", Machine translation in the information age, MT Summit, p. 317-323.
  22. Silberztein, M. 2003. "Finite-State Description of the French Determiner System", Journal of French Language Studies 13(2):221-246, Cambridge University Press.
  23. Venkova, T. 2000. "A local grammar disambiguator of compound conjunctions as a pre- processor for deep analysers", Proceedings of the Workshop on Linguistic Theory and Grammar Implementation, European Summer School in Logic, Language and Information (ESSLLI), Birmingham.