Monolingual Corpus-based MT using Chunks
2005
Abstract
In the present article, a hybrid approach is pro- posed for implementing a machine translation system using a large monolingual corpus cou- pled with a bilingual lexicon and basic NLP tools. In the first phase of the METIS system, a source language (SL) sentence, after being tagged, lemmatised and translated by a flat lemma-to-lemma lexicon, was matched against a tagged
References (15)
- Y. Al-Onaizan, U. Germann, U. Hermjakob, K. Knight, P. Koehn, D. Marcu, & K. Yamada. 2000. Translating with Scarce Resources. American As- sociation for Artificial Intelligence Conference (AAAI'00), 30 July -3 August, Austin, Texas, pages 672-678 (T Uhttp://www.isi.edu/natural- language/projects/rewriteU T).
- S. Boutsis, P. Prokopidis, V. Giouli & S. Piperidis. 2000. A Robust Parser for Unrestricted Greek Text. In "Proceedings of the Second International Conference on Language Resources and Evalua- tion", 31 May-2 June, Athens, Greece, Vol. 1, pp. 467-482.
- P. Brown, J. Cocke, S. Della Pietra, V. Della Pietra, F. Jelinek, J. Lafferty, R. Mercer & P. S. Roosin. 1990. A Statistical Approach to Machine Transla- tion. Computational Linguistics, 16(2):79-85.
- L. Cranias, H. Papageorgiou & S. Piperidis. 1997. Example Retrieval from a Translation Memory. Natural Language Engineering, 3:255-277.
- I. Dologlou, S. Markantonatou, G. Tambouratzis, O. Yannoutsou, A. Fourla & N. Ioannou. 2003. Us- ing Monolingual Corpora for Statistical Machine Translation. In "Proceedings of EAMT/CLAW 2003", Dublin, Ireland, 15-17 May.
- H. Dyvik. 1995. Exploiting Structural Similarities in Machine Translation. Computers and the Hu- manities, 28:225-234.
- ILSP Internal Document 2004. Specifications for METIS lexicon.
- N. Ioannou. 2003. METIS: Statistical Machine Translation Using Monolingual Corpora. In "Proceedings of the Workshop on Text Process- ing for Modern Greek: From Symbolic to Statisti- cal Approaches" (held in conjunction with the 6 th International Conference of Greek Linguistics), Rethymno, Greece, 20 September. ISBN:960- 88268-0-2.
- P. Labropoulou, E. Mantzari & M. Gavrilidou. 1996. Lexicon-Morphosyntactic Specifications: Language Specific Instantiation (Greek), PP- PAROLE, MLAP 63-386 report. METEOR: TUhttp://www-2.cs.cmu.edu/~banerjee/
- K. McTait. 2003. Translation Patterns, Linguistic Knowledge and Complexity in EBMT. In "Recent Advances in Example-Based Machine Transla- tion", M. Carl and A. Way (eds.) Kluwer Aca- demic Publishers, pp. 307-338.
- M. Nagao. 1984. A Framework of a Mechanical Translation between Japanese and English by Analogy Principle. In "Artificial and Human In- telligence", A. Elithorn and R. Banerji (eds). North-Holland.
- NIST: TUhttp://www.nist.gov/speech/tests/mt/UT K. A. Papineni, S. Roukos, T. Ward & W. J. Zhu. 2002. Bleu, a method for automatic evaluation of Machine Translation. In "Proceedings of the 40P th P Annual Meeting of the Association for Computa- tional Linguistics", Philadelphia (USA), pages 311-318.
- M. Popovic and H. Ney. 2005. Exploiting Phrasal Lexica and Additional Morpho-syntactic Lan- guage Resources for Statistical Machine Transla- tion with Scarce Training Data. EAMT 10 th An- nual Conference, 30-31 May, Budapest, Hungary.
- V. Vandeghinste 2005. Manual for ShaRPa 2.0. Internal Report, Centre for Computational Lin- guistics, K.U.Leuven.
- A. Way 2003. Translating with Examples: The LFG-DOT Models of Translation, In "Recent Ad- vances in Example-Based Machine Translation", M. Carl and A. Way (eds.). Kluwer Academic Publishers, pages 443-472.