Quality estimation for translation selection
Abstract
We describe experiments on quality estimation to select the best translation among multiple options for a given source sentence. We consider a realistic and challenging setting where the translation systems used are unknown and no relative quality assessments are available for training the prediction models. Our findings indicate that prediction errors are higher in this blind setting than in non-blind settings. However, the higher errors do not have a negative impact on performance when the predictions are used to select the best translation. This holds even when the test conditions (text domains, MT systems) differ from the conditions under which the models were built. In addition, we experiment with quality prediction for translations produced by both translation systems and human translators. Although the latter are on average of much higher quality, we show that automatically distinguishing the two types of translation is not a trivial problem.
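As a rough illustration of the selection step described above, the following minimal sketch scores every candidate translation with a single quality predictor and keeps the highest-scoring one. The names `select_best` and `toy_quality` are hypothetical, and `toy_quality` is only a length-based placeholder standing in for a trained quality estimation model; it is not the model used in the experiments. In keeping with the blind setting, the scorer sees only the source and the candidate text, never the identity of the system that produced the candidate.

```python
from typing import Callable, Sequence


def toy_quality(source: str, translation: str) -> float:
    """Placeholder scorer (assumption): penalises length mismatch in tokens.

    A real quality estimation model would be trained on labelled data;
    this function exists only so the sketch can run end to end.
    """
    return -abs(len(source.split()) - len(translation.split()))


def select_best(
    source: str,
    candidates: Sequence[str],
    predict_quality: Callable[[str, str], float] = toy_quality,
) -> str:
    """Return the candidate with the highest predicted quality.

    The predictor is applied to each (source, candidate) pair independently,
    so no information about the originating system is needed.
    """
    if not candidates:
        raise ValueError("at least one candidate translation is required")
    return max(candidates, key=lambda t: predict_quality(source, t))


if __name__ == "__main__":
    src = "das ist ein kleiner Test"
    options = ["this is a test", "this is a small test", "test"]
    print(select_best(src, options))
```

Because selection depends only on the relative ordering of the predicted scores, a predictor with a higher absolute error can still pick the same candidate, which is consistent with the finding reported above.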