Model adaptation for statistical machine translation

Resource type
Thesis type
(Thesis) M.Sc.
Date created
2009
Authors/Contributors
Abstract
Statistical machine translation (SMT) systems use statistical learning methods to learn how to translate from large amounts of parallel training data. Unfortunately, SMT systems are tuned to the domain of the training data and need to be adapted before they can be used to translate data in a different domain. First, we consider a semi-supervised technique to perform model adaptation. We explore new feature extraction techniques, feature combinations and their effects on performance. In addition, we introduce an unsupervised variant of Minimum Error Rate Training (MERT), which can be used to tune the SMT model parameters. We do this by using another SMT model that translates in the reverse direction. We apply this variant of MERT to the model adaptation task. Both of the techniques we explore in this thesis produce promising results in exhaustive experiments we performed for translation from French to English in different domains.
Document
Copyright statement
Copyright is held by the author.
Scholarly level
Language
English
Member of collection
Attachment Size
ETD4817.pdf 467.74 KB