Grewal, Ajeet

Resource type

Thesis

Thesis type

(Thesis) M.Sc.

Date created

2009

Authors/Contributors

Author: Grewal, Ajeet

Abstract

Statistical machine translation (SMT) systems use statistical learning methods to learn how to translate from large amounts of parallel training data. Unfortunately, SMT systems are tuned to the domain of the training data and need to be adapted before they can be used to translate data in a different domain. First, we consider a semi-supervised technique to perform model adaptation. We explore new feature extraction techniques, feature combinations and their effects on performance. In addition, we introduce an unsupervised variant of Minimum Error Rate Training (MERT), which can be used to tune the SMT model parameters. We do this by using another SMT model that translates in the reverse direction. We apply this variant of MERT to the model adaptation task. Both of the techniques we explore in this thesis produce promising results in exhaustive experiments we performed for translation from French to English in different domains.

Keywords

Copyright statement

Copyright is held by the author.

Scholarly level

Graduate student (Masters)

Language

English

Member of collection

Computing Science Theses

Download file	Size
ETD4817.pdf	467.74 KB

Model adaptation for statistical machine translation

Keywords

Views & downloads - as of June 2023