Can we trust that the attention heatmaps produced by a neural machine translation (NMT) model reflect its true internal reasoning? We isolate and examine in detail the notion of faithfulness in NMT models. We propose a measure of faithfulness for NMT based on a variety of stress tests in which model parameters are perturbed and faithfulness is measured by how often each individual output changes. We show that this faithfulness measure can be improved using a novel differentiable objective that rewards faithful model behaviour through a probability-divergence term. Experimental results on multiple language pairs show that our objective is effective at increasing faithfulness and can lead to a useful analysis of NMT model behaviour and more trustworthy attention heatmaps. The proposed objective improves faithfulness without reducing translation quality; it also appears to have a useful regularization effect on the NMT model and can even improve translation quality in some cases.
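To make the perturbation-based stress test and the divergence-based objective concrete, the following is a minimal PyTorch-style sketch, not the thesis's actual implementation. The function names (`perturb_parameters`, `faithfulness_score`, `faithfulness_loss`), the Gaussian noise model, the `noise_std` value, the assumed forward pass returning per-token logits, and the use of KL divergence as the probability-divergence term are all assumptions made for illustration.

```python
import copy

import torch
import torch.nn.functional as F


def perturb_parameters(model, noise_std=0.01):
    """Return a copy of the model with Gaussian noise added to its parameters.

    The noise model and `noise_std` are assumptions for this sketch.
    """
    perturbed = copy.deepcopy(model)
    with torch.no_grad():
        for p in perturbed.parameters():
            p.add_(torch.randn_like(p) * noise_std)
    return perturbed


def faithfulness_score(model, src, n_trials=10, noise_std=0.01):
    """Fraction of output tokens unchanged under parameter perturbation.

    A higher score means outputs are more stable under the stress test.
    Assumes `model(src)` returns per-token logits of shape
    [batch, length, vocab].
    """
    with torch.no_grad():
        base = model(src).argmax(dim=-1)  # reference predictions
        unchanged, total = 0, 0
        for _ in range(n_trials):
            out = perturb_parameters(model, noise_std)(src).argmax(dim=-1)
            unchanged += (out == base).sum().item()
            total += base.numel()
    return unchanged / total


def faithfulness_loss(model, src, noise_std=0.01):
    """Differentiable penalty: KL divergence between the model's output
    distribution and that of a parameter-perturbed copy; minimizing it
    rewards perturbation-stable (faithful) behaviour."""
    logits = model(src)
    with torch.no_grad():  # the perturbed copy is held fixed
        pert_logits = perturb_parameters(model, noise_std)(src)
    return F.kl_div(F.log_softmax(logits, dim=-1),
                    F.softmax(pert_logits, dim=-1),
                    reduction="batchmean")
```

In training, a term like `faithfulness_loss` would presumably be added to the usual cross-entropy translation loss with a weighting coefficient; because the perturbed copy is computed without gradients, the penalty updates only the unperturbed model.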
Copyright is held by the author(s).
This thesis may be printed or downloaded for non-commercial research and scholarly purposes.
Thesis advisor: Sarkar, Anoop