Skip to main content

Exploiting target syntax with structured decoding

Resource type
Thesis type
(Thesis) M.Sc.
Date created
Incorporating structured target syntax in a neural machine translation model requires an effective tree-structured neural generator. We exploit a top-down tree-structured model called DRNN (Doubly-Recurrent Neural Networks) first proposed by Alvarez-Melis and Jaakola (2017) to create an NMT model called Seq2DRNN that combines a sequential encoder with tree-structured decoding augmented with a syntax-aware attention model. Unlike previous approaches to syntax-based NMT which use dependency parsing models, our method uses constituency parsing which we argue provides useful information for translation. In addition, we use the syntactic structure of the sentence to add new connections to the tree-structured decoder neural network (Seq2DRNN+SynC). We compare our NMT model with sequential and state of the art syntax-based NMT models and show that our model produces more fluent translations with better reordering. Since our model is capable of doing translation and constituency parsing at the same time we also compare our parsing accuracy against other neural parsing models. We also show that our proposed model is capable of learning even more strictly defined programming language syntax by modelling its Abstract Syntactic Trees, reaching new state-of-the-art exact match accuracy on Django dataset without having to resort to syntactic rule-based decoder.
Copyright statement
Copyright is held by the author.
This thesis may be printed or downloaded for non-commercial research and scholarly purposes.
Scholarly level
Supervisor or Senior Supervisor
Thesis advisor: Sarkar, Anoop
Member of collection

Views & downloads - as of June 2023

Views: 34
Downloads: 0