Resource type
Thesis type
(Thesis) Ph.D.
Date created
2024-06-28
Authors/Contributors
Author: Shavarani, Hassan S.
Abstract
Structured prediction in machine learning focuses on mapping a sequence of inputs to a sequence of outputs within a vast output space, with interconnected predictions, offering simplicity and speed while enhancing contextual understanding in NLP tasks. In this dissertation, we revisit the applicability of structured prediction in modern, intricate NLP applications. We introduce a structured prediction-based approach for extracting linguistic knowledge from pre-trained encoder-only language models, and demonstrate the effectiveness of the extracted knowledge in enhancing translation quality of encoder-decoder models. We showcase the efficacy of well-designed, simple structured prediction-based sequence labelling in handling complex entity linking with large entity vocabularies. Our proposed method, SpEL, not only simplifies and accelerates the process but also achieves state-of-the-art results on a prominent entity linking benchmark dataset. Furthermore, we investigate Entity Retrieval, the application of our structured prediction-based entity linking framework as an alternative strategy to prevalent dense retrieval methods in retrieval-augmented question answering, particularly for factual questions about the real world. Our research underscores structured prediction as a compelling approach for modelling complex NLP tasks, particularly when prioritizing computational efficiency and high accuracy. We conclude the dissertation with a review of additional contributions that either diverge from the primary focus or involve shared authorship, even if they pertain to the central theme of the dissertation.
Document
Extent
107 pages.
Identifier
etd23144
Copyright statement
Copyright is held by the author(s).
Supervisor or Senior Supervisor
Thesis advisor: Sarkar, Anoop
Language
English
Member of collection
Download file | Size |
---|---|
etd23144.pdf | 1.57 MB |