Whitney, Max Everett

Resource type

Thesis

Thesis type

(Thesis) M.Sc.

Date created

2012-08-23

Authors/Contributors

Author: Whitney, Max Everett

Abstract

The Yarowsky algorithm is a simple self-training algorithm for bootstrapping learning from a small number of initial seed rules which has proven very effective in several natural language processing tasks. Bootstrapping a classifier from a small set of seed rules can be viewed as the propagation of labels between examples via features shared between them. This thesis introduces a novel variant of the Yarowsky algorithm based on this view. It is a bootstrapping learning method which uses a graph propagation algorithm with a well-defined objective function. The experimental results show that our proposed bootstrapping algorithm achieves state of the art performance or better on several different natural language data sets.

Keywords

Identifier

etd7421

Copyright statement

Copyright is held by the author.

Permissions

The author granted permission for the file to be printed and for the text to be copied and pasted.

Scholarly level

Graduate student (Masters)

Supervisor or Senior Supervisor

Thesis advisor: Sarkar, Anoop

Member of collection

Computing Science Theses

Download file	Size
etd7421_MWhitney.pdf	709.16 KB

Bootstrapping via graph propagation

Keywords

Views & downloads - as of June 2023