Skip to main content

Decipherment of Evasive or Encrypted Offensive Text

Resource type
Thesis type
(Thesis) M.Sc.
Date created
2016-07-20
Authors/Contributors
Author (aut): Wu, Zhelun
Abstract
Automated filters are commonly used in on line chat to stop users from sending malicious messages such as age-inappropriate language, bullying, and asking users to expose personal information. Rule based filtering systems are the most common way to deal with this problem but people invent increasingly subtle ways to disguise their malicious messages to bypass such filtering systems. Machine learning classifiers can also be used to identify and filter malicious messages, but such classifiers rely on training data that rapidly becomes out of date and new forms of malicious text cannot be classified accurately. In this thesis, we model the disguised messages as a cipher and apply automatic decipherment techniques to decrypt corrupted malicious text back into plain text which can be then filtered using rules or a classifier. We provide experimental results on three different data sets and show that decipherment is an effective tool for this task.
Document
Identifier
etd9692
Copyright statement
Copyright is held by the author.
Permissions
This thesis may be printed or downloaded for non-commercial research and scholarly purposes.
Scholarly level
Supervisor or Senior Supervisor
Thesis advisor (ths): Sarkar, Anoop
Member of collection
Download file Size
etd9692_ZWu.pdf 1.27 MB

Views & downloads - as of June 2023

Views: 0
Downloads: 0