Classification based on supervised clustering with application to juvenile idiopathic arthritis

Date created
2013-08-16
Authors/Contributors
Author: Yang, Yuanyu
Abstract
Juvenile Idiopathic Arthritis (JIA) is the most common rheumatic disease of childhood. Our objective is to predict the results of remission so that those children who are likely to experience poor remission outcomes could benefit from early aggressive treatment. Many classification techniques could provide either a binary prediction or an estimated probability of remission. However, parents would like to know more specifically about the remission outcomes of children similar to their own. In this project, we propose a supervised clustering method that provides this information. Inspired by the basic idea of supervised principal component analysis, we perform supervision by selecting and/or weighting explanatory variables differently depending on their associations with the class response. Our supervised clustering method is applied to JIA data and to data simulated with known properties. Our method is shown to be competitive with an existing supervised clustering method, classification trees and random forests in terms of out-of-sample misclassification rates.
Document
Identifier
etd7997
Copyright statement
Copyright is held by the author.
Permissions
The author granted permission for the file to be printed and for the text to be copied and pasted.
Scholarly level
Attachment Size
etd7997_YYang.pdf 4.04 MB