Skip to main content

A comparison of two logistic regression approaches for case-control data with missing haplotypes

Resource type
Thesis type
(Project) M.Sc.
Date created
2005
Authors/Contributors
Abstract
In a case-control study, subjects are selected according to disease status and their risk factors are determined retrospectively. When risk factors are fully observed for all subjects, maximum-likelihood inference of disease associations may be obtained by applying prospective logistic regression to case-control data as though it were collected prospectively. We investigate the statistical properties of prospective maximum-likelihood (PML) inference of disease associations with risk factors known as haplotypes when haplotype phase is not fully observed in some subjects. We motivate applying PhlL to case-control data and compare PML to an estimating equation (EE) approach developed specifically for such data. We conduct limited simulations of case-control data to investigate the bias of PhlL and EE, both in estimated haplotype risks and in their standard errors. PhlL performed well in the simulation configurations we considered. By contrast, EE gave anticonservative inference when there was marked haplotype ambiguity.
Document
Copyright statement
Copyright is held by the author.
Permissions
The author has not granted permission for the file to be printed nor for the text to be copied and pasted. If you would like a printable copy of this thesis, please contact summit-permissions@sfu.ca.
Scholarly level
Language
English
Download file Size
etd1785.pdf 624.34 KB

Views & downloads - as of June 2023

Views: 0
Downloads: 0