Please use this identifier to cite or link to this item:
https://hdl.handle.net/2440/100190
Type: | Theses |
Title: | A simulation study to compare gene set analysis methods |
Author: | Pfeiffer, Andrew James |
Issue Date: | 2016 |
School/Discipline: | School of Mathematical Sciences |
Abstract: | Genome-wide association studies (GWA studies) identify alleles that are associated with a disease. These allele variations are called single nucleotide polymorphisms (SNPs). However, GWA studies do not account for interaction between SNPs. Gene set analysis (GSA) is used in GWA studies to account for interaction. GSA methods map SNPs to gene sets and identify gene sets that are associated with a disease. Comprehensive reviews of GSA exist in the literature. However, these reviews do not compare specific methods or implement them on data. In this thesis, we compare six GSA methods. We use seven factors highlighted by the reviews as important in GSA to compare these methods. For example, we analyse how each method accounts for parameters that could affect the analysis. These parameters include gene size and SNP interaction. We consider the null hypothesis tested by each method. We also analyse the sensitivity of methods to individual SNPs with small p-values. In contrast, the marginal effect of many SNPs that cause diseases is often small. The p-values of such SNPs need not be small. We conduct a simulation study to compare four GSA methods. We investigate the sensitivity of these methods to SNPs with very small p-values. We use Manhattan plots to display gene sets that were assigned disparate p-values by different methods. We also use receiver operating characteristic curves to compare the performance of each method. Finally, we recommend a method that gave excellent performance. |
Advisor: | Glonek, Garique Francis Vladimir Tuke, Simon Jonathan |
Dissertation Note: | Thesis (M.Phil.) -- University of Adelaide, School of Mathematical Sciences, 2016. |
Keywords: | genetics bioinformatics statistics |
Provenance: | This electronic version is made publicly available by the University of Adelaide in accordance with its open access policy for student theses. Copyright in this thesis remains with the author. This thesis may incorporate third party material which has been used by the author pursuant to Fair Dealing exceptions. If you are the owner of any included third party copyright material you wish to be removed from this electronic version, please complete the take down form located at: http://www.adelaide.edu.au/legals |
Appears in Collections: | Research Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01front.pdf | 361.29 kB | Adobe PDF | View/Open | |
02whole.pdf | 12.81 MB | Adobe PDF | View/Open | |
Permissions Restricted Access | Library staff access only | 197.71 kB | Adobe PDF | View/Open |
Restricted Restricted Access | Library staff access only | 12.82 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.