Cardiff University | Prifysgol Caerdydd ORCA
Online Research @ Cardiff 
WelshClear Cookie - decide language by browser settings

Integration and biological interpretation of microarry gene expression profiling data

Menon, Suraj 2009. Integration and biological interpretation of microarry gene expression profiling data. PhD Thesis, Cardiff University.

[thumbnail of U584393.pdf] PDF - Accepted Post-Print Version
Download (33MB)

Abstract

Many different strategies have been developed for the analysis of microarray data and these have a significant influence on the level and quality of knowledge that may be achieved from a microarray-based experiment. Two such strategies are explored in this thesis. Part A of this thesis describes explorations of a resource-efficient strategy that could allow for large-scale integration of microarray data in an unsupervised fashion. For this purpose, comparisons were carried out between a series of genelists manually extracted from the literature, representing a disparate set of microarray experiments. Initial results were highly unexpected, and are likely to have been caused by violations of the assumptions of the hypergeometric test used for assessing comparisons. Statistical modelling was found to successfully simulate these results however the estimated net effect of these violations was found to be considerable. These findings strongly caution against the comparison of microarray experiments using their genelists. Part B then describes the development of Gene Set Discovery (GSD), a novel methodology to perform threshold-free gene set analysis of microarray datasets without requiring sample class information. This was achieved by deriving a novel metric that allows for the selection of those gene sets that exhibit significant discrimination between samples. GSD was implemented on four microarray datasets and the results were found to be biologically plausible and/or in agreement with prior analyses of these datasets. These findings suggested that GSD could be a potentially useful tool for biological theme discovery in microarray datasets, particularly in studies of cancer where sample classification is problematic. Also described is a related methodology for extraction of informative genes from within selected gene sets, and a scheme for visualization of results in an integrated format.

Item Type: Thesis (PhD)
Status: Unpublished
Schools: Medicine
ISBN: 9781303189562
Funders: MRC-BP Dorothy Hodgkin Postgraduate Award, Lord Merthyr Scholarship
Date of First Compliant Deposit: 30 March 2016
Last Modified: 16 Oct 2014 12:55
URI: https://orca.cardiff.ac.uk/id/eprint/55858

Actions (repository staff only)

Edit Item Edit Item

Downloads

Downloads per month over past year

View more statistics