Tanapom Ueungwetwanit. An Analytical Process Using Pooled DNA on SNP Array Data for Screening Susceptibility Genes of Type 2 Diabetes. Master's Degree(Bioinformatics). King Mongkut's University of Technology Thonburi. KMUTT Library.. : King Mongkut's University of Technology Thonburi, 2007.
An Analytical Process Using Pooled DNA on SNP Array Data for Screening Susceptibility Genes of Type 2 Diabetes
การวิเคราะห์ข้อมมูล Pooled DNA จาก SNP ไมโครอาเรย์ เพื่อหาปัจจัยเสี่ยง ทางพัธูกรรมของเบาหวานชนิดที่ 2 ในคนไทย
Abstract:
Diabetes is a major public health problem around the world. Prevalence of this disease
has reached about 150 million cases globally, and it is expected to hit 300 million by the
year 2025. Diabetes is a multifactor disease that is polygenetic and depends on environmental
factors. Type 2 diabetes (T2DM), accounting for 90-95% of all cases diagnosed worldwide, can be
associated with serious complications such as heart disease, stroke, high blood pressure, blindness,
chronic kidney disease and premature death. Type 2 diabetes arises from the complex interplay of various
pathophysiologic mechanisms involving insulin resistance and relative insulin insufficiency. Through its
adverse impact on insulin action, obesity is a major risk factor for the disease. By combining the strength of DNA
pooling to genotype large number of individuals and the strength of microarray to genotype large number of SNPs
by allelotyping, pooled DNA on single nucleotide polymorphism (SNP) microarray analysis is an emergent technique
to study gene-disease association. Although this technique is acceptable to use in study of genetic association test, a standard
method of analysis has not yet been developed. The aims of this study are to investigate a suitable analysis of pooled DNA
and identify susceptibility genes in type 2 diabetes for Thais. Microarray data preprocessing is performed prior to statistical
analysis in order to account for technical variability. Two normalization methods were investigated, Cyclic Loess and Quantile,
before identifying susceptible genes of Type 2 Diabetes. The probe data set is filtered by discrimination score and range of estimate
allele frequency to remove unreliable probe intensities. The results show that both normalizations can minimize signal intensity variation;
however, there is no significant difference in identifying significant S1\TPs. The probe data set that uses only perfect match intensity is preferred.
The mismatch probe intensities are used to exclude unreliable SNPs in the step of filtering. The amounts of associated SNPs to non-obese
T2DM and obese T2DM are 92 and 177 SNPs respectively at p-value of 0.003 as well as the number of nominated SNPs to nonobese
T2DM and obese T2DM for further validation contain 16 SNPs from 13 genes, and 8 SNPs from 6 genes, respectively.