Model-based clustering for identifying disease-associated SNPs in case-control genome-wide association studies

dc.contributor.authorXu, Y.
dc.contributor.authorXing, L.
dc.contributor.authorSu, J.
dc.contributor.authorZhang, Xuekui
dc.contributor.authorQiu, W.
dc.date.accessioned2021-08-18T18:28:16Z
dc.date.available2021-08-18T18:28:16Z
dc.date.copyright2019en_US
dc.date.issued2019
dc.description.abstractGenome-wide association studies (GWASs) aim to detect genetic risk factors for complex human diseases by identifying disease-associated single-nucleotide polymorphisms (SNPs). The traditional SNP-wise approach along with multiple testing adjustment is over-conservative and lack of power in many GWASs. In this article, we proposed a model-based clustering method that transforms the challenging high-dimension-small-sample-size problem to low-dimension-large-sample-size problem and borrows information across SNPs by grouping SNPs into three clusters. We pre-specify the patterns of clusters by minor allele frequencies of SNPs between cases and controls, and enforce the patterns with prior distributions. In the simulation studies our proposed novel model outperforms traditional SNP-wise approach by showing better controls of false discovery rate (FDR) and higher sensitivity. We re-analyzed two real studies to identifying SNPs associated with severe bortezomib-induced peripheral neuropathy (BiPN) in patients with multiple myeloma (MM). The original analysis in the literature failed to identify SNPs after FDR adjustment. Our proposed method not only detected the reported SNPs after FDR adjustment but also discovered a novel BiPN-associated SNP rs4351714 that has been reported to be related to MM in another study.en_US
dc.description.reviewstatusRevieweden_US
dc.description.scholarlevelFacultyen_US
dc.description.sponsorshipThis work was supported by the Natural Sciences and Engineering Research Council Discovery Grants (XZ, YX), Natural Sciences and Engineering Research Council Post Doctoral Fellowship (LX), and the Canada Research Chair (XZ), and NSERC CREATE (The Visual and Automated Disease Analytics graduate training program) (YX).en_US
dc.identifier.citationXu, Y., Xing, L. Su, J., Zhang, X., & Qiu, W. (2012). Model-based clustering for identifying disease-associated SNPs in case-control genome-wide association studies. Scientific Reports, 9. https://doi.org/10.1038/s41598-019-50229-6en_US
dc.identifier.urihttps://doi.org/10.1038/s41598-019-50229-6
dc.identifier.urihttp://hdl.handle.net/1828/13276
dc.language.isoenen_US
dc.publisherScientific Reportsen_US
dc.subject.departmentDepartment of Mathematics and Statistics
dc.titleModel-based clustering for identifying disease-associated SNPs in case-control genome-wide association studiesen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
xu_yan_sci.rep_2019.pdf
Size:
1.16 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2 KB
Format:
Item-specific license agreed upon to submission
Description: