Comparing feature selection algorithms using microarray data

dc.contributor.authorLaw, Timothy Tao Hin
dc.contributor.supervisorLesperance, M. L.
dc.date.accessioned2008-12-22T21:39:26Z
dc.date.available2008-12-22T21:39:26Z
dc.date.copyright2008en_US
dc.date.issued2008-12-22T21:39:26Z
dc.degree.departmentDepartment of Mathematics and Statistics
dc.degree.levelMaster of Science M.Sc.en_US
dc.description.abstractIn this thesis study, three different feature selection methods, LASSO, SLR, and SMLR, were tested and compared using microarray fold change data. Two real datasets were used to first investigate and compare the ability of the algorithms in selecting feature genes on data under two conditions. It was found that SMLR was quite sensitive to its parameter, and was more accurate in selecting differentially expressed genes when compared to SLR and LASSO. In addition, the model coefficients generated by SMLR had a close relationship with the magnitude of fold changes. Also, SMLR's ability in selecting differentially expressed genes with data that had more than two conditions was shown to be successful. The results from simulation experiments agreed with the results from the real dataset experiments. Additionally, it was found that different proportions of differentially expressed genes in the data did not affect the performance of LASSO and SLR, but the number of genes selected by SMLR increased with the proportion of regulated genes. Also, as the number of replicates used to build the model increased, the number of genes selected by SMLR increased. This applied to both correctly and incorrectly selected genes. Furthermore, it was found that SMLR performed the best in identifying future treatment samples.en_US
dc.identifier.urihttp://hdl.handle.net/1828/1308
dc.languageEnglisheng
dc.language.isoenen_US
dc.rightsAvailable to the World Wide Weben_US
dc.subject.lcshUVic Subject Index::Sciences and Engineering::Mathematics::Mathematical statisticsen_US
dc.titleComparing feature selection algorithms using microarray dataen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis_final_0336724.pdf
Size:
2.11 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.94 KB
Format:
Item-specific license agreed upon to submission
Description: