All-against-all approximate substring matching

dc.contributor.authorBarsky, Marina
dc.contributor.supervisorThomo, Alex
dc.contributor.supervisorUpton, Christopher
dc.date.accessioned2010-01-21T17:05:47Z
dc.date.available2010-01-21T17:05:47Z
dc.date.copyright2006en
dc.date.issued2010-01-21T17:05:47Z
dc.degree.departmentDepartment of Computer Science
dc.degree.levelMaster of Science M.Sc.en
dc.description.abstractFinding local regions of high similarity in a set of strings is of great importance in biological sequence analysis. This problem is far from being efficiently solved. In this thesis we study the best known solutions to this problem. We present a new and efficient algorithm to solve the "threshold all vs. all" variant of the problem. which involves searching two strings (with length N and M respectively) for all maximal approximate substring matches of length at least S, with up to K differences. The algorithm is based on a novel graph model and solves the problem in time O(NMK2). We also explore the possibility of extending our approach to the local alignment problem for multiple strings. Our developed program is a practical solution that detects similar regions in a set of strings in a feasible time, for cases of practical importance.en
dc.identifier.urihttp://hdl.handle.net/1828/2090
dc.languageEnglisheng
dc.language.isoenen
dc.rightsAvailable to the World Wide Weben
dc.subjectbiochemistryen
dc.subjectdata processingen
dc.subjectbioinformaticsen
dc.subject.lcshUVic Subject Index::Sciences and Engineering::Applied Sciences::Computer scienceen
dc.titleAll-against-all approximate substring matchingen
dc.typeThesisen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Barsky_M_MSc.pdf
Size:
10.01 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.83 KB
Format:
Item-specific license agreed upon to submission
Description: