Iimi: A novel automated workflow for plant virus diagnostics from high-throughput sequencing data

dc.contributor.authorNing, Haochen
dc.contributor.supervisorZhang, Xuekui
dc.date.accessioned2023-08-31T22:38:02Z
dc.date.copyright2023en_US
dc.date.issued2023-08-31
dc.degree.departmentDepartment of Mathematics and Statisticsen_US
dc.degree.levelMaster of Science M.Sc.en_US
dc.description.abstractSeveral workflows have been developed for the diagnostic testing of plant viruses using high-throughput sequencing methods. Most of these workflows require considerable expertise and input from the analyst to perform and interpret the data when deciding on a plant’s disease status. The most common detection methods use workflows based on de novo assembly and/or read mapping. Existing virus detection software mainly uses simple deterministic rules for decision-making, requiring a certain level of understanding of virology when interpreting the results. This can result in inconsistencies in data interpretation between analysts which can have serious ramifications. To combat these challenges, we developed an automated workflow using machine-learning methods, decreasing human interaction while increasing recall, precision, and consistency. Our workflow involves sequence data mapping, feature extraction, and machine learning model training. Using real data, we compared the performance of our method with other popular approaches and show our approach increases recall and precision while decreasing the detection time for most types of sequencing data.en_US
dc.description.embargo2025-08-18
dc.description.scholarlevelGraduateen_US
dc.identifier.urihttp://hdl.handle.net/1828/15329
dc.languageEnglisheng
dc.language.isoenen_US
dc.rightsAvailable to the World Wide Weben_US
dc.subjectmachine learningen_US
dc.subjectvirus diagnosticsen_US
dc.titleIimi: A novel automated workflow for plant virus diagnostics from high-throughput sequencing dataen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Ning_Haochen_MSc_2023.pdf
Size:
2.92 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2 KB
Format:
Item-specific license agreed upon to submission
Description: