Paper Categorization Using Naive Bayes

Cui, Man

Paper Categorization Using Naive Bayes

dc.contributor.author	Cui, Man
dc.contributor.supervisor	Wadge, W. W.
dc.date.accessioned	2013-04-29T18:53:18Z
dc.date.available	2013-04-29T18:53:18Z
dc.date.copyright	2013	en_US
dc.date.issued	2013-04-29
dc.degree.department	Department of Computer Science
dc.degree.level	Master of Science M.Sc.	en_US
dc.description.abstract	Literature survey is a time-consuming process as researchers spend a lot of time in searching the papers of interest. While search engines can be useful in finding papers that contain a certain set of keywords, one still has to go through these papers in order to decide whether they are of interest. On the other hand, one can quickly decide which papers are of interest if each one of them is labelled with a category. The process of labelling each paper with a category is termed paper categorization, an instance of a more general problem called text classification. In this thesis, we presented a text classifier called Iris that makes use of the popular Naive Bayes algorithm. With Iris, we were able to (1) evaluate Naive Bayes using a number of popular datasets, (2) propose a GUI for assisting users with document categorization and searching, and (3) demonstrate how the GUI can be utilized for paper categorization and searching.	en_US
dc.description.proquestcode	0984	en_US
dc.description.scholarlevel	Graduate	en_US
dc.identifier.uri	http://hdl.handle.net/1828/4564
dc.language	English	eng
dc.language.iso	en	en_US
dc.rights.temp	Available to the World Wide Web	en_US
dc.subject	Text classification	en_US
dc.title	Paper Categorization Using Naive Bayes	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Cui_Man_MSc_2013.pdf
Size:: 1.22 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.74 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Electronic Theses and Dissertations (ETD)