Spam Detection using N-gram Analysis and Machine Learning Techniques

dc.contributor.authorKaur, Simran
dc.contributor.supervisorTraore, Issa
dc.date.accessioned2019-12-17T23:31:20Z
dc.date.available2019-12-17T23:31:20Z
dc.date.copyright2019en_US
dc.date.issued2019-12-17
dc.degree.departmentDepartment of Electrical and Computer Engineering
dc.degree.levelMaster of Engineering M.Eng.en_US
dc.description.abstractThere are many types of fraudulent activities happening today that open the loopholes of security, but email is a cheaper and widely known method for delivering false messages to potential victims. Spam is a form of email messages that is not only annoying for users but can provide a conduit for fraudulent or deceptive content delivery. In this project, a spam detector to identify an email as either spam or ham is built using n-gram analysis and supervised machine learning models. Three different algorithms are implemented and compared, namely naïve-Bayes, logistic regression and support vector machines (SVM). Experimental evaluation of the detector using a public dataset shows that the SVM and logistic regression attain the highest accuracy.en_US
dc.description.scholarlevelGraduateen_US
dc.identifier.urihttp://hdl.handle.net/1828/11375
dc.language.isoenen_US
dc.rightsAvailable to the World Wide Weben_US
dc.subjectN-gramsen_US
dc.subjectSpam detectionen_US
dc.titleSpam Detection using N-gram Analysis and Machine Learning Techniquesen_US
dc.typeprojecten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kaur_Simran_MEng_2019.pdf
Size:
885.31 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: