Intra-topic clustering for social media

dc.contributor.authorGondhi, Uttej Reddy
dc.contributor.supervisorNeville, Stephen William
dc.date.accessioned2020-08-29T06:09:49Z
dc.date.available2020-08-29T06:09:49Z
dc.date.copyright2020en_US
dc.date.issued2020-08-28
dc.degree.departmentDepartment of Electrical and Computer Engineeringen_US
dc.degree.levelMaster of Applied Science M.A.Sc.en_US
dc.description.abstractWith the social media platforms leading the internet in terms of user base and the average time spent, significant amount of data is being generated by these platforms every day. This makes social media platforms a go-to place to understand the reviews, trends, and opinions of the people. Any regular search for a popular topic would result in an abundance of information and thus it is impossible to go through these large amounts of data manually to understand the trends. This thesis discusses techniques for the intra-topic clustering of such social media data and discusses how social media noise increases the redundancy of the search results. Our goal is to filter the amount of redundant information an end-user must review from a regular social media search. The research proposes clustering models based on two string similarity measures Jaccard word token and T-Information distance. Evaluation parameters are introduced and the models are evaluated on clustering a set of current and historical topics to determine which techniques are the most effective.en_US
dc.description.scholarlevelGraduateen_US
dc.identifier.urihttp://hdl.handle.net/1828/12058
dc.languageEnglisheng
dc.language.isoenen_US
dc.rightsAvailable to the World Wide Weben_US
dc.subjectSocialmediaen_US
dc.subjectclusteringen_US
dc.subjectintra-topic clusteringen_US
dc.subjectTweet clusteringen_US
dc.titleIntra-topic clustering for social mediaen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Gondhi_Uttej_Reddy_MASc_2020.pdf
Size:
3.69 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: