Battling the Internet water army: detection of hidden paid posters.
| dc.contributor.author | Chen, Cheng | |
| dc.contributor.supervisor | Wu, Kui | |
| dc.contributor.supervisor | Srinivasan, Venkatesh | |
| dc.date.accessioned | 2012-07-04T15:29:59Z | |
| dc.date.available | 2012-07-04T15:29:59Z | |
| dc.date.copyright | 2012 | en_US |
| dc.date.issued | 2012-07-04 | |
| dc.degree.department | Department of Computer Science | |
| dc.degree.level | Master of Science M.Sc. | en_US |
| dc.description.abstract | Online social media, such as news websites and community question answering (CQA) portals, have made useful information accessible to more people. However, many online comment areas and communities are flooded with fraudulent information. These messages come from a special group of online users called online paid posters, known in China as the "Internet water army", who represent a new type of online job opportunity. Online paid posters get paid for posting comments or articles on different online communities and websites for hidden purposes, e.g., to influence the opinion of other people towards certain social events or business markets. Though an interesting strategy in business marketing, paid posters may have a significant negative effect on online communities, since the information from paid posters is usually not trustworthy. We thoroughly investigate the behavioral pattern of online paid posters based on real-world trace data from the social comments of a business conflict. We design and validate a new detection mechanism, including both non-semantic analysis and semantic analysis, to identify potential online paid posters. Test results on real-world datasets, using both supervised and unsupervised approaches, show very promising performance. | en_US |
| dc.description.scholarlevel | Graduate | en_US |
| dc.identifier.uri | http://hdl.handle.net/1828/4044 | |
| dc.language | English | eng |
| dc.language.iso | en | en_US |
| dc.rights.temp | Available to the World Wide Web | en_US |
| dc.subject | online paid posters | en_US |
| dc.subject | machine learning | en_US |
| dc.subject | spam detection | en_US |
| dc.subject | behavioral pattern | en_US |
| dc.title | Battling the Internet water army: detection of hidden paid posters. | en_US |
| dc.type | Thesis | en_US |