A study of semantics across different representations of language

dc.contributor.author: Dharmaretnam, Dhanush
dc.contributor.supervisor: Fyshe, Alona
dc.date.accessioned: 2018-05-28T20:59:24Z
dc.date.available: 2018-05-28T20:59:24Z
dc.date.copyright: 2018
dc.date.issued: 2018-05-28
dc.degree.department: Department of Computer Science
dc.degree.level: Master of Science M.Sc.
dc.description.abstract: Semantics is the study of meaning, and here we explore it through three major representations: brain, image, and text. Researchers have performed various studies to understand the similarities between semantic features across all three representations. Distributional Semantic (DS) models, or word vectors, trained on text corpora have been widely used to study the convergence of semantic information in the human brain. Moreover, they have been incorporated into various NLP applications such as document categorization, speech-to-text, and machine translation. Due to their widespread adoption by researchers and industry alike, it becomes imperative to test and evaluate the performance of different word vector models. In this thesis, we publish the second iteration of BrainBench, a system designed to evaluate and benchmark word vectors using brain data, incorporating two new Italian brain datasets collected using fMRI and EEG technology. In the second half of the thesis, we explore semantics in Convolutional Neural Networks (CNNs). CNNs are computational models that represent the state of the art for object recognition from images. However, these networks are currently considered black boxes, and there is an apparent lack of understanding of why some CNN architectures perform better than others. In this thesis, we also propose a novel method to understand CNNs by studying the semantic representations across their hierarchical layers. The convergence of semantic information in these networks is studied with the help of DS models, following methodologies similar to those used to study semantics in the human brain. Our results provide substantial evidence that Convolutional Neural Networks do learn semantics from images, and that the features learned by CNNs correlate with the semantics of the objects in the images. Our methodology and results could potentially pave the way for improved design and debugging of CNNs.
dc.description.scholarlevel: Graduate
dc.identifier.uri: http://hdl.handle.net/1828/9399
dc.language: English
dc.language.iso: en
dc.rights: Available to the World Wide Web
dc.subject: Computational linguistics
dc.subject: Semantics
dc.subject: Semantics in Brain
dc.subject: Convolutional Neural Networks
dc.subject: Deep learning
dc.title: A study of semantics across different representations of language
dc.type: Thesis
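The abstract above compares word vectors against two other representations of meaning: recorded brain activity (BrainBench) and the layer-wise activations of a CNN. One common way to run such a comparison is a representational-similarity 2 vs. 2 test, which asks whether two representations agree on which concepts are similar to which. The sketch below is purely illustrative and is not taken from the thesis or from BrainBench itself; the function names, the choice of cosine similarity with Pearson correlation, and the random stand-in data are all assumptions made for this example.

# Illustrative sketch only: a representational-similarity-style comparison between
# word vectors and a second representation (brain responses or CNN-layer
# activations). Names and data here are hypothetical, not from the thesis.
import itertools

import numpy as np
from scipy.spatial.distance import pdist, squareform
from scipy.stats import pearsonr


def similarity_matrix(features):
    # Concept-by-concept cosine similarity for an (n_concepts, n_dims) array.
    return 1.0 - squareform(pdist(features, metric="cosine"))


def two_vs_two_accuracy(rep_a, rep_b):
    # 2 vs. 2 test: for every pair of concepts, check whether the correct
    # matching of the two representations gives a higher similarity-profile
    # correlation than the swapped matching. Chance level is 0.5.
    sim_a = similarity_matrix(rep_a)
    sim_b = similarity_matrix(rep_b)
    n = sim_a.shape[0]
    pairs = list(itertools.combinations(range(n), 2))
    wins = 0
    for i, j in pairs:
        # Compare similarity profiles, leaving out the two test concepts.
        keep = [k for k in range(n) if k not in (i, j)]
        a_i, a_j = sim_a[i, keep], sim_a[j, keep]
        b_i, b_j = sim_b[i, keep], sim_b[j, keep]
        correct = pearsonr(a_i, b_i)[0] + pearsonr(a_j, b_j)[0]
        swapped = pearsonr(a_i, b_j)[0] + pearsonr(a_j, b_i)[0]
        wins += correct > swapped
    return wins / len(pairs)


# Random stand-in data: 60 concepts, 300-d word vectors, 1000-d brain or
# CNN-layer features. A real experiment would load aligned concept data.
rng = np.random.default_rng(0)
word_vectors = rng.standard_normal((60, 300))
other_representation = rng.standard_normal((60, 1000))
print(two_vs_two_accuracy(word_vectors, other_representation))

On genuinely related representations (for example, word vectors versus fMRI responses for the same concepts, or versus CNN-layer activations for images of those concepts), accuracies reliably above 0.5 would indicate shared semantic structure; for unrelated data, as here, the score stays near chance.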

Files

Original bundle
Name: Dharmaretnam_Dhanush_Msc_2018.pdf
Size: 28.09 MB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission