Learning audio features for genre classification with deep belief networks

dc.contributor.authorNoolu, Satya
dc.contributor.supervisorTzanetakis, George
dc.date.accessioned2018-12-04T19:52:35Z
dc.date.available2018-12-04T19:52:35Z
dc.date.copyright2018en_US
dc.date.issued2018-12-04
dc.degree.departmentDepartment of Computer Scienceen_US
dc.degree.levelMaster of Science M.Sc.en_US
dc.description.abstractFeature extraction is a crucial part of many music information retrieval(MIR) tasks. In this project, I present an end to end deep neural network that extract the features for a given audio sample, performs genre classification. The feature extraction is based on Discrete Fourier Transforms (DFTs) of the audio. The extracted features are used to train over a Deep Belief Network (DBN). A DBN is built out of multiple layers of Randomized Boltzmann Machines (RBMs), which makes the system a fully connected neural network. The same network is used for testing with a softmax layer at the end which serves as the classifier. This entire task of genre classification has been done with the Tzanetakis dataset and yielded a test accuracy of 74.6%.en_US
dc.description.scholarlevelGraduateen_US
dc.identifier.urihttp://hdl.handle.net/1828/10380
dc.language.isoenen_US
dc.rightsAvailable to the World Wide Weben_US
dc.subjectMachine Learningen_US
dc.subjectNeural Networksen_US
dc.subjectDeep Belief Networksen_US
dc.subjectMusic Information Retrievalen_US
dc.subjectGoogle cloud platformen_US
dc.titleLearning audio features for genre classification with deep belief networksen_US
dc.typeProjecten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Noolu_Satya_MSc_2018.pdf
Size:
1.19 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: