Learning audio features for genre classification with deep belief networks

Noolu, Satya

Learning audio features for genre classification with deep belief networks

Files

Noolu_Satya_MSc_2018.pdf (1.19 MB)

Date

2018-12-04

Authors

Noolu, Satya

Abstract

Feature extraction is a crucial part of many music information retrieval(MIR) tasks. In this project, I present an end to end deep neural network that extract the features for a given audio sample, performs genre classification. The feature extraction is based on Discrete Fourier Transforms (DFTs) of the audio. The extracted features are used to train over a Deep Belief Network (DBN). A DBN is built out of multiple layers of Randomized Boltzmann Machines (RBMs), which makes the system a fully connected neural network. The same network is used for testing with a softmax layer at the end which serves as the classifier. This entire task of genre classification has been done with the Tzanetakis dataset and yielded a test accuracy of 74.6%.

Keywords

Machine Learning, Neural Networks, Deep Belief Networks, Music Information Retrieval, Google cloud platform

URI

http://hdl.handle.net/1828/10380

Collections

Graduate Projects (Computer Science)

Full item page

Learning audio features for genre classification with deep belief networks

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections