Music Genre Conversion
Authors: Tung Nguyen
Advisor: Edward M. Reingold
Details
Fast Fourier Transform has been the backbone of signal processing. Under professor Edward M. Reingold, I conducted an independent research study on Fast Fourier Transform and its application on audio processing.
The goal of the project is to create a music genre-conversion model. Given an audio file, the model would algorithmatically modify the song to a different genre. The problem statement would then be:
Create a Folk version of Nirvana's Smells Like Teen Spirit using Deep Neural Network
Approach
- Created an adversarial network including one conversion model and an adversarial music genre classifier.
- Using Short Time Fourier Transform, vectorized audio files in GTZAN Dataset.
- Using vectorize audio as input, created a music genre classification model.
- Trained a music generation model for each genre using GTZAN as our training data. These models are called
genre blueprints
.
Results
Folk -> Rock conversion on Rock blueprints achieved 60% accuracy on genre classifier.