I used the following data sources:
- The Lakh MIDI Dataset v0.1
- GTZAN dataset
- giantsteps-mtg-key-dataset
- giantsteps-key-dataset
- directional_cnns
I implemented a CNN that closely follows the architecture described in this paper:
I also added spectral-focused layers at the beginning of the network, as used in this paper:
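
To illustrate what a spectral-focused layer could look like, here is a minimal sketch in plain Python. It assumes that "spectral-focused" means a convolution whose kernel is tall along the frequency axis and only one frame wide along the time axis, so each output value mixes neighbouring frequency bins within a single time frame. That reading is an assumption, and this is not the architecture from the cited paper. The function names `conv2d_valid` and `spectral_kernel` and the toy spectrogram are hypothetical, chosen only for illustration.

```python
def conv2d_valid(spec, kernel):
    """'Valid' 2-D cross-correlation; both inputs are indexed [frequency][time]."""
    n_f, n_t = len(spec), len(spec[0])
    k_f, k_t = len(kernel), len(kernel[0])
    out = []
    for f in range(n_f - k_f + 1):
        row = []
        for t in range(n_t - k_t + 1):
            acc = 0.0
            for i in range(k_f):
                for j in range(k_t):
                    acc += spec[f + i][t + j] * kernel[i][j]
            row.append(acc)
        out.append(row)
    return out


def spectral_kernel(height):
    """Hypothetical (height x 1) averaging kernel: spans frequency, one time frame."""
    return [[1.0 / height] for _ in range(height)]


# Toy spectrogram: 4 frequency bins x 3 time frames (middle frame is silent).
spec = [
    [1.0, 0.0, 2.0],
    [3.0, 0.0, 4.0],
    [5.0, 0.0, 6.0],
    [7.0, 0.0, 8.0],
]

out = conv2d_valid(spec, spectral_kernel(2))
for row in out:
    print(row)
# [2.0, 0.0, 3.0]
# [4.0, 0.0, 5.0]
# [6.0, 0.0, 7.0]
```

The silent middle frame stays at zero in the output, which shows that a kernel of width 1 in time never blends information across frames. In a real network the kernel weights would be learned rather than fixed to an average, and the layer would come from a deep learning framework rather than hand-written loops.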