Helicality: An Isomap-based Measure of Octave Equivalence in Audio Data

This repository accompanies the Late-Breaking Demo of the same title, presented at ISMIR 2020.
In this paper, we introduce a novel algorithm to measure the octave equivalence of audio datasets. Octave equivalence serves as domain knowledge in MIR systems, including chromagrams, spiral convolutional networks, and the harmonic CQT. Prior work has applied the Isomap manifold learning algorithm to unlabeled audio data to embed frequency sub-bands in 3-D space, where Euclidean distances are inversely proportional to the strength of their Pearson correlations. However, discovering octave equivalence via Isomap requires visual inspection and is not scalable. To address this problem, we define "helicality" as the goodness of fit of the 3-D Isomap embedding to a Shepard-Risset helix. Our method is unsupervised and uses a custom Frank-Wolfe algorithm to minimize a least-squares objective inside a convex hull. Numerical experiments indicate that isolated musical notes have the highest helicality, followed by speech and then drum hits.
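The two computational ingredients named above can be sketched roughly as follows. This is not the code in helicality.py: the function names `embed_subbands` and `project_onto_hull` are hypothetical, the input is a synthetic random walk with no octave structure, and the distance sqrt(2 - 2*rho) is only one common way to make distance shrink as the Pearson correlation rho grows. The second function is a generic Frank-Wolfe loop with exact line search for a least-squares objective over the convex hull of a vertex set, assumed here as a stand-in for the custom variant.

```python
import numpy as np
from sklearn.manifold import Isomap


def embed_subbands(features, n_neighbors=3, n_components=3):
    """Embed frequency sub-bands with Isomap (hypothetical helper).

    features: array of shape (n_samples, n_bins).
    Returns an array of shape (n_bins, n_components).
    """
    # Give every sub-band zero mean and unit L2 norm. The dot product of two
    # columns is then their Pearson correlation rho, and their Euclidean
    # distance is sqrt(2 - 2 * rho).
    centered = features - features.mean(axis=0, keepdims=True)
    normalized = centered / np.linalg.norm(centered, axis=0, keepdims=True)
    isomap = Isomap(n_neighbors=n_neighbors, n_components=n_components)
    # Transpose so that each sub-band (not each sample) is one point.
    return isomap.fit_transform(normalized.T)


def project_onto_hull(vertices, target, n_iter=1000, tol=1e-12):
    """Minimize ||x - target||^2 over the convex hull of `vertices`.

    Generic Frank-Wolfe with exact line search (an assumption, not the
    repository's custom variant). Returns the point and its convex weights.
    """
    weights = np.full(len(vertices), 1.0 / len(vertices))
    for _ in range(n_iter):
        point = weights @ vertices
        gradient = 2.0 * (point - target)
        # Linear minimization oracle: the vertex most aligned with -gradient.
        best = int(np.argmin(vertices @ gradient))
        direction = vertices[best] - point
        gap = -gradient @ direction  # Frank-Wolfe duality gap
        if gap <= tol:
            break
        # Exact line search for a quadratic objective, clipped to [0, 1].
        step = min(1.0, gap / (2.0 * direction @ direction))
        weights *= 1.0 - step
        weights[best] += step
    return weights @ vertices, weights


# Synthetic stand-in for sub-band features: a random walk across 24 bins,
# so that neighbouring bins are strongly correlated.
rng = np.random.default_rng(0)
features = np.cumsum(rng.standard_normal((500, 24)), axis=1)
embedding = embed_subbands(features)
print(embedding.shape)

# Project a point lying outside the unit square onto the square.
square = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
projection, weights = project_onto_hull(square, np.array([2.0, 0.5]))
print(projection)
```

Frank-Wolfe suits this kind of constraint because each iteration only needs the best vertex of the hull, so the iterate stays a convex combination of vertices without any explicit projection step.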

Dependencies

mir-data
sklearn, scipy, numpy (core numerical computation)
librosa (audio feature extraction)
matplotlib, colorcet (plotting)
h5py, json (data handling)

Download and run

Dataset features are pre-computed and stored in the corresponding .h5 files in the root directory.
Run main.py from a terminal, passing the name of the dataset you want to test with the -d flag:

python3 main.py -d tinysol

Plots are stored in the ./convexHull sub-directory by default.
Numerical results are written to <dataset>_helicality.json in the root directory.
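The layout of the results file is not documented here, so a cautious first step is to load it and look at its top-level structure before relying on any particular key. The sketch below uses only the standard library; `summarize_results` is a hypothetical helper, and the demo file contains a placeholder key rather than real results.

```python
import json
import os
import tempfile


def summarize_results(path):
    """Load a JSON results file and report the type of each top-level entry."""
    with open(path) as f:
        results = json.load(f)
    if isinstance(results, dict):
        summary = {key: type(value).__name__ for key, value in results.items()}
    else:
        summary = {"<root>": type(results).__name__}
    return results, summary


# Demo on a throwaway file with placeholder content (not real results).
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "demo_helicality.json")
    with open(demo_path, "w") as f:
        json.dump({"placeholder_key": [0.0, 0.0]}, f)
    demo_results, demo_summary = summarize_results(demo_path)

print(demo_summary)
```

To inspect an actual run, pass the path of the generated <dataset>_helicality.json file to `summarize_results` instead of the demo path.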

Datasets

TinySOL (Isolated notes played on 14 different instruments)
ENST-drums (dry_mix subset which contains isolated hits on drums)
NTVOW (North Texas Vowel Dataset, containing 12 vowel utterances from 50 speakers)
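The pre-computed features for these datasets ship as .h5 files, whose internal layout is not described in this README. A minimal sketch for listing what such a file contains with h5py, without assuming any group or dataset names, might look like this; `list_h5_datasets` is a hypothetical helper and the demo file is a placeholder written on the fly.

```python
import os
import tempfile

import h5py
import numpy as np


def list_h5_datasets(path):
    """Return (name, shape, dtype) for every dataset stored in an HDF5 file."""
    entries = []

    def visit(name, obj):
        # visititems walks groups and datasets; keep only the datasets.
        if isinstance(obj, h5py.Dataset):
            entries.append((name, obj.shape, str(obj.dtype)))

    with h5py.File(path, "r") as f:
        f.visititems(visit)
    return entries


# Demo on a throwaway file with a placeholder dataset (not real features).
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "demo.h5")
    with h5py.File(demo_path, "w") as f:
        f.create_dataset("group/placeholder", data=np.zeros((4, 2), dtype=np.float32))
    demo_entries = list_h5_datasets(demo_path)

print(demo_entries)
```

Pointing `list_h5_datasets` at one of the repository's .h5 files should reveal the actual names and shapes to use when loading features.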

Links

Pre-print
ISMIR Presentation Video
