Piano-GPT is a project leveraging the GPT-2 architecture for generating and processing MIDI piano music. It introduces the PIANO (Performance Inference And Note Orchestration) dataset, a multi-task benchmark for voice and dynamic reconstruction in MIDI piano rolls.
Tokenization is described in full in the midi-tokenizers repository.
The PIANO dataset is designed to standardize approaches and provide a benchmark for the music modeling community. It focuses on specific subsets of music generation tasks:
- Voice Inference: Inferring missing notes within specific voice parts of a musical composition, based on the surrounding musical context of incomplete sequences.
- Dynamic Reconstruction: Recovering notes from different volume ranges (velocity, in MIDI terminology) to challenge models in reconstructing the dynamics of a piece.
- Noise Reduction: Reconstructing original note information from noisy inputs, including pitch, velocity, and timing. This task simulates scenarios where MIDI data might be imperfectly recorded or played.
- Pitch-based Tasks: Divide notes into groups based on relative pitch height (e.g., above/below median pitch, highest/lowest quartiles).
- Volume-based Tasks: Categorize notes based on loudness (velocity), such as loud/soft or very loud/very soft.
- Denoising Tasks: Add controlled random variations to pitch, volume, or timing of notes (a minimal sketch of this appears after this list).
- Comprehensive Denoising: Combine variations in pitch, volume, and timing simultaneously.
- Performance Task: Simplify a piece by standardizing note length, volume, and timing.
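
To make the task definitions above concrete, here is a minimal sketch of how a pitch-based split and a denoising input could be produced. The note representation and parameter names are illustrative assumptions; the actual implementations live in `midi_datasets/` and `data/` and may differ.

```python
import random
import statistics

# Hypothetical note representation: the real dataset classes in
# midi_datasets/ may use a different structure (e.g. a DataFrame).
notes = [
    {"pitch": 60, "velocity": 80, "start": 0.00, "end": 0.50},
    {"pitch": 72, "velocity": 96, "start": 0.25, "end": 0.75},
    {"pitch": 48, "velocity": 40, "start": 0.50, "end": 1.00},
]

# Pitch-based task: split notes around the median pitch.
median_pitch = statistics.median(note["pitch"] for note in notes)
above = [note for note in notes if note["pitch"] > median_pitch]
below = [note for note in notes if note["pitch"] <= median_pitch]


# Denoising task: add controlled random variations to pitch, velocity,
# and timing; the model is trained to recover the original notes.
def add_noise(note: dict, max_pitch_shift: int = 2, max_time_jitter: float = 0.05) -> dict:
    noisy = dict(note)
    noisy["pitch"] += random.randint(-max_pitch_shift, max_pitch_shift)
    noisy["velocity"] = min(127, max(1, noisy["velocity"] + random.randint(-10, 10)))
    noisy["start"] += random.uniform(-max_time_jitter, max_time_jitter)
    noisy["end"] += random.uniform(-max_time_jitter, max_time_jitter)
    return noisy


noisy_notes = [add_noise(note) for note in notes]
```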
- `artifacts.py`: Utility functions and constants
- `checkpoints/`: Saved model checkpoints
- `dashboards/`: Streamlit dashboards for data visualization
- `data/`: Dataset handling and preprocessing modules
- `database/`: Database connection and management utilities
- `gpt2/`: Core GPT-2 model implementation and training scripts
- `midi_datasets/`: Custom dataset classes for MIDI data
- `scripts/`: Utility scripts for model management and evaluation
- Clone the repository:

  ```shell
  git clone https://github.com/your-username/piano-gpt.git
  cd piano-gpt
  ```

- Install dependencies:

  ```shell
  pip install -r requirements.txt
  ```
Start training with:

```shell
python -m gpt2.train
```
You can run the script in DDP mode and with a custom configuration. Configuration files live in `gpt2/configs/*.yaml`, and training hyperparameters can also be overridden from the command line, for example:
```shell
PYTHONPATH=. torchrun --nproc-per-node=8 \
    gpt2/train.py --config-name=gpt2_pretraining \
    data.batch_size=32 \
    optimizer.gradient_accumulation_steps=8 \
    optimizer.max_iters=30000 \
    data.sequence_length=4096 \
    dataset.extra_datasets="['roszcz/maestro-sustain-v2', 'roszcz/giant-midi-sustain-v2', 'roszcz/pianofor-ai-sustain-v2']" \
    dataset.augmentation.max_pitch_shift=5 \
    "dataset.augmentation.speed_change_factors=[0.975, 0.95, 1.025, 1.05]" \
    lr.warmup_iters=1000 \
    lr.learning_rate=1e-5 \
    lr.min_lr=1e-6 \
    model=gpt2_large \
    system.data_workers=64 \
    system.compile=true \
    loss_masking=pretraining \
    init_from=scratch
```
or, for downstream tasks:
```shell
PYTHONPATH=. torchrun --nproc-per-node=4 \
    gpt2/train.py --config-name=gpt2_piano \
    tasks=subsequence \
    data.batch_size=64 \
    optimizer.gradient_accumulation_steps=4 \
    optimizer.max_iters=30000 \
    data.sequence_length=1024 \
    data.notes_per_record=128 \
    dataset.extra_datasets="['roszcz/maestro-sustain-v2', 'roszcz/giant-midi-sustain-v2', 'roszcz/pianofor-ai-sustain-v2']" \
    dataset.augmentation.max_pitch_shift=5 \
    dataset.augmentation.speed_change_factors="[0.95, 1.05]" \
    lr.learning_rate=8e-5 \
    system.data_workers=128 \
    system.compile=true \
    loss_masking=finetuning \
    init_from=midi-gpt2-my-awesome-model.pt  # must be located in checkpoints/ and its name must start with midi-gpt2
```
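
The fine-tuning entry point expects the checkpoint to sit under `checkpoints/` with the required name prefix; for example (hypothetical filenames):

```shell
# Hypothetical filenames: the checkpoint must live in checkpoints/
# and its name must start with the midi-gpt2 prefix.
cp my_pretrained_checkpoint.pt checkpoints/midi-gpt2-my-awesome-model.pt
```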
```shell
python3.10 -m gpt2.prepare_tokenizer_dataset; \
python3.10 -m gpt2.train_tokenizer; \
PYTHONPATH=. torchrun --nproc-per-node=4 \
    gpt2/train.py --config-name=gpt2_pretraining \
    model=gpt2 \
    lr.learning_rate=8e-5 \
    lr.min_lr=8e-6 \
    lr.warmup_iters=1000 \
    system.data_workers=124 \
    optimizer.gradient_accumulation_steps=4 \
    task=next_token_prediction_with_composer \
    eval_iters=200 eval_interval=1000 \
    "dataset.extra_datasets=['roszcz/maestro-sustain-v2', 'roszcz/giant-midi-sustain-v2', 'roszcz/pianofor-ai-sustain-v2']" \
    data.batch_size=20 \
    data.sequence_length=4096 \
    logging.wandb_run_name_suffix=huge-pretraining-4096-ctx \
    tokenizer=awesome \
    logging.wandb_project=piano-awesome-gpt
```
`prepare_tokenizer_dataset` will create a text file in `tmp/tokenizer_datasets` containing a dump of the tokenized and augmented MAESTRO dataset, in a format the tokenizer can train on. The `train_tokenizer` script will then train an AwesomeMidiTokenizer on this data and dump the tokenizer in JSON format to `tmp/tokenizers`.

Both of these scripts use `gpt2/configs/tokenizer_training` as their default hydra config; it is equivalent to the `dataset` + `tokenizer` parts of the training config. During model training initialization, the program will look for a tokenizer saved with the same `dataset` and `tokenizer` configuration as the training config.
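
As a quick sanity check, the JSON dump can be inspected with the standard library. The glob below is illustrative; the actual filenames and keys depend on the midi-tokenizers implementation and your config:

```python
import json
from pathlib import Path

# Illustrative lookup: actual filenames in tmp/tokenizers depend on the
# dataset + tokenizer configuration used when training the tokenizer.
tokenizer_path = next(Path("tmp/tokenizers").glob("*.json"))

with tokenizer_path.open() as f:
    tokenizer_dump = json.load(f)

# The exact structure is defined by midi-tokenizers; printing the
# top-level keys shows what was serialized.
print(tokenizer_path.name, list(tokenizer_dump.keys()))
```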
Evaluate a trained model with:

```shell
python -m gpt2.eval init_from=path_to_checkpoint.pt
```
To generate music with your model, refer to https://github.com/Nospoko/piano-generation, a repository fully dedicated to generation methods.
Browse the PIANO dataset by running:

```shell
PYTHONPATH=. streamlit run dashboards/piano_dataset_review.py
```
- Download models: `python scripts/download_model.py <model_filename>`
- Upload models: `python scripts/upload_models.py`
- Run multi-task evaluation: `python scripts/run_evaluation.py <model_paths> <device> [--tasks task1 task2 ...]`
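
For example, with a hypothetical checkpoint name and the `subsequence` task identifier used in the fine-tuning example above:

```shell
# Hypothetical checkpoint path and device; task identifiers follow the
# training configs (e.g. subsequence from the fine-tuning example).
python scripts/run_evaluation.py checkpoints/midi-gpt2-my-awesome-model.pt cuda:0 --tasks subsequence
```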
The project uses Hydra for configuration management. Main configuration files are located in `gpt2/configs/`.
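
The override names used throughout this README imply a nested config layout roughly like the following hypothetical excerpt (the shipped files in `gpt2/configs/` may be organized differently):

```yaml
# Hypothetical excerpt inferred from the command-line overrides above;
# not a verbatim copy of the repository's configs.
data:
  batch_size: 32
  sequence_length: 4096

optimizer:
  gradient_accumulation_steps: 8
  max_iters: 30000

lr:
  learning_rate: 1e-5
  min_lr: 1e-6
  warmup_iters: 1000

system:
  data_workers: 64
  compile: true
```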
- This project uses the GPT-2 architecture developed by OpenAI.
- The PIANO dataset is based on the MAESTRO (MIDI and Audio Edited for Synchronous TRacks and Organization) dataset, provided by the International Piano-e-Competition and the Tensorflow Magenta team.
- MAESTRO Dataset: https://magenta.tensorflow.org/datasets/maestro
- GitHub Repository: piano-gpt
- Midi Tokenizers Repository: midi-tokenizers
- Platform for pianists and algorithmic music enthusiasts: pianoroll.io
- Oore, S., et al. (2018). This Time with Feeling: Learning Expressive Musical Performance. Neural Information Processing Systems (NeurIPS).