Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network For Direction-Based Speech Enhancement With Head-Mounted Microphone Arrays
This repository contains the code for reproducing the results shown in the paper "Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network For Direction-Based Speech Enhancement With Head-Mounted Microphone Arrays".
The code in the `spear-tools` submodule is subject to its own licenses. Where no license is given, all rights remain with the original author (Imperial College London).
The `FullSubNet` sub-repository is also third-party code and has separate licensing.
First, `cd` into `./spear-tools` and set up the SPEAR paths, the symbolic links, and the conda environment according to `spear-tools/README.md`.
To run the models, you will need the packages listed in `requirements.txt`. Install them into your environment `<your-env-name>` with `pip install -r requirements.txt`.
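The setup steps above can be sketched as a shell session. This is a sketch, not a verbatim recipe: the repository URL is a placeholder, `<your-env-name>` stands for the conda environment created per `spear-tools/README.md`, and exact commands may differ on your system.

```shell
# Clone with submodules so spear-tools and FullSubNet are present
git clone --recurse-submodules <this-repo-url>   # placeholder URL

# Set up the SPEAR paths, symbolic links, and conda environment
# as described in spear-tools/README.md
cd spear-tools
conda activate <your-env-name>   # env name per spear-tools/README.md

# Install the additional packages required by the models
cd ..
pip install -r requirements.txt
```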
For inference with the `MaxDirAndFullsubnet` method, you need to download the FullSubNet checkpoint model weights here and place them in `./FullSubNet`.
Model checkpoints are available here.
Please adjust paths and other variables in the scripts `train.py`, `validate.py`, `process`, `validate_baseline_unprocessed.py`, and `view_metrics.py`, as well as in the config files, as needed.
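As an illustration of the kind of adjustments meant here, the variables below are hypothetical examples, not the actual names used in the scripts; check each script and config file for the real ones.

```python
# Hypothetical path variables of the sort found near the top of the
# training/validation scripts; the actual names and values will differ.
DATASET_ROOT = "/path/to/spear/dataset"   # where the SPEAR data lives
CHECKPOINT_DIR = "./checkpoints"          # where model weights are stored
```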
This repository is authored by Benjamin Stahl and was created at the Institute of Electronic Music and Acoustics in Graz, Austria, in 2022/23.