This repository contains the source code for the experiments in our work on a differentiable model for unsupervised singing voice separation.
We propose to extend the work of Schulze-Forster et al. and to build a complete, fully differentiable model by integrating a multipitch estimator and a novel differentiable voice assignment module into the core model.
Note 1: This project builds upon the model of Schulze-Forster et al., and parts of the code are taken or adapted from their repository.
Note 2: The trained models of Cuesta et al. (multiple-f0 estimation) and Cuesta and Gómez (voice assignment) were used in our experiments.
📄 Schulze-Forster et al. paper
📄 Multiple-f0 estimation paper | Voice assignment paper
📁 CSD Database | Cantoría Database
The following packages are required:
pytorch=1.6.0
matplotlib=3.3.1
python-sounddevice=0.4.0
scipy=1.5.2
torchaudio=0.6.0
tqdm=4.49.0
pysoundfile=0.10.3
librosa=0.8.0
scikit-learn=0.23.2
tensorboard=2.3.0
resampy=0.2.2
pandas=1.2.3
These packages are available from the conda-forge and pytorch channels. Python 3.7 or 3.8 is recommended. From a new conda environment:
conda update conda
conda config --add channels conda-forge
conda config --set channel_priority strict
conda config --add channels pytorch
conda install pytorch=1.6.0
conda install numpy=1.23.5 matplotlib=3.3.1 python-sounddevice=0.4.0 scipy=1.5.2 torchaudio=0.6.0 tqdm=4.49.0 pysoundfile=0.10.3 librosa=0.8.0 scikit-learn=0.23.2 tensorboard=2.3.0 resampy=0.2.2 pandas=1.2.3 configargparse=0.13.0
pip install pumpp==0.6.0 nnAudio==0.3.2
Alternatively, you can use the provided environment.yml file:
conda env create -f environment.yml
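After installation (by either route), a quick sanity check can confirm that the core packages import and report the expected versions. This is only an optional sketch and not part of the repository scripts:

```python
# Optional environment sanity check (not part of the repository scripts).
import torch
import torchaudio
import librosa

print("torch:", torch.__version__)            # expected 1.6.0
print("torchaudio:", torchaudio.__version__)  # expected 0.6.0
print("librosa:", librosa.__version__)        # expected 0.8.0
print("CUDA available:", torch.cuda.is_available())
```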
To train the proposed model and the U-Net models:

python train.py -c config.txt
python train_u_nets.py -c unet_config.txt
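Both training scripts take their options from a text configuration file passed with -c, using the configargparse package listed above. The snippet below is a generic sketch of that pattern only; the option names (--tag, --batch-size) are hypothetical illustrations, not the options actually defined in train.py:

```python
# Generic configargparse pattern; the option names below are hypothetical
# examples, not the actual options defined by the training scripts.
import configargparse

parser = configargparse.ArgumentParser()
parser.add_argument('-c', '--config', is_config_file=True,
                    help='path to a config file such as config.txt')
parser.add_argument('--tag', type=str, default='experiment',
                    help='name used for checkpoints and logs (hypothetical)')
parser.add_argument('--batch-size', type=int, default=16,
                    help='training batch size (hypothetical)')

args = parser.parse_args()
print(args.tag, args.batch_size)
```

With this pattern, a config file simply lists `key = value` lines (for example, `batch-size = 16`), and any value can still be overridden on the command line.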
To evaluate a trained model:

python eval.py --tag 'TAG' --f0-from-mix --test-set 'CSD'

Note: 'TAG' is the name of the evaluated model (for example, UMSS_4s_bcbq).
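To evaluate several trained models in one go, the same eval.py call can be wrapped in a loop. The sketch below assumes the command-line flags shown above; the tag list is only an example:

```python
# Run eval.py for a list of model tags; extend the list as needed.
import subprocess

tags = ["UMSS_4s_bcbq"]  # placeholder list; add further model tags here
for tag in tags:
    subprocess.run(
        ["python", "eval.py", "--tag", tag, "--f0-from-mix", "--test-set", "CSD"],
        check=True,
    )
```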
The trained models used in our experiments are available here.