wav2svp is a project that converts a waveform to a Synthesizer V Project (SVP) file. It is based on the SOME and RMVPE. In addition to automatically extracting MIDI, this project can also extract pitch data, tension data(Experimental) and breathiness data(Experimental) simultaneously. But unfortunately, at present, it's unable to simultaneously extract lyrics.
You can download the One click startup package from releases, unzip and double click go-webui.bat
to start the WebUI.
-
Clone this repository and install the dependencies. We recommand to use python 3.10.
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118 pip install -r requirements.txt
-
Download pre-trained models:
- 0119_continuous128_5spk and unzip it to
weights
. - rmvpe and unzip it to
weights
and rename it tormvpe.pt
. - Order the
weights
folder as follows:
weights ├-config.yaml ├-model_steps_64000_simplified.ckpt └-rmvpe.pt
- 0119_continuous128_5spk and unzip it to
-
Run the following command to start WebUI:
python webui.py
-
You can download the inference results from WebUI interface or from the
results
folder.
Use infer.py
:
usage: infer.py [-h] [--model_path MODEL_PATH] [--tempo TEMPO] [--extract_pitch] [--extract_tension] [--extract_breathiness] audio_path
Inference for wav2svp
positional arguments:
audio_path Path to the input audio file
options:
-h, --help show this help message and exit
--model_path MODEL_PATH
Path to the model file, default: weights/model_steps_64000_simplified.ckpt
--tempo TEMPO Tempo value for the midi file, default: 120
--extract_pitch Whether to extract pitch from the audio file, default: False
--extract_tension Whether to extract tension from the audio file, default: False
--extract_breathiness
Whether to extract breathiness from the audio file, default: False
You can find the results in the results
folder.
- [openvpi/SOME] openvpi/SOME
- [Dream-High/RMVPE] Dream-High/RMVPE
- [yxlllc/RMVPE] yxlllc/RMVPE