GitHub - kadirnar/VoiceHub: VoiceHub: A Unified Inference Interface for TTS Models

VoiceHub: A Unified Inference Interface for TTS Models

🛠️ Installation

uv venv --python 3.12
source .venv/bin/activate
uv pip install voicehub

📚 Usage

VoiceHub provides a simple, unified interface for working with various Text-to-Speech (TTS) models. Below are examples showing how to use different supported TTS models with the same consistent approach.

OrpheusTTS Model

from voicehub.automodel import AutoInferenceModel

model = AutoInferenceModel.from_pretrained(
    model_type="orpheustts",  # or "dia" or "vui"
    model_path="canopylabs/orpheus-3b-0.1-ft",
    device="cuda",
)

output = model(
    text="Hey, here is some random stuff, the text the less likely the model can cope!",
    voice="tara",
    output_file="output.wav",
)

DiaTTS Model

from voicehub.automodel import AutoInferenceModel

model = AutoInferenceModel.from_pretrained(
    model_type="dia",  # or "dia" or "vui"
    model_path="dia/dia-100m-base.pt",
    device="cuda",
)

output = model(
    text="Hey, here is some random stuff, the text the less likely the model can cope!",
    output_file="output.wav",
)

VuiTTS Model

from voicehub.automodel import AutoInferenceModel

model = AutoInferenceModel.from_pretrained(
    model_type="vui",  # or "dia" or "vui"
    model_path="fluxions/vui",
    device="cuda",
)

output = model(
    text="Hey, here is some random stuff, the text the less likely the model can cope!",
    output_file="output.wav",
)

Llasa Model

from voicehub.automodel import AutoInferenceModel

model = AutoInferenceModel.from_pretrained(
    model_type="llasa",  # or "dia" or "vui"
    model_path="HKUSTAudio/Llasa-1B-Multilingual",
    device="cuda",
)

output = model(
    text="Hey, here is some random stuff, the text the less likely the model can cope!",
    output_file="output.wav",
)

Llasa Voice Clone Model

from voicehub.automodel import AutoInferenceModel

model = AutoInferenceModel.from_pretrained(
    model_type="llasa_voice_clone",  # or "dia" or "vui"
    model_path="HKUSTAudio/Llasa-1B-Multilingual",
    device="cuda",
)

output = model(
    text="Hey, here is some random stuff, the text the less likely the model can cope!",
    output_file="output.wav",
)

Chatterbox Model

from voicehub.automodel import AutoInferenceModel

model = AutoInferenceModel.from_pretrained(
    model_type="chatterbox",  # or "dia" or "vui"
    model_path="ResembleAI/chatterbox",
    device="cuda",
)

output = model(
    text="Hey, here is some random stuff, the text the less likely the model can cope!",
    output_file="output.wav",
)

🤗 Contributing

uv pip install pre-commit
pre-commit install
pre-commit run --all-files

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
.github		.github
assets		assets
scripts		scripts
voicehub		voicehub
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

VoiceHub: A Unified Inference Interface for TTS Models

🛠️ Installation

📚 Usage

OrpheusTTS Model

DiaTTS Model

VuiTTS Model

Llasa Model

Llasa Voice Clone Model

Chatterbox Model

🤗 Contributing

📝 Acknowledgments

About

Uh oh!

Releases 1

Sponsor this project

Uh oh!

Contributors 2

Uh oh!

Languages

Uh oh!

License

kadirnar/VoiceHub

Folders and files

Latest commit

History

Repository files navigation

VoiceHub: A Unified Inference Interface for TTS Models

🛠️ Installation

📚 Usage

OrpheusTTS Model

DiaTTS Model

VuiTTS Model

Llasa Model

Llasa Voice Clone Model

Chatterbox Model

🤗 Contributing

📝 Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Sponsor this project

Uh oh!

Contributors 2

Uh oh!

Languages