GitHub - AlexanderGlogger/voice2json: Command-line tools for speech and intent recognition on Linux

voice2json is a collection of command-line tools for offline speech/intent recognition on Linux. It is free, open source (MIT), and supports 17 human languages.

From the command-line:

$ voice2json transcribe-wav \
      < turn-on-the-light.wav | \
      voice2json recognize-intent | \
      jq .

produces a JSON event like:

{
    "text": "turn on the light",
    "intent": {
        "name": "LightState"
    },
    "slots": {
        "state": "on"
    }
}

when trained with this template:

[LightState]
states = (on | off)
turn (<states>){state} [the] light

voice2json is optimized for:

It can be used to:

Supported speech to text systems include:

Unique Features

voice2json is more than just a wrapper around open source speech to text systems!

Training produces both a speech and intent recognizer. By describing your voice commands with voice2json's templating language, you get more than just transcriptions for free.
Re-training is fast enough to be done at runtime (usually < 5s), even up to millions of possible voice commands. This means you can change referenced slot values or add/remove intents on the fly.
All of the available commands are designed to work well in Unix pipelines, typically consuming/emitting plaintext or newline-delimited JSON. Audio input/output is file-based, so you can receive audio from any source.

Name		Name	Last commit message	Last commit date
Latest commit History 349 Commits
bin		bin
debian		debian
docker		docker
docs		docs
etc		etc
m4		m4
recipes		recipes
scripts		scripts
tests		tests
voice2json		voice2json
.dockerignore		.dockerignore
.gitignore		.gitignore
.projectile		.projectile
.python-version		.python-version
AUTHORS		AUTHORS
CHANGELOG		CHANGELOG
Dockerfile		Dockerfile
Dockerfile.debian		Dockerfile.debian
Dockerfile.test.debian		Dockerfile.test.debian
LICENSE		LICENSE
Makefile.in		Makefile.in
PKG-INFO		PKG-INFO
README.md		README.md
TODO.md		TODO.md
VERSION		VERSION
__main__.py		__main__.py
aclocal.m4		aclocal.m4
architecture.sh		architecture.sh
bootstrap.sh		bootstrap.sh
config.guess		config.guess
config.sub		config.sub
configure		configure
configure.ac		configure.ac
install-sh		install-sh
missing		missing
mkdocs.yml		mkdocs.yml
mypy.ini		mypy.ini
pylintrc		pylintrc
requirements.txt		requirements.txt
requirements_dev.txt		requirements_dev.txt
setup.cfg		setup.cfg
setup.py.in		setup.py.in
voice2json.sh.in		voice2json.sh.in
voice2json.spec.in		voice2json.spec.in