Scriber - Windows Voice Dictation

Scriber is a voice dictation application for Windows, in the style of tools like Aqua Voice and Wispr Flow. It transcribes your speech through a configurable speech-to-text (STT) engine and lets you dictate text into any application system-wide.

Quick Start (Windows)

Download the repository.
Double-click start.bat.
- The script sets up the environment, installs dependencies, and prompts you for API keys.
Dictate: Press Ctrl+Alt+S to start/stop listening.

Features

System-Wide Dictation: Works in any application (Word, IDEs, Browser, etc.).
Global Hotkey: Activate voice capture with Ctrl+Alt+S (configurable).
Multi-Engine Support:
- Soniox: Ultra-low latency streaming with custom vocabulary.
- AssemblyAI: High accuracy with punctuation.
- Deepgram: Fast and cost-effective streaming.
- OpenAI (Whisper): High accuracy via Whisper API.
- Azure Speech: Microsoft's enterprise STT.
- Gladia: Audio intelligence API.
- Groq: Fast inference for Whisper models.
- Speechmatics: Specialized ASR.
- Google Cloud STT: Enterprise-grade recognition.
- ElevenLabs: Scribe model integration.
Smart Turn Detection: Optional pause detection when supported by the environment.
One-Click Setup: Automated start.bat script for easy installation.
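
The configurable hotkey is written as a `+`-separated string such as `ctrl+alt+s`. As a rough illustration of how such a string could be interpreted, here is a minimal sketch. The `Hotkey` class, the `parse_hotkey` function, and the set of accepted modifier names are hypothetical and are not taken from Scriber's source, which may parse hotkeys differently.

```python
from dataclasses import dataclass

# Modifier names assumed for illustration; Scriber's real parser may accept others.
MODIFIERS = {"ctrl", "alt", "shift", "win"}


@dataclass(frozen=True)
class Hotkey:
    modifiers: frozenset[str]
    key: str


def parse_hotkey(spec: str) -> Hotkey:
    """Split a spec such as 'ctrl+alt+s' into modifiers and one trigger key."""
    parts = [p.strip().lower() for p in spec.split("+")]
    if not all(parts):
        raise ValueError(f"empty component in hotkey: {spec!r}")
    modifiers = frozenset(p for p in parts if p in MODIFIERS)
    keys = [p for p in parts if p not in MODIFIERS]
    if len(keys) != 1:
        raise ValueError(f"expected exactly one non-modifier key in {spec!r}")
    return Hotkey(modifiers, keys[0])


hotkey = parse_hotkey("Ctrl+Alt+S")
print(sorted(hotkey.modifiers), hotkey.key)
```

Normalizing case and whitespace up front means `Ctrl+Alt+S` and `ctrl + alt + s` resolve to the same combination, and rejecting specs without exactly one non-modifier key catches typos early.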

Manual Installation

If you prefer to run it manually, or you are on Linux or macOS:

Clone the repository.
Install dependencies:
```
pip install -r requirements.txt
```
Create a .env file with your API keys.
Run:
```
python src/main.py
```
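
Before running, it can help to confirm that the manual steps above were completed. The following is a minimal sketch of such a check, not part of Scriber itself. The `preflight` function is hypothetical, and the file names it looks for are taken from the steps above and from the Requirements section.

```python
import sys
from pathlib import Path


def preflight(project_root, version=sys.version_info):
    """Return a list of human-readable problems; an empty list means ready to run."""
    root = Path(project_root)
    problems = []
    # The Requirements section asks for Python 3.10 or newer.
    if tuple(version[:2]) < (3, 10):
        problems.append(f"Python 3.10+ required, found {version[0]}.{version[1]}")
    # Files the manual steps rely on.
    for name in (".env", "requirements.txt", "src/main.py"):
        if not (root / name).is_file():
            problems.append(f"missing {name}")
    return problems


# Run from the repository root; prints nothing when everything is in place.
for problem in preflight("."):
    print(problem)
```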

Configuration

The start.bat script will create a .env file for you. You can also edit it manually:

# STT Service API Keys
SONIOX_API_KEY=your_key
ASSEMBLYAI_API_KEY=your_key
DEEPGRAM_API_KEY=your_key
OPENAI_API_KEY=your_key
AZURE_SPEECH_KEY=your_key
AZURE_SPEECH_REGION=westus
GLADIA_API_KEY=your_key
GROQ_API_KEY=your_key
SPEECHMATICS_API_KEY=your_key
ELEVENLABS_API_KEY=your_key
GOOGLE_APPLICATION_CREDENTIALS=path/to/json

# App Settings
SCRIBER_HOTKEY=ctrl+alt+s
SCRIBER_DEFAULT_STT=soniox  # Options: soniox, assemblyai, deepgram, openai, azure, gladia, groq, speechmatics, google, elevenlabs
SCRIBER_MODE=toggle         # toggle or push_to_talk
SCRIBER_CUSTOM_VOCAB=Scriber, Pipecat, Soniox
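
To illustrate how these settings relate to each other, here is a minimal sketch that checks a set of values for consistency. It is not Scriber's own loading code. The `check_settings` and `parse_vocab` functions are hypothetical, and the pairing of each engine with the variables it needs is an assumption inferred from the variable names in the example above. Treating an unset `SCRIBER_MODE` as an error is likewise an assumption, since the README does not state a default.

```python
# Engine name -> variables it is assumed to need (inferred from the example .env).
REQUIRED_VARS = {
    "soniox": ["SONIOX_API_KEY"],
    "assemblyai": ["ASSEMBLYAI_API_KEY"],
    "deepgram": ["DEEPGRAM_API_KEY"],
    "openai": ["OPENAI_API_KEY"],
    "azure": ["AZURE_SPEECH_KEY", "AZURE_SPEECH_REGION"],
    "gladia": ["GLADIA_API_KEY"],
    "groq": ["GROQ_API_KEY"],
    "speechmatics": ["SPEECHMATICS_API_KEY"],
    "google": ["GOOGLE_APPLICATION_CREDENTIALS"],
    "elevenlabs": ["ELEVENLABS_API_KEY"],
}
VALID_MODES = {"toggle", "push_to_talk"}


def check_settings(env):
    """Return a list of problems found in a mapping of setting names to values."""
    problems = []
    engine = env.get("SCRIBER_DEFAULT_STT", "").strip().lower()
    if engine not in REQUIRED_VARS:
        problems.append(f"unknown SCRIBER_DEFAULT_STT: {engine!r}")
    else:
        for name in REQUIRED_VARS[engine]:
            if not env.get(name, "").strip():
                problems.append(f"{name} is required for engine {engine!r}")
    mode = env.get("SCRIBER_MODE", "").strip().lower()
    if mode not in VALID_MODES:
        problems.append(f"SCRIBER_MODE must be toggle or push_to_talk, got {mode!r}")
    return problems


def parse_vocab(raw):
    """Split a comma-separated SCRIBER_CUSTOM_VOCAB value into clean terms."""
    return [term.strip() for term in raw.split(",") if term.strip()]


example = {
    "SCRIBER_DEFAULT_STT": "azure",
    "AZURE_SPEECH_KEY": "your_key",
    "SCRIBER_MODE": "toggle",
}
for problem in check_settings(example):
    print(problem)
print(parse_vocab("Scriber, Pipecat, Soniox"))
```

In this example the check reports that `AZURE_SPEECH_REGION` is missing, because the Azure engine is the only one in the example configuration that uses two variables. Any mapping with a `get` method works as input, so `os.environ` can be passed once the `.env` file has been loaded into the process environment.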

Requirements

Windows 10/11 (Recommended)
Python 3.10+
Microphone
