Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wake Word Config #238

Closed
Felitendo opened this issue Oct 2, 2024 · 4 comments
Closed

Wake Word Config #238

Felitendo opened this issue Oct 2, 2024 · 4 comments
Labels
enhancement New feature or request tts&stt Speech-to-text, text-to-speech and wakeword requests or bugs, including Vosk

Comments

@Felitendo
Copy link

Let us change the Wake Word to a custom one like "Scotty" for example.

@Stypox
Copy link
Owner

Stypox commented Oct 4, 2024

This is not really doable, unless there is a backend server that trains a new wake-word model every time a user picks a new wake word. Therefore this is out of scope for Dicio, though you can train a wake word on your computer and then use it in Dicio using this tutorial from openWakeWord: https://github.com/dscripka/openWakeWord/blob/main/notebooks/automatic_model_training.ipynb . This is what I used to train thw "Hey Dicio" wake word.

@Stypox Stypox added enhancement New feature or request tts&stt Speech-to-text, text-to-speech and wakeword requests or bugs, including Vosk labels Oct 4, 2024
@Stypox Stypox closed this as not planned Won't fix, can't repro, duplicate, stale Oct 4, 2024
@StefanoWoerner
Copy link

@Stypox where did you get the data to train hey_dicio_v6.0.tflite and how much data did you use?

@Stypox
Copy link
Owner

Stypox commented Nov 30, 2024

The data is autogenerated, see the notebook. In particular, I used these parameters:

augmentation_batch_size: 16
augmentation_rounds: 1
background_paths:
- ./audioset_16k
- ./fma
background_paths_duplication_rate:
- 1
batch_n_per_class:
  ACAV100M_sample: 1024
  adversarial_negative: 50
  positive: 50
custom_negative_phrases: []
false_positive_validation_data_path: validation_set_features.npy
feature_data_files:
  ACAV100M_sample: openwakeword_features_ACAV100M_2000_hrs_16bit.npy
layer_size: 32
max_negative_weight: 2700
model_name: hey_dicio
model_type: dnn
n_samples: 50000
n_samples_val: 2000
output_dir: ./my_custom_model
piper_sample_generator_path: ./piper-sample-generator
rir_paths:
- ./mit_rirs
steps: 120000
target_accuracy: 0.7
target_false_positives_per_hour: 0.2
target_phrase:
- hey dicho
- hey dicio
- hey dichio
- hey disio
- hey dizio
target_recall: 0.7
tts_batch_size: 50

@LoneDev6
Copy link

Related to #291

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request tts&stt Speech-to-text, text-to-speech and wakeword requests or bugs, including Vosk
Projects
None yet
Development

No branches or pull requests

4 participants