Home

Welcome to the minutes wiki! 🎤

minutes is a standalone API for audio speaker diaratization.

Speaker diarisation (or diarization) is a process for partitioning an input audio stream into homogeneous segments according to the speaker identity - Wikipedia.

👉 Input

As input, minutes accepts an audio recording of a conversation including n speakers, and n labeled audio samples (one for each speaker).

👈 Output

As output, minutes produces a list of phrases from the conversation - each phrase should have the following keys: speaker, start_time, end_time and body.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Home

Welcome to the minutes wiki! 🎤

👉 Input

👈 Output

Clone this wiki locally