We would need Pyannote for this; [here](https://huggingface.co/spaces/Xenova/whisper-speaker-diarization) is an example.
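For context, here is a rough sketch of the merging step a demo like that needs, assuming "this" means labelling transcript chunks with speakers. It does not call Pyannote or Whisper at all: `Turn`, `Chunk`, and `assign_speakers` are hypothetical names made up for illustration, and the inputs are assumed to be speaker turns and transcript chunks that already carry start/end times in seconds. Each chunk simply gets the speaker whose turn overlaps it the most.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """A diarization result: one speaker talking from start to end (seconds)."""
    speaker: str
    start: float
    end: float


@dataclass
class Chunk:
    """A transcript piece with its timestamps (seconds)."""
    text: str
    start: float
    end: float


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(chunks: list[Chunk], turns: list[Turn]) -> list[tuple[str, str]]:
    """Label each chunk with the speaker whose turn overlaps it the most.

    Chunks that overlap no turn are labelled "UNKNOWN" (an arbitrary choice).
    """
    labelled = []
    for chunk in chunks:
        best_speaker = "UNKNOWN"
        best_overlap = 0.0
        for turn in turns:
            shared = overlap(chunk.start, chunk.end, turn.start, turn.end)
            if shared > best_overlap:
                best_speaker = turn.speaker
                best_overlap = shared
        labelled.append((best_speaker, chunk.text))
    return labelled
```

Maximum overlap is only one possible heuristic; a chunk that spans a speaker change would need to be split at word level to be labelled correctly.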