【Language Detection】 Wrong language detection and Decreasing detection quality when specifying language #2522
Replies: 1 comment
-
|
This behavior can happen with Whisper, especially when the audio comes from movies, DVDs, anime, or heavily edited media rather than clean speech recordings. Why Whisper may detect Japanese audio as English or ChineseWhisper's language detection is performed using only a relatively short portion of audio (typically the beginning of the recording). If the start of the clip contains:
the language detector may make the wrong choice. For example: can sometimes be misclassified as: especially if the speech content is short. Why transcription sometimes becomes worse when using
|
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Background: I bought a DVD and got its video.
Question:
1 Although I have cut the long video into small ones, whisper cannot correctly detect its language. It always detects the language as English or Chinese, but actually the video is Japanese.
2 When I use --language , the transcription quality is much worse than the one without --language
My goal:
I want to make whisper correctly detect the video with normal quality and accuracy.
Beta Was this translation helpful? Give feedback.
All reactions