Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

transcription stops before end of audio file #137

Open
pantau000 opened this issue Oct 25, 2023 · 11 comments
Open

transcription stops before end of audio file #137

pantau000 opened this issue Oct 25, 2023 · 11 comments
Labels
bug Something isn't working

Comments

@pantau000
Copy link

transcription of a 1h34 audio file stops after 1h13 minutes.

last message:

INFO: Finished transcription for Transcription of Caxias 09.mp3 in 32723 seconds
'HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /facebook/bart-large-mnli/resolve/main/config.json (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x0000021AFA45DFF0>, 'Connection to huggingface.co timed out. (connect timeout=10)'))' thrown while requesting HEAD https://huggingface.co/facebook/bart-large-mnli/resolve/main/config.json

@pantau000 pantau000 added the bug Something isn't working label Oct 25, 2023
@octimot
Copy link
Owner

octimot commented Nov 1, 2023

I think there must have been an error with accessing huggingface.co when the tool tried to download the bart-large-mnli model (maybe for question labeling?).

If you haven't tried this again already, I would give it another try. It should work!

Feel free to re-open this and continue the conversation if it doesn't.

Cheers

@octimot octimot closed this as completed Nov 1, 2023
@pantau000
Copy link
Author

just checked again, it continues to stop transcription before the end

@octimot
Copy link
Owner

octimot commented Nov 2, 2023

Could you check if you can access https://huggingface.co/facebook/bart-large-mnli/resolve/main/config.json directly from your browser? If so, there might be an issue with some ssl certificates on your machine. Are you using the standalone or the git version of the tool?

@octimot octimot reopened this Nov 2, 2023
@octimot octimot changed the title time limit? Issue when downloading model from huggingface during transcription Nov 2, 2023
@pantau000
Copy link
Author

I would rather suggest that this has maybe to do with the metadata in the audio file?

Anyway, I checked, and the page opens without problem:

{
"_num_labels": 3,
"activation_dropout": 0.0,
"activation_function": "gelu",
"add_final_layer_norm": false,
"architectures": [
"BartForSequenceClassification"
],
"attention_dropout": 0.0,
"bos_token_id": 0,
"classif_dropout": 0.0,
"classifier_dropout": 0.0,
"d_model": 1024,
"decoder_attention_heads": 16,
"decoder_ffn_dim": 4096,
"decoder_layerdrop": 0.0,
"decoder_layers": 12,
"decoder_start_token_id": 2,
"dropout": 0.1,
"encoder_attention_heads": 16,
"encoder_ffn_dim": 4096,
"encoder_layerdrop": 0.0,
"encoder_layers": 12,
"eos_token_id": 2,
"forced_eos_token_id": 2,
"gradient_checkpointing": false,
"id2label": {
"0": "contradiction",
"1": "neutral",
"2": "entailment"
},
"init_std": 0.02,
"is_encoder_decoder": true,
"label2id": {
"contradiction": 0,
"entailment": 2,
"neutral": 1
},
"max_position_embeddings": 1024,
"model_type": "bart",
"normalize_before": false,
"num_hidden_layers": 12,
"output_past": false,
"pad_token_id": 1,
"scale_embedding": false,
"transformers_version": "4.7.0.dev0",
"use_cache": true,
"vocab_size": 50265
}

@octimot
Copy link
Owner

octimot commented Nov 2, 2023

I would rather suggest that this has maybe to do with the metadata in the audio file?

Not according to the error though.

Are you using the standalone or the git version?

@pantau000
Copy link
Author

git

@pantau000
Copy link
Author

just checked again, this happens independently of the hugging face error (which i didn't get again), with different audio files. maybe a problem of mp3 files?

@pantau000 pantau000 reopened this Nov 5, 2023
@pantau000 pantau000 changed the title Issue when downloading model from huggingface during transcription transcription before end of audio file Nov 5, 2023
@pantau000 pantau000 changed the title transcription before end of audio file transcription stops before end of audio file Nov 5, 2023
@octimot
Copy link
Owner

octimot commented Jan 8, 2024

Did you find a workaround for this or is it still an issue?

Cheers!

@pantau000
Copy link
Author

haven't tried again, will do so with the newest version

@pantau000
Copy link
Author

just checked, wiht another audio file, unfortunately the problem continues. audio is 1:28:15 and transcription stops at 01:00:33,660.

@pantau000
Copy link
Author

bump

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants