-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The training procedure fails in phoenix2014.py #6
Comments
Thanks for reporting this. I have realized that I haven't done a long overdue update of the code about the phoenix part. The current version of the code does not really correspond to the released model training because phoenix2014T was trained with the CTC loss for which I have removed support to simplify the code. On the other hand, |
Gul Thanks for your reply, I would love for you to update me as soon as the code matching updates to phoenix2014T are done and in which code files the updates were made. Also, I tried the alternative option you offered even before I opened the issue, but also in the evaluation process I've encountered incompatibilities in the evaluate.py file, which begin with the aggregate_clips function in the gt variable and are dragged along the entire evaluation process and prevent its completion. For that matter, I used the following command for the evaluation - I would be happy if you could also update the evaluation code accordingly. In addition, I would be happy to receive your answer regarding the Technical question I asked above (my first comment). Is the trained model you provided necessary as a starting point for training? And in particular, is it suitable for phoneix2014T? |
I would love to get an update on whether the updated code is ready and will be pulled soon to the repo? If not yet, then I would love to know how much longer do you think it will take to complete these updates? |
Sorry for the slow response. I clearly failed to update the code on time, so I would prefer not to make another estimate now. I will try to find some time for it. Find other answers below:
Step a was pretrained on this model or equivalently by setting Steps c and d are heavy. I'd like to check whether I can train one model with a single step so that it's simpler. If it helps: the result of training with a single stage only on Phoenix2014T uniform labels (without CTC, without Phoenix2014 pretraining, with BSL-1K pretraining) was 53.7 WER. |
Can you give the pretrained model for step d? I want to extract the i3d feature using that. |
Hi, sorry I am not at capacity to provide support too much. But from what I read above, the released model is already the result of step d. |
Dear authors,
I've been trying to use the code for the Phoenix dataset by running:
following your instructions in run
I also hope the command I mentioned above relates to my training intent, please correct me if not.
First of all, I think there may be an error in line 72 of datasets/phoenix2014.py:
self.frame_level_glosses = data["videos"]["alignments"]["gloss_id"]
because in the dictionary created by misc/phoenix2014T/gather_frames.py, there is no "alignment" property.
So I'm currently using the following patch:
self.frame_level_glosses = data["videos"]["gloss_ids"]
Please update me if this should be something else.
My main problem is when I try to train the model using the command mentioned above, the code fails in the ‘_get_class’ function of datasets/phoenix2014.py, because the variable clip_glosses seems to be an empty list [] 98% of the time.
The error:
What can be done to solve this problem?
Thanks in advance.
The text was updated successfully, but these errors were encountered: