Result dimension and audio speed #15
Comments
Hi, the model is trained to generate images at 96x128. I'm afraid it can't produce larger images unless we retrain it with larger videos in the training set. Have you manually provided the sampling rate of your audio? It sounds to me like there is a mismatch between the actual audio sampling rate and the one provided to the animator.
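One quick way to rule out the mismatch mentioned above is to read the sampling rate and channel count straight from the WAV header before passing them on. The following is a minimal sketch using only the Python standard library; `describe_wav` is a hypothetical helper name and is not part of this project.

```python
import wave


def describe_wav(path):
    """Return (channels, sample_rate_hz, duration_s) read from a WAV header."""
    with wave.open(path, "rb") as wav:
        channels = wav.getnchannels()
        rate = wav.getframerate()
        # One frame holds one sample per channel, so frames / rate is seconds.
        duration = wav.getnframes() / rate
    return channels, rate, duration
```

If the rate reported here differs from the one given to the animator, or the channel count is not 1, the output audio is likely to play at the wrong speed.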
What I found regarding Q2: the WAV track must be mono. If you input a stereo track, the output slows down and the playtime doubles...
Oh yes, I guess I forgot to mention that in the documentation. The track should be mono. I presume a stereo track ends up with 2x the samples when reshaping takes place.
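To illustrate the 2x effect and the workaround: a stereo WAV stores its samples interleaved (L0, R0, L1, R1, ...), so reading it as a flat sequence yields twice as many values as there are frames. Below is a minimal sketch of a downmix to mono, assuming 16-bit PCM input and using only the Python standard library; `stereo_to_mono` is a hypothetical helper, not something this project provides.

```python
import array
import sys
import wave


def stereo_to_mono(src_path, dst_path):
    """Average both channels of a 16-bit stereo WAV and write a mono WAV.

    Returns (interleaved_sample_count, mono_sample_count).
    """
    with wave.open(src_path, "rb") as src:
        if src.getnchannels() != 2 or src.getsampwidth() != 2:
            raise ValueError("expected 16-bit stereo PCM")
        rate = src.getframerate()
        samples = array.array("h", src.readframes(src.getnframes()))
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian
    # Interleaved layout is L0, R0, L1, R1, ...; average each left/right pair.
    mono = array.array(
        "h", ((l + r) // 2 for l, r in zip(samples[0::2], samples[1::2]))
    )
    if sys.byteorder == "big":
        mono.byteswap()
    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(2)
        dst.setframerate(rate)
        dst.writeframes(mono.tobytes())
    return len(samples), len(mono)
```

The returned pair makes the doubling visible: the interleaved count is exactly twice the mono count, which matches the doubled playtime reported above.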
Wow! Thank you! I'll try mono.
Yep, mono fixed the issue!
To be honest, it was more of a memory and time restriction than anything else. The model takes 11GB on the GPU to train and trains for 1-2 weeks (depending on the dataset size). If we have an update in the future, I'll post about it here.
@wooloodya Hello, I've run into the same problem.
Hi! I've just run the code and generated a video successfully, but I've got a couple of questions.
1. The result video is very small even though the source image is much bigger. Is there any way to increase the result video dimensions?
2. The audio in the result mp4 is slowed down dramatically. Why is that?