Thanks for open-sourcing this! I've noticed that in v_express_pipeline.py you apply classifier-free guidance to the audio embeddings; however, the technical report doesn't seem to mention audio embedding dropout. Do you drop the audio embeddings during training, and if so, what is the dropout rate?
For the classifier-free guidance strategy, we drop all conditions during training with a drop rate of 10%. This is independent of the strategies that drop strong conditions (e.g., dropping kps); the settings described in the technical report refer to the latter.
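To make the "drop all conditions at 10%" idea concrete, here is a minimal sketch in plain Python. It is not the V-Express training code: the function name `maybe_drop_all_conditions`, the condition names in `sample`, and the choice of zeros as the "null" condition are all assumptions made for illustration. Only the 10% rate and the fact that all conditions are dropped together come from the reply above.

```python
import random

CFG_DROP_RATE = 0.10  # drop rate stated in the reply above


def maybe_drop_all_conditions(conditions, drop_rate=CFG_DROP_RATE, rng=random):
    """With probability `drop_rate`, replace every condition with a null placeholder.

    Hypothetical helper: zero vectors stand in for the null condition here,
    which is an assumption, not a confirmed detail of the actual training code.
    """
    if rng.random() < drop_rate:
        # All conditions are dropped together, so the model also learns the
        # unconditional prediction that classifier-free guidance needs.
        return {name: [0.0] * len(values) for name, values in conditions.items()}
    return conditions


# Toy conditions for one training sample (names and values are illustrative).
sample = {"audio": [0.3, -1.2, 0.8], "kps": [0.5, 0.5], "reference": [1.0]}

# Empirically check how often a sample ends up fully unconditional.
rng = random.Random(0)
n_trials = 10_000
n_dropped = sum(
    all(v == 0.0 for vals in maybe_drop_all_conditions(sample, rng=rng).values() for v in vals)
    for _ in range(n_trials)
)
print(f"fraction dropped: {n_dropped / n_trials:.3f}")
```

Any separate dropout of individual strong conditions (such as kps) would be an additional, independent step and is not shown in this sketch.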