Question on training strategy: Is there audio conditional drop out? #45

fredkingdom · 2024-06-25T08:39:53Z

Thanks for open-sourcing this! I noticed that v_express_pipeline.py applies classifier-free guidance to the audio embeddings, but the technical report doesn't seem to mention audio embedding dropout. Do you drop the audio embeddings during training, and if so, what is the dropout rate?

tiankuan93 · 2024-06-26T10:46:47Z

For classifier-free guidance, we drop all conditions together during training, with a drop rate of 10%. The strategies that drop strong conditions (e.g., dropping kps) are independent of the classifier-free guidance strategy; the settings described in the technical report refer to dropping strong conditions.
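
For readers following along, here is a rough sketch of what "drop all conditions with a 10% rate" might look like. It is not taken from the V-Express training code: the function names `drop_all_conditions` and `cfg_combine`, the use of zeros as the "null" condition, and the plain-list representation of embeddings are all assumptions made for illustration.

```python
import random
from typing import Dict, List, Optional


def drop_all_conditions(
    conditions: Dict[str, List[float]],
    drop_rate: float = 0.1,
    rng: Optional[random.Random] = None,
) -> Dict[str, List[float]]:
    """With probability `drop_rate`, replace every condition with zeros.

    Hypothetical helper: zeroing is only one possible choice of "null"
    condition; the actual training code may use something else.
    """
    rng = rng or random
    if rng.random() < drop_rate:
        # All conditions (audio, kps, reference, ...) are dropped together.
        return {name: [0.0] * len(values) for name, values in conditions.items()}
    return conditions


def cfg_combine(
    cond_pred: List[float], uncond_pred: List[float], guidance_scale: float
) -> List[float]:
    """Standard classifier-free guidance mix used at inference time."""
    return [u + guidance_scale * (c - u) for c, u in zip(cond_pred, uncond_pred)]


# Training-time usage: one draw per sample decides whether all conditions are kept.
sample = {"audio": [0.3, -0.1], "kps": [1.0, 0.5]}
maybe_dropped = drop_all_conditions(sample, drop_rate=0.1)
```

The point of the single random draw is that the unconditional branch seen during training has no conditions at all, which is what the guidance formula in `cfg_combine` assumes at inference. Per-condition dropout of strong conditions such as kps would be a separate, independent draw.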
