Hello. I read your paper on speech-driven facial animation, and it's great that you've released code showing the overall architecture of the model.
I have some questions about the sequence discriminator described in your paper. You mention that the frame at each time step is encoded using a CNN and fed into a two-layer GRU. Is this CNN identical to the Identity Encoder used in the Generator?
You also mentioned adding the audio as a conditional input to the network. How is this audio encoded, and how is it added to the input?
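For context, here is how I currently picture the sequence discriminator. This is just a minimal PyTorch sketch of my understanding; the layer sizes, the frame-encoder layout, and the concatenation-based audio conditioning are all my own assumptions, not details from the paper.

```python
import torch
import torch.nn as nn

class SequenceDiscriminator(nn.Module):
    def __init__(self, frame_dim=128, audio_dim=128, hidden_dim=256):
        super().__init__()
        # Per-frame CNN encoder (this is the part I'm asking about:
        # is it the same network as the Identity Encoder?).
        # Layer sizes here are placeholders.
        self.frame_encoder = nn.Sequential(
            nn.Conv2d(3, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(128, frame_dim),
        )
        # Two-layer GRU over the per-frame encodings. I'm guessing the
        # audio condition is concatenated to each frame encoding, but
        # that's exactly what I'd like to confirm.
        self.gru = nn.GRU(frame_dim + audio_dim, hidden_dim,
                          num_layers=2, batch_first=True)
        self.classifier = nn.Linear(hidden_dim, 1)

    def forward(self, frames, audio_feats):
        # frames:      (batch, time, 3, H, W)
        # audio_feats: (batch, time, audio_dim) -- per-step audio encoding
        b, t = frames.shape[:2]
        enc = self.frame_encoder(frames.flatten(0, 1)).view(b, t, -1)
        out, _ = self.gru(torch.cat([enc, audio_feats], dim=-1))
        return self.classifier(out[:, -1])  # real/fake score from last step
```

Is this roughly the right picture, or does the audio enter the discriminator some other way (e.g. encoded once for the whole clip, or fed to the GRU's initial hidden state)?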
Thanks