The results obtained after running the demo are inconsistent with those shown. #23

xiao-keeplearning · 2024-06-04T11:50:46Z

I ran the demo code for scenario 2 and got talk_tys_fix_face.mp4, but the video results are not the same as shown in the readme. And it looks like my result are a little worse.

talk_tys_fix_face.mp4

tiankuan93 · 2024-06-05T02:24:51Z

We've adjusted the default weights for reference_attention_weight and audio_attention_weight with the goal of making mouth movements more pronounced. You can turn up reference_attention_weight to make the model maintain higher character consistency, and turn down audio_attention_weight to reduce mouth artifacts. As shown below.

python inference.py \
    --reference_image_path "./test_samples/short_case/tys/ref.jpg" \
    --audio_path "./test_samples/short_case/tys/aud.mp3" \
    --output_path "./output/short_case/talk_tys_fix_face.mp4" \
    --retarget_strategy "fix_face" \
    --num_inference_steps 25 \
    --reference_attention_weight 1.0 \
    --audio_attention_weight 1.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The results obtained after running the demo are inconsistent with those shown. #23

The results obtained after running the demo are inconsistent with those shown. #23

xiao-keeplearning commented Jun 4, 2024

tiankuan93 commented Jun 5, 2024

The results obtained after running the demo are inconsistent with those shown. #23

The results obtained after running the demo are inconsistent with those shown. #23

Comments

xiao-keeplearning commented Jun 4, 2024

tiankuan93 commented Jun 5, 2024