The video mouth shape is the same as the reference? #29

Open
guoyilin opened this issue Jun 6, 2024 · 1 comment
Comments

guoyilin commented Jun 6, 2024

Great job. I tested the mouth shape with a reference image that has a smiling mouth. After inference, the output video keeps the smiling mouth while the person speaks, whereas the mouth shape should change according to the audio. What is the reason? Is it a problem with the MEAD data (each video always has only one expression)?

Collaborator

zhangjun001 commented Jun 18, 2024

If you set retarget_strategy to "no_retarget", it is highly recommended to use reference_attention_weight > 2.
python inference.py \
    --reference_image_path "./test_samples/short_case/AOC/ref.jpg" \
    --audio_path "./test_samples/short_case/AOC/v_exprss_intro_chattts.mp3" \
    --kps_path "./test_samples/short_case/AOC/AOC_raw_kps.pth" \
    --output_path "./output/short_case/talk_AOC_raw_kps_chattts_no_retarget.mp4" \
    --retarget_strategy "fix_face" \
    --num_inference_steps 25 \
    --reference_attention_weight 1.0 \
    --audio_attention_weight 3.0 \
    --save_gpu_memory

Generally, if a front-view reference image and front-view kps are used, naive_retarget works better. Note that audio_attention_weight should be set to 1.0.
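
For anyone who wants to switch between these settings without retyping the whole command, here is a minimal sketch of a helper that assembles the argument list. It uses only the flags quoted in this thread. The function name `build_inference_command`, the output path, and the idea that `--reference_attention_weight` can be left out (so the script falls back to its own default) are assumptions for illustration, not something confirmed by the repository.

```python
import shlex


def build_inference_command(
    reference_image_path,
    audio_path,
    kps_path,
    output_path,
    retarget_strategy,
    audio_attention_weight,
    reference_attention_weight=None,
    num_inference_steps=25,
    save_gpu_memory=True,
):
    """Return an argv list for inference.py (hypothetical helper, not part of the repo)."""
    cmd = [
        "python", "inference.py",
        "--reference_image_path", reference_image_path,
        "--audio_path", audio_path,
        "--kps_path", kps_path,
        "--output_path", output_path,
        "--retarget_strategy", retarget_strategy,
        "--num_inference_steps", str(num_inference_steps),
    ]
    # Assumption: when this flag is omitted, the script uses its own default value.
    if reference_attention_weight is not None:
        cmd += ["--reference_attention_weight", str(reference_attention_weight)]
    cmd += ["--audio_attention_weight", str(audio_attention_weight)]
    if save_gpu_memory:
        cmd.append("--save_gpu_memory")
    return cmd


# Front-view reference and kps: naive_retarget with audio_attention_weight 1.0,
# as suggested above. The output path is a made-up example.
naive_cmd = build_inference_command(
    reference_image_path="./test_samples/short_case/AOC/ref.jpg",
    audio_path="./test_samples/short_case/AOC/v_exprss_intro_chattts.mp3",
    kps_path="./test_samples/short_case/AOC/AOC_raw_kps.pth",
    output_path="./output/short_case/naive_retarget_example.mp4",
    retarget_strategy="naive_retarget",
    audio_attention_weight=1.0,
)

# Print a copy-pasteable shell command.
print(shlex.join(naive_cmd))
```

The list can also be passed to `subprocess.run(naive_cmd, check=True)` from the repository root instead of being printed.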
