Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

31-second audio -> a total processing time of 2617.4 second / 43 min VS realtime #39

Open
johndpope opened this issue Jun 15, 2024 · 2 comments

Comments

@johndpope
Copy link

johndpope commented Jun 15, 2024

the key difference between VASA - and V-express from architecture POV - is the use of Megaportraits / resnets to do warping without needing keypoints.

Screenshot 2024-06-16 at 8 47 34 am

image

I am attempting to recreate Megaportrait codebase here - https://github.com/johndpope/MegaPortrait-hack
There's some audit on flops using profiler -
johndpope/MegaPortrait-hack#39

is it just a case of ripping out the keypoints in this repo to make inference faster?

@dCodeMaestro
Copy link

dCodeMaestro commented Jul 15, 2024

Interesting. Does V-express yield similar results quality as vasa-1 ?

@FurkanGozukara
Copy link

Interesting. Does V-express yield similar results quality as vasa-1 ?

i have full tutorials check them out

76.) Free - Local - PC

https://youtu.be/xLqDTVWUSe

V-Express: 1-Click AI Avatar Talking Heads Video Animation Generator - D-ID Alike - Free Open Source

image

77.) Free & Paid - Cloud - RunPod - Massed Compute - Kaggle

https://youtu.be/GXBiqJOc9FE

V-Express 1-Click AI Talking Avatar Generator - Like D-ID - Massed Compute, RunPod & Kaggle Guide

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants