-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
xformers for potential speedup, or torch 2.01 arguments #17
Comments
Hmm, I wonder if Higher version of
I think it is safe to say you can upgrade your CUDA version.
I do not know how those args would work. |
I will look into this and try it out and report back. |
AFAIK these args are relevent only to Stable Diffusion to speed up image generation. At least I've never encountered them in any other context.
Where's that little mind exploding emoji when you need it. This, good sir, is some very exciting news. Must test soon. |
Threedeejay sent me this and someone got a little further, I wonder if they just need to do I'm currently trying to get the Unity 2022.3.18f1 project going at this time. A lot of features from meta which are finally exposing the hand tracking, and other features to openxr fully for PCVR with the v62 update. (I have it early by opting into the public test channels on pc and the mobile app) |
If you have python 3.11 and cuda 11.8, I assume. |
I had read that using xformers (pip install xformers) could possibly results in a large speedup in the marigold and depth-anything realtime conversion. The issue is I can't find any xformers wheel that is compatible with 2.0.1+cu117 (cuda 11.7) and not sure if the unity project requires that version of cuda to work.
It seems like xformers version 22 is compatible possibly with torch 2.01 and cuda 11.8.
If this doesn't work though because it's too old: I had read an argument you can do with torch 2.01 that would be as fast of a speedup as xformers is adding
--opt-sdp-attention
or--opt-sdp-no-mem-attention
arguments (but these seem specific flags only for automatic1111 I am wondering if the same sort of thing could be done here?)I still can't get the depth-anything model going quite yet to test though. Somehow threedeejay did but he says it runs at 2 frames per second.
The text was updated successfully, but these errors were encountered: