Fast API on CPU + MPS GPU #658
Comments
Just realized this can't work on MPS, since the fast API requires autocast and autocast isn't yet supported on MPS. I guess we'll have to wait for that support to be merged before the fast API will work.
pytorch/pytorch#99272 (comment): it seems autocast is finally supported on MPS!
I made some changes to the codebase that let it use MPS when running autocast, and based on some initial testing, it's significantly faster, especially on the CLVP step!
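For anyone following along, here is a rough sketch of the general idea: pick MPS when it is available and wrap inference in autocast for that device. This is not the code from the PR. `pick_device` and `autocast_or_noop` are hypothetical helper names, and falling back to a no-op context on CPU (or when the installed PyTorch rejects autocast for the device) is my own assumption about how one might handle it.

```python
import contextlib

try:
    import torch
except ImportError:  # keep the sketch importable without PyTorch installed
    torch = None


def pick_device() -> str:
    """Hypothetical helper: prefer CUDA, then MPS, then CPU."""
    if torch is None:
        return "cpu"
    if torch.cuda.is_available():
        return "cuda"
    mps = getattr(torch.backends, "mps", None)  # absent on old PyTorch builds
    if mps is not None and mps.is_available():
        return "mps"
    return "cpu"


def autocast_or_noop(device: str):
    """Hypothetical helper: autocast on GPU-like devices, a no-op otherwise.

    Assumption: older PyTorch releases raise RuntimeError for
    device_type="mps", so that case falls back to a null context.
    """
    if torch is None or device == "cpu":
        return contextlib.nullcontext()
    try:
        return torch.autocast(device_type=device, dtype=torch.float16)
    except RuntimeError:
        return contextlib.nullcontext()


device = pick_device()
with autocast_or_noop(device):
    pass  # model inference would go here
```

Whether autocast actually works on MPS depends on the PyTorch version installed, so treat the fallback as a safety net rather than a guarantee of a speedup.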
@SSardorf can you make a pull request for this?
Sure, I'll clean up the changes and open a PR 👍
Opened a draft PR. There may be other places where MPS could help; I just haven't had a chance to look.
Hi @manmay-nakhashi, are you planning to make the fast API work on CPU + MPS GPU? Thanks!