
Method for Faster Inference #614

Closed
fakerybakery opened this issue Oct 1, 2023 · 5 comments

Comments

@fakerybakery
Contributor

Hi, might it be possible to merge this repository upstream? It's slightly faster than the original version. Thanks!

@jhowilbur

@fakerybakery, can you help me understand what actually makes the other repo faster?
I looked through its commits but couldn't figure out the reason.

@fakerybakery
Contributor Author

fakerybakery commented Oct 13, 2023

Hi @jhowilbur
In the API repo, the authors mentioned that their fork makes inference ~30% faster:

We have made some improvements to Tortoise to make the inference ~30% faster, and welcome contributions on our repo to improve it further!

Source

The commit messages say they made it faster by not moving the models between CPU and GPU.
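To illustrate the idea (a rough sketch with placeholder sub-model names, not the actual Tortoise code): the slow path shuttles each model between CPU and GPU around every forward pass, while the fast path keeps the models resident on the GPU.

```python
def synthesize_offloaded(autoregressive, diffusion, codes, device="cuda"):
    # Slow path: each sub-model's weights are copied host <-> device on every call.
    autoregressive.to(device)
    latents = autoregressive(codes)
    autoregressive.to("cpu")

    diffusion.to(device)
    audio = diffusion(latents)
    diffusion.to("cpu")
    return audio

def synthesize_resident(autoregressive, diffusion, codes):
    # Fast path: the sub-models were moved to the GPU once at load time and stay
    # there, so a call is just the forward passes (more VRAM, no copy overhead).
    latents = autoregressive(codes)
    return diffusion(latents)
```

Keeping everything resident costs more VRAM, which is presumably why the original api.py offloads in the first place.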

Hope this helps!

@fakerybakery
Contributor Author

Hi @manmay-nakhashi, has this been resolved?

@manmay-nakhashi
Collaborator

manmay-nakhashi commented Oct 19, 2023

Yes, we don't do offloading to CPU in api_fast.py.

@fakerybakery
Contributor Author

api_fast.py can't work on MPS (#658), since the fast API requires autocast and autocast isn't yet supported on MPS.

Yes, we don't do offloading to CPU in api_fast.py.

Does this mean faster inference for the regular API is not planned, @manmay-nakhashi?
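
In case it's useful here, a minimal sketch (a hypothetical helper, not something that exists in the repo) of how the autocast dependency could be made device-aware: use torch.autocast on CUDA and a no-op context elsewhere.

```python
import contextlib
import torch

def maybe_autocast(device: torch.device):
    # Mixed precision where autocast is supported (CUDA); on MPS or CPU this
    # falls back to a no-op context and runs in full precision.
    if device.type == "cuda":
        return torch.autocast(device_type="cuda", dtype=torch.float16)
    return contextlib.nullcontext()

device = torch.device(
    "cuda" if torch.cuda.is_available()
    else "mps" if torch.backends.mps.is_available()
    else "cpu"
)

with maybe_autocast(device):
    ...  # run the forward pass here
```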
