Would GPT4All integration provide a performance improvement? #529

Open
loren-osborn opened this issue Jun 12, 2024 · 2 comments
Labels
feature request Indicates new feature requests.

Comments

@loren-osborn

loren-osborn commented Jun 12, 2024

In the demos I’ve seen of Leon AI, it appeared rather slow. I don’t know whether this was a hardware limitation or whether there are inefficiencies that could be improved upon. GPT4All appears to be rather performant, even on systems without CUDA-compatible GPUs. I have no idea if it is any faster than the inference engine you’re already using.

@loren-osborn loren-osborn added the feature request Indicates new feature requests. label Jun 12, 2024
@louistiti
Member

louistiti commented Jun 12, 2024

Which demos are you referring to? If it's about the former new voice video, then it's because I don't show the tokens being generated for most of the video. But you can see it from here.
Also, it is possible to disable the LLM and use the built-in text classification, which is nearly real-time.

@loren-osborn
Author

I based my recommendation on the performance I saw in this video: https://youtu.be/6CInSt6pTVA?si=oIipaG4Rb07EqSet. I know many local LLM inference and training systems rely heavily on Nvidia CUDA GPUs. I mentioned GPT4All because I knew it leverages AVX CPU instructions and Nomic Vulkan to provide efficient access to LLM inference on Nvidia and AMD GPUs. I’m not sure if Leon currently relies on CUDA for performance, but if so, GPT4All may help you support more hardware.
