Skip to content

Issues: turboderp/exllamav2

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[BUG] lmformatenforcer integration seems to be broken on new versions bug Something isn't working
#696 opened Dec 11, 2024 by hvico
3 tasks done
[REQUEST] EXAONE 3.5 Support
#695 opened Dec 9, 2024 by necrogay
3 tasks done
[BUG] ExLlamaV2DynamicGenerator class is not multiple threads supported bug Something isn't working
#690 opened Nov 29, 2024 by UTSAV-44
3 tasks done
[BUG] generator.iterate() returns corrupted result objects in some cases bug Something isn't working
#689 opened Nov 29, 2024 by p-e-w
3 tasks done
[REQUEST] High throughput with large batch size
#686 opened Nov 26, 2024 by fzyzcjy
3 tasks done
[BUG] Speculative decoding regresses performance on 7900 xtx under ROCM bug Something isn't working
#685 opened Nov 25, 2024 by Mushoz
3 tasks done
qwen coder32b run on colab t4 bug Something isn't working
#682 opened Nov 23, 2024 by werruww
3 tasks done
[REQUEST] Can we have 1.0/1.5 bpw internally?
#675 opened Nov 17, 2024 by Originalimoc
3 tasks done
[BUG] [Qwen] Draft model produce garbage output bug Something isn't working
#674 opened Nov 14, 2024 by Nepherpitou
3 tasks done
[REQUEST] Support for a Qwen based vision model
#672 opened Nov 12, 2024 by TyraVex
3 tasks done
[REQUEST] Synthetic Data generation features
#669 opened Nov 3, 2024 by AstrisCantCode
3 tasks done
[BUG] AMD - Out of memory errors despite having plenty of VRAM bug Something isn't working
#662 opened Oct 27, 2024 by RSAStudioGames
3 tasks done
[REQUEST] Faster 6/8-bit EXL2 quantization
#660 opened Oct 19, 2024 by grimulkan
3 tasks done
[BUG] Appending-Runtime-LoRA-weights bug Something isn't working
#656 opened Oct 16, 2024 by royallavanya140
3 tasks done
[BUG] Convert script fails to run on master branch as of v0.2.3 bug Something isn't working
#655 opened Oct 15, 2024 by iamwavecut
3 tasks done
[BUG] RAM UTILISATION IS INCREASING RAPIDLY bug Something isn't working
#639 opened Sep 25, 2024 by UTSAV-44
[BUG] Random slowdowns in tensor parallel. bug Something isn't working
#630 opened Sep 21, 2024 by Ph0rk0z
3 tasks done
[BUG] Quantization of Qwen return garbage bug Something isn't working
#621 opened Sep 10, 2024 by fahadh4ilyas
3 tasks done
ProTip! Find all open issues with in progress development work with linked:pr.