Regarding the issues encountered with w_bit 3 quantification #231

langxinspieder · 2024-10-30T10:44:45Z

Very good job!
I have encountered a problem:
I can quantify according to a w_bit 4，128 group, but is there a result of quantifying according to a w_bit 3， 128 group in the article, or is the result in the article based on the simulated PPL index? I ran the instruction that could be quantized according to 4 bits, but modified w_bit 4 by w_bit 3, which resulted in an error, as shown in the figure

What should I do？

terarachang · 2024-11-21T07:07:43Z

Hi, I got a similar issue when generating real quantized weights (w3).

  File "/home/username/llm-awq/awq/quantize/qmodule.py", line 83, in __init__
    raise NotImplementedError("Only 4-bit are supported for now.")

Any solutions?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regarding the issues encountered with w_bit 3 quantification #231

Regarding the issues encountered with w_bit 3 quantification #231

langxinspieder commented Oct 30, 2024

terarachang commented Nov 21, 2024

Regarding the issues encountered with w_bit 3 quantification #231

Regarding the issues encountered with w_bit 3 quantification #231

Comments

langxinspieder commented Oct 30, 2024

terarachang commented Nov 21, 2024