Can't quantize kv cache: observer = self.k_observers[layer_idx]
liste index out of range
#1295
Labels
bug
Something isn't working
Describe the bug
Using this recipe:
Results in the error below. If I remove the
kv_cache_scheme
, it works.The model I am quantizing is Llama 3.3 70b.
Expected behavior
A clear and concise description of what you expected to happen.
Environment
Include all relevant environment information:
f7245c8
]: 0.4.1To Reproduce
Exact steps to reproduce the behavior:
Errors
If applicable, add a full print-out of any errors or exceptions that are raised or include screenshots to help explain your problem.
Additional context
Add any other context about the problem here. Also include any relevant files.
The text was updated successfully, but these errors were encountered: