GLiNER quantization #145
Unanswered
carloronsi asked this question in Q&A
Replies: 2 comments 1 reply
Hi everyone! I've tried the quantization code in this notebook:
https://github.com/urchade/GLiNER/blob/main/examples/convert_to_onnx.ipynb
I followed it step by step, but the quantized model (last cell) does not return any entities, and if the threshold is lowered, texts are labeled incorrectly.
Am I missing something? Is there an additional step required to make the quantized model work?
Thank you all!
-
I'm also having a similar problem. When quantizing to 8-bit I'm measuring quite a strong drop in performance, more than I usually see for this type of model.
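For reference, here is a minimal sketch of the 8-bit step under discussion, assuming the model has already been exported to ONNX and that the notebook's last cell relies on onnxruntime's dynamic quantization. The file paths and the QUInt8/per_channel settings below are my own assumptions rather than the notebook's exact values; changing the weight type and enabling per-channel scales are common first things to try when a dynamically quantized encoder stops returning entities or loses accuracy.

```python
# Minimal sketch, assuming the GLiNER model was already exported to ONNX as in
# convert_to_onnx.ipynb. The paths below are hypothetical placeholders.
from onnxruntime.quantization import QuantType, quantize_dynamic

quantize_dynamic(
    model_input="gliner_onnx/model.onnx",             # hypothetical path to the exported FP32 model
    model_output="gliner_onnx/model_quantized.onnx",  # hypothetical output path for the INT8 model
    weight_type=QuantType.QUInt8,                     # unsigned 8-bit weights; an alternative to the default QInt8
    per_channel=True,                                 # per-channel weight scales, which can reduce accuracy loss
)
```

If accuracy still drops badly, quantize_dynamic also accepts nodes_to_exclude, which can keep the most sensitive layers in floating point; exporting to float16 instead of int8 is another option when 8-bit quantization costs too much.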