
Support BERT finetuned on SQuAD #1

Open
Pierrci opened this issue Nov 18, 2019 · 3 comments
Assignees
Labels
enhancement New feature or request

Comments

@Pierrci
Member

Pierrci commented Nov 18, 2019

Since the tokenizer is the same as MobileBERT/DistilBERT's, this would be pretty straightforward to add once this TensorFlow issue is solved: tensorflow/tensorflow#34210

@Pierrci Pierrci added the enhancement New feature or request label Nov 18, 2019
@Pierrci Pierrci self-assigned this Nov 18, 2019
@ucalyptus

@Pierrci can this be closed now that 34210 is closed?

@Pierrci
Member Author

Pierrci commented Jan 6, 2020

The non-quantized TFLite version is around 1 GB, which is way too big for a mobile app.
I'll close this once FP16 quantization works, so we can use a model with a reduced size and good performance; that's not the case for now (at least as of my last attempt, last Friday).

@csarron

csarron commented Mar 24, 2020

Hi @Pierrci, were you able to quantize BERT for TFLite? I tried a few options but failed to get a quantized model.
