Skip to content

Commit

Permalink
Update enpt and move to prod (#42)
Browse files Browse the repository at this point in the history
* Update enpt and move to prod

* Update evaluation results [skip ci]

* Update model registry [skip ci]

Co-authored-by: CircleCI evaluation job <ci-models-evaluation@firefox-translations>
  • Loading branch information
eu9ene and CircleCI evaluation job authored Apr 18, 2022
1 parent e0b29e5 commit 1d1d4da
Show file tree
Hide file tree
Showing 37 changed files with 163 additions and 163 deletions.
1 change: 0 additions & 1 deletion evaluation/dev/en-pt/flores-dev.bergamot.pt.bleu

This file was deleted.

1 change: 0 additions & 1 deletion evaluation/dev/en-pt/flores-dev.google.pt.bleu

This file was deleted.

1 change: 0 additions & 1 deletion evaluation/dev/en-pt/flores-dev.microsoft.pt.bleu

This file was deleted.

1 change: 0 additions & 1 deletion evaluation/dev/en-pt/flores-test.bergamot.pt.bleu

This file was deleted.

1 change: 0 additions & 1 deletion evaluation/dev/en-pt/flores-test.google.pt.bleu

This file was deleted.

Binary file modified evaluation/dev/img/avg.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/dev/img/en-fa.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/dev/img/en-it.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/dev/img/en-ru.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/dev/img/ru-en.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
72 changes: 31 additions & 41 deletions evaluation/dev/results.md
Original file line number Diff line number Diff line change
Expand Up @@ -57,73 +57,63 @@ Both absolute and relative differences in BLEU scores between Bergamot and other

## avg

| Translator/Dataset | en-it | en-pt | ru-en | en-ru | en-fa | fa-en | is-en |
| --- | --- | --- | --- | --- | --- | --- | --- |
| bergamot | 29.53 | 46.55 | 33.37 | 30.47 | 17.30 | 28.70 | 23.50 |
| google | 29.47 (-0.07, -0.23%) | 56.05 (+9.50, +20.41%) | 36.53 (+3.15, +9.45%) | 33.72 (+3.25, +10.67%) | 27.70 (+10.40, +60.12%) | 36.05 (+7.35, +25.61%) | 34.95 (+11.45, +48.72%) |
| microsoft | 32.20 (+2.67, +9.03%) | 50.25 (+3.70, +7.95%) | 36.87 (+3.50, +10.49%) | 33.68 (+3.21, +10.53%) | 20.50 (+3.20, +18.50%) | 36.15 (+7.45, +25.96%) | 34.90 (+11.40, +48.51%) |
| Translator/Dataset | fa-en | ru-en | en-ru | en-it | en-fa | is-en |
| --- | --- | --- | --- | --- | --- | --- |
| bergamot | 28.70 | 33.37 | 30.47 | 29.53 | 17.30 | 23.50 |
| google | 36.05 (+7.35, +25.61%) | 36.53 (+3.15, +9.45%) | 33.72 (+3.25, +10.67%) | 29.47 (-0.07, -0.23%) | 27.70 (+10.40, +60.12%) | 34.95 (+11.45, +48.72%) |
| microsoft | 36.15 (+7.45, +25.96%) | 36.87 (+3.50, +10.49%) | 33.68 (+3.21, +10.53%) | 32.20 (+2.67, +9.03%) | 20.50 (+3.20, +18.50%) | 34.90 (+11.40, +48.51%) |

![Results](img/avg.png)

## en-it

| Translator/Dataset | flores-test | flores-dev | wmt09 |
| --- | --- | --- | --- |
| bergamot | 29.20 | 28.40 | 31.00 |
| google | 30.10 (+0.90, +3.08%) | 29.00 (+0.60, +2.11%) | 29.30 (-1.70, -5.48%) |
| microsoft | 32.00 (+2.80, +9.59%) | 31.00 (+2.60, +9.15%) | 33.60 (+2.60, +8.39%) |

![Results](img/en-it.png)

## en-pt
## fa-en

| Translator/Dataset | flores-test | flores-dev |
| Translator/Dataset | flores-dev | flores-test |
| --- | --- | --- |
| bergamot | 46.20 | 46.90 |
| google | 55.70 (+9.50, +20.56%) | 56.40 (+9.50, +20.26%) |
| microsoft | 50.70 (+4.50, +9.74%) | 49.80 (+2.90, +6.18%) |
| bergamot | 29.10 | 28.30 |
| google | 36.70 (+7.60, +26.12%) | 35.40 (+7.10, +25.09%) |
| microsoft | 36.50 (+7.40, +25.43%) | 35.80 (+7.50, +26.50%) |

![Results](img/en-pt.png)
![Results](img/fa-en.png)

## ru-en

| Translator/Dataset | mtedx_test | wmt19 | wmt17 | flores-dev | flores-test | wmt14 | wmt15 | wmt16 | wmt13 | wmt18 | wmt20 |
| Translator/Dataset | wmt17 | wmt18 | wmt14 | mtedx_test | wmt16 | wmt13 | wmt19 | wmt20 | flores-dev | flores-test | wmt15 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| bergamot | 24.20 | 39.30 | 37.90 | 31.90 | 31.50 | 38.00 | 33.70 | 33.40 | 29.50 | 32.30 | 35.40 |
| google | 25.00 (+0.80, +3.31%) | 42.40 (+3.10, +7.89%) | 41.50 (+3.60, +9.50%) | 37.00 (+5.10, +15.99%) | 35.50 (+4.00, +12.70%) | 41.20 (+3.20, +8.42%) | 37.50 (+3.80, +11.28%) | 36.60 (+3.20, +9.58%) | 31.40 (+1.90, +6.44%) | 36.00 (+3.70, +11.46%) | 37.70 (+2.30, +6.50%) |
| microsoft | 26.10 (+1.90, +7.85%) | 42.60 (+3.30, +8.40%) | 41.60 (+3.70, +9.76%) | 36.20 (+4.30, +13.48%) | 36.10 (+4.60, +14.60%) | 41.70 (+3.70, +9.74%) | 37.80 (+4.10, +12.17%) | 37.60 (+4.20, +12.57%) | 31.20 (+1.70, +5.76%) | 36.90 (+4.60, +14.24%) | 37.80 (+2.40, +6.78%) |
| bergamot | 37.90 | 32.30 | 38.00 | 24.20 | 33.40 | 29.50 | 39.30 | 35.40 | 31.90 | 31.50 | 33.70 |
| google | 41.50 (+3.60, +9.50%) | 36.00 (+3.70, +11.46%) | 41.20 (+3.20, +8.42%) | 25.00 (+0.80, +3.31%) | 36.60 (+3.20, +9.58%) | 31.40 (+1.90, +6.44%) | 42.40 (+3.10, +7.89%) | 37.70 (+2.30, +6.50%) | 37.00 (+5.10, +15.99%) | 35.50 (+4.00, +12.70%) | 37.50 (+3.80, +11.28%) |
| microsoft | 41.60 (+3.70, +9.76%) | 36.90 (+4.60, +14.24%) | 41.70 (+3.70, +9.74%) | 26.10 (+1.90, +7.85%) | 37.60 (+4.20, +12.57%) | 31.20 (+1.70, +5.76%) | 42.60 (+3.30, +8.40%) | 37.80 (+2.40, +6.78%) | 36.20 (+4.30, +13.48%) | 36.10 (+4.60, +14.60%) | 37.80 (+4.10, +12.17%) |

![Results](img/ru-en.png)

## en-ru

| Translator/Dataset | wmt16 | wmt15 | flores-dev | wmt18 | wmt14 | wmt17 | wmt20 | wmt13 | wmt19 | flores-test |
| Translator/Dataset | wmt15 | flores-dev | wmt13 | wmt16 | wmt20 | flores-test | wmt19 | wmt17 | wmt18 | wmt14 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| bergamot | 31.60 | 31.80 | 30.20 | 29.00 | 38.50 | 34.10 | 22.00 | 26.70 | 31.70 | 29.10 |
| google | 34.00 (+2.40, +7.59%) | 35.00 (+3.20, +10.06%) | 33.60 (+3.40, +11.26%) | 34.10 (+5.10, +17.59%) | 43.40 (+4.90, +12.73%) | 37.30 (+3.20, +9.38%) | 26.40 (+4.40, +20.00%) | 27.60 (+0.90, +3.37%) | 32.20 (+0.50, +1.58%) | 33.60 (+4.50, +15.46%) |
| microsoft | 33.80 (+2.20, +6.96%) | 35.60 (+3.80, +11.95%) | 33.20 (+3.00, +9.93%) | 33.20 (+4.20, +14.48%) | 44.10 (+5.60, +14.55%) | 38.10 (+4.00, +11.73%) | 26.00 (+4.00, +18.18%) | 27.10 (+0.40, +1.50%) | 32.70 (+1.00, +3.15%) | 33.00 (+3.90, +13.40%) |
| bergamot | 31.80 | 30.20 | 26.70 | 31.60 | 22.00 | 29.10 | 31.70 | 34.10 | 29.00 | 38.50 |
| google | 35.00 (+3.20, +10.06%) | 33.60 (+3.40, +11.26%) | 27.60 (+0.90, +3.37%) | 34.00 (+2.40, +7.59%) | 26.40 (+4.40, +20.00%) | 33.60 (+4.50, +15.46%) | 32.20 (+0.50, +1.58%) | 37.30 (+3.20, +9.38%) | 34.10 (+5.10, +17.59%) | 43.40 (+4.90, +12.73%) |
| microsoft | 35.60 (+3.80, +11.95%) | 33.20 (+3.00, +9.93%) | 27.10 (+0.40, +1.50%) | 33.80 (+2.20, +6.96%) | 26.00 (+4.00, +18.18%) | 33.00 (+3.90, +13.40%) | 32.70 (+1.00, +3.15%) | 38.10 (+4.00, +11.73%) | 33.20 (+4.20, +14.48%) | 44.10 (+5.60, +14.55%) |

![Results](img/en-ru.png)

## en-fa
## en-it

| Translator/Dataset | flores-test | flores-dev |
| --- | --- | --- |
| bergamot | 17.40 | 17.20 |
| google | 28.20 (+10.80, +62.07%) | 27.20 (+10.00, +58.14%) |
| microsoft | 21.10 (+3.70, +21.26%) | 19.90 (+2.70, +15.70%) |
| Translator/Dataset | flores-dev | flores-test | wmt09 |
| --- | --- | --- | --- |
| bergamot | 28.40 | 29.20 | 31.00 |
| google | 29.00 (+0.60, +2.11%) | 30.10 (+0.90, +3.08%) | 29.30 (-1.70, -5.48%) |
| microsoft | 31.00 (+2.60, +9.15%) | 32.00 (+2.80, +9.59%) | 33.60 (+2.60, +8.39%) |

![Results](img/en-fa.png)
![Results](img/en-it.png)

## fa-en
## en-fa

| Translator/Dataset | flores-dev | flores-test |
| --- | --- | --- |
| bergamot | 29.10 | 28.30 |
| google | 36.70 (+7.60, +26.12%) | 35.40 (+7.10, +25.09%) |
| microsoft | 36.50 (+7.40, +25.43%) | 35.80 (+7.50, +26.50%) |
| bergamot | 17.20 | 17.40 |
| google | 27.20 (+10.00, +58.14%) | 28.20 (+10.80, +62.07%) |
| microsoft | 19.90 (+2.70, +15.70%) | 21.10 (+3.70, +21.26%) |

![Results](img/fa-en.png)
![Results](img/en-fa.png)

## is-en

Expand Down
1 change: 1 addition & 0 deletions evaluation/prod/en-pt/flores-dev.bergamot.pt.bleu
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
49.4
1 change: 1 addition & 0 deletions evaluation/prod/en-pt/flores-dev.google.pt.bleu
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
53.6
1 change: 1 addition & 0 deletions evaluation/prod/en-pt/flores-dev.microsoft.pt.bleu
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
49.6
1 change: 1 addition & 0 deletions evaluation/prod/en-pt/flores-test.bergamot.pt.bleu
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
50.3
1 change: 1 addition & 0 deletions evaluation/prod/en-pt/flores-test.google.pt.bleu
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
53.9
Binary file modified evaluation/prod/img/avg.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/cs-en.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/de-en.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/en-bg.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/en-cs.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/en-de.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/en-es.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/en-et.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added evaluation/prod/img/en-pt.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/es-en.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/et-en.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified evaluation/prod/img/it-en.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading

0 comments on commit 1d1d4da

Please sign in to comment.