Adding CER/WER metrics #3418
base: main
Conversation
… cer.rs and mod.rs were changed in the process. All tests pass.
…ementation works on words now as tokens, rather than chars.
Codecov Report
❌ Patch coverage is
❌ Your project check has failed because the head coverage (63.50%) is below the target coverage (80.00%). You can increase the head coverage or adjust the target coverage.
Additional details and impacted files
@@ Coverage Diff @@
## main #3418 +/- ##
==========================================
+ Coverage 63.43% 63.50% +0.07%
==========================================
Files 981 983 +2
Lines 109705 109925 +220
==========================================
+ Hits 69589 69807 +218
- Misses 40116 40118 +2
Thanks for contributing these additional metrics!
I only have a couple of comments regarding the implementation.
/edit: ignore the failed test on CUDA CI, totally unrelated.
self.state.update(
    value,
    batch_size,
    FormatOptions::new(self.name()).unit("%").precision(2),
)
I think the state might need to keep track of the errors and total characters (or words for WER)? Otherwise aggregation might be incorrect 🤔 This would require a new state type, though.
Hmm, that sounds correct. However, the value here already includes the errors relative to the total characters, since value = total_edit_distance / total_characters * 100, so why would we need to keep the total characters?
For the current batch that's accurate, but when aggregated for an epoch it might be incorrect since this is a numeric state (not all batches have the same composition). Probably out of scope for this PR so no worries 👍
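To make the aggregation concern concrete, here is a small standalone sketch (the `BatchStats` helper and both function names are hypothetical, not Burn's API): averaging the per-batch CER values weights every batch equally, while tracking raw edit and character counts and dividing once gives the true epoch-level rate. The two disagree whenever batches differ in size.

```rust
/// Per-batch edit statistics (hypothetical helper, not part of the PR).
struct BatchStats {
    edit_distance: usize,
    total_chars: usize,
}

/// Averaging per-batch CER values weights each batch equally,
/// regardless of how many characters it contains.
fn mean_of_batch_cers(batches: &[BatchStats]) -> f64 {
    let sum: f64 = batches
        .iter()
        .map(|b| b.edit_distance as f64 / b.total_chars as f64)
        .sum();
    100.0 * sum / batches.len() as f64
}

/// Tracking raw counts and dividing once yields the true epoch-level CER.
fn pooled_cer(batches: &[BatchStats]) -> f64 {
    let edits: usize = batches.iter().map(|b| b.edit_distance).sum();
    let chars: usize = batches.iter().map(|b| b.total_chars).sum();
    100.0 * edits as f64 / chars as f64
}

fn main() {
    // One small batch with many errors, one large batch with few.
    let batches = [
        BatchStats { edit_distance: 5, total_chars: 10 },   // 50% CER
        BatchStats { edit_distance: 10, total_chars: 990 }, // ~1% CER
    ];
    println!("mean of batch CERs: {:.2}%", mean_of_batch_cers(&batches)); // ~25.51%
    println!("pooled CER:         {:.2}%", pooled_cer(&batches));         // 1.50%
}
```

This is why a dedicated state holding the running error and token counts (rather than a numeric average of ratios) would be needed for exact epoch aggregation.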
Sorry for the late follow-up! Didn't see this was updated. Please explicitly re-request a review so I know when changes have been applied 🙂
LGTM, just minor formatting issues to fix.
/// deletions, or substitutions) required to change one sequence into the other. This
/// implementation is optimized for space, using only two rows of the dynamic programming table.
///
pub fn edit_distance(reference: &[i32], prediction: &[i32]) -> usize {
We can mark it as pub(crate) only.
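For readers following along, a two-row Levenshtein distance matching the signature above might look like the following sketch (this is an illustration consistent with the doc comment, not the PR's exact code):

```rust
/// Levenshtein edit distance between two token sequences. Only two rows
/// of the DP table are kept, so memory is O(prediction length).
pub(crate) fn edit_distance(reference: &[i32], prediction: &[i32]) -> usize {
    // prev[j] = distance between the empty prefix of `reference`
    // and prediction[..j]; i.e. j insertions.
    let mut prev: Vec<usize> = (0..=prediction.len()).collect();
    let mut curr = vec![0usize; prediction.len() + 1];

    for (i, &r) in reference.iter().enumerate() {
        curr[0] = i + 1; // deleting the first i+1 reference tokens
        for (j, &p) in prediction.iter().enumerate() {
            let cost = if r == p { 0 } else { 1 };
            curr[j + 1] = (prev[j] + cost)   // substitution (or match)
                .min(prev[j + 1] + 1)        // deletion
                .min(curr[j] + 1);           // insertion
        }
        std::mem::swap(&mut prev, &mut curr);
    }
    prev[prediction.len()]
}

fn main() {
    // Classic example: "kitten" -> "sitting" needs 3 edits.
    let a: Vec<i32> = "kitten".bytes().map(|b| b as i32).collect();
    let b: Vec<i32> = "sitting".bytes().map(|b| b as i32).collect();
    assert_eq!(edit_distance(&a, &b), 3);
}
```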
Pull Request Template
Checklist
- The cargo run-checks command has been executed.
Related Issues/PRs
The PR addresses Issue #2649 and implements two new metrics for sequence evaluation in NLP and related fields: the character error rate (CER) and word error rate (WER). These are the same error metric applied at the character and word levels, respectively.
Changes
The cer.rs file provides an implementation of the Character Error Rate metric. CER measures the percentage of characters that are incorrect (insertions, deletions, or substitutions) in the predicted sequence compared to the reference sequence; the edit distance is computed with the Levenshtein algorithm via dynamic programming.
The wer.rs file implements the Word Error Rate metric. WER is similar to CER but operates at the word level, measuring the percentage of words that are incorrect in the predicted sequence.
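As a concrete illustration of the character-level metric (helper names here are hypothetical, not the PR's API), CER divides the Levenshtein distance between the character sequences by the reference length:

```rust
/// Minimal Levenshtein distance, used here at the character level.
fn edit_distance(reference: &[char], prediction: &[char]) -> usize {
    let mut prev: Vec<usize> = (0..=prediction.len()).collect();
    for (i, &r) in reference.iter().enumerate() {
        let mut curr = vec![i + 1];
        for (j, &p) in prediction.iter().enumerate() {
            let cost = if r == p { 0 } else { 1 };
            curr.push((prev[j] + cost).min(prev[j + 1] + 1).min(curr[j] + 1));
        }
        prev = curr;
    }
    prev[prediction.len()]
}

/// CER as a percentage of the reference length (hypothetical helper).
fn cer_percent(reference: &str, prediction: &str) -> f64 {
    let r: Vec<char> = reference.chars().collect();
    let p: Vec<char> = prediction.chars().collect();
    100.0 * edit_distance(&r, &p) as f64 / r.len() as f64
}

fn main() {
    // "kitten" -> "sitting": 2 substitutions + 1 insertion = 3 edits
    // over 6 reference characters -> 50% CER.
    println!("{}", cer_percent("kitten", "sitting")); // 50
}
```

WER follows the same recipe with words as tokens instead of characters.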
Testing
Testing was done with unit tests, where each function (for example, test_wer_without_padding and test_wer_with_padding) is a unit test for the WerMetric implementation; the same applies to CER.
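To show what the padding case checks, here is a self-contained sketch of that kind of test (the `PAD` constant, `strip_padding`, and `wer_percent` helpers are hypothetical; the PR's actual tests target WerMetric): padding tokens appended to a sequence must not change its score.

```rust
/// Hypothetical padding token id used only in this sketch.
const PAD: i32 = 0;

/// Drop padding tokens before scoring.
fn strip_padding(tokens: &[i32]) -> Vec<i32> {
    tokens.iter().copied().filter(|&t| t != PAD).collect()
}

/// Minimal Levenshtein distance over tokens.
fn edit_distance(reference: &[i32], prediction: &[i32]) -> usize {
    let mut prev: Vec<usize> = (0..=prediction.len()).collect();
    for (i, &r) in reference.iter().enumerate() {
        let mut curr = vec![i + 1];
        for (j, &p) in prediction.iter().enumerate() {
            let cost = if r == p { 0 } else { 1 };
            curr.push((prev[j] + cost).min(prev[j + 1] + 1).min(curr[j] + 1));
        }
        prev = curr;
    }
    prev[prediction.len()]
}

/// WER as a percentage of the (unpadded) reference length.
fn wer_percent(reference: &[i32], prediction: &[i32]) -> f64 {
    let r = strip_padding(reference);
    let p = strip_padding(prediction);
    100.0 * edit_distance(&r, &p) as f64 / r.len() as f64
}

fn main() {
    // Without padding: one substitution among four words -> 25% WER.
    assert_eq!(wer_percent(&[1, 2, 3, 4], &[1, 9, 3, 4]), 25.0);
    // With padding: trailing PAD tokens must not count as errors,
    // so the padded prediction scores the same as the unpadded one.
    assert_eq!(wer_percent(&[1, 2, 3, 4], &[1, 9, 3, 4, PAD, PAD]), 25.0);
    println!("padding tests passed");
}
```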