Add LabelScorer base class #80

SimBe195 · 2024-10-31T14:43:41Z

Abstract base class for scoring tokens within an ASR search algorithm.

This class provides an interface for the different types of label scorers in an ASR system. A label scorer computes the scores of tokens based on input features and a scoring context. Subclasses of this base class represent various ASR model architectures, such as CTC, transducer, AED or other models.
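To make the shape of such an interface concrete, here is a minimal, simplified C++17 sketch. It is not the code from this PR: the types `ScoringContext`, `Request` and `ScoreWithTime`, the method name `extendedScoringContext`, the toy subclass `FramewiseScorer` and all signatures are assumptions for illustration. Only the method names that appear in this description and in the commit titles are taken from the PR.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <optional>
#include <vector>

// Hypothetical, simplified stand-ins; the types in the PR differ.
struct ScoringContext {
    virtual ~ScoringContext() = default;
};
using ScoringContextRef = std::shared_ptr<ScoringContext const>;

struct Request {
    ScoringContextRef context;    // scoring context of the hypothesis
    std::size_t       nextToken;  // successor token to score
};

struct ScoreWithTime {
    float       score;
    std::size_t timeframe;
};

class LabelScorer {
public:
    virtual ~LabelScorer() = default;

    virtual void reset()                                                               = 0;
    virtual void signalNoMoreFeatures()                                                = 0;
    virtual void addInput(std::vector<float> const& feature)                           = 0;
    virtual ScoringContextRef getInitialScoringContext()                               = 0;
    virtual ScoringContextRef extendedScoringContext(Request const& request)           = 0;  // assumed name
    virtual std::optional<ScoreWithTime> getScoreWithTime(Request const& request)      = 0;

    // Fallback batched scoring: score the requests one by one and return no
    // value if any of them cannot be scored yet. Subclasses may override this.
    virtual std::optional<std::vector<ScoreWithTime>> getScoresWithTimes(std::vector<Request> const& requests) {
        std::vector<ScoreWithTime> results;
        results.reserve(requests.size());
        for (auto const& request : requests) {
            auto result = getScoreWithTime(request);
            if (!result) {
                return std::nullopt;
            }
            results.push_back(*result);
        }
        return results;
    }
};

// Toy subclass: the context is only a timestep, and the score of token k at
// step t is the k-th value of the t-th feature frame.
struct StepContext : ScoringContext {
    explicit StepContext(std::size_t t) : timestep(t) {}
    std::size_t timestep;
};

class FramewiseScorer : public LabelScorer {
public:
    void reset() override { frames_.clear(); }
    void signalNoMoreFeatures() override {}  // no right context, nothing to flush
    void addInput(std::vector<float> const& feature) override { frames_.push_back(feature); }
    ScoringContextRef getInitialScoringContext() override { return std::make_shared<StepContext>(0); }
    ScoringContextRef extendedScoringContext(Request const& request) override {
        return std::make_shared<StepContext>(step(request) + 1);
    }
    std::optional<ScoreWithTime> getScoreWithTime(Request const& request) override {
        std::size_t const t = step(request);
        if (t >= frames_.size()) {
            return std::nullopt;  // frame t has not been added yet
        }
        return ScoreWithTime{frames_[t].at(request.nextToken), t};
    }

private:
    static std::size_t step(Request const& request) {
        auto const* context = dynamic_cast<StepContext const*>(request.context.get());
        assert(context != nullptr && "request must carry a StepContext");
        return context->timestep;
    }
    std::vector<std::vector<float>> frames_;
};
```

In this sketch the search only ever sees `ScoringContextRef`, while the subclass casts back to its own context type internally.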

The intended usage is as follows:

1. Before or during the search, features can be added.
2. At the beginning of the search, getInitialScoringContext should be called and its result used for the first hypotheses.
3. For a given hypothesis in the search, its scoring context together with a successor token and transition type are packed into a request and scored via getScoreWithTime. This also returns the timestamp of the successor.
   - Note: The scoring function may return no value. In this case the label scorer is not ready yet and needs more input features.
   - Note: There is also the function getScoresWithTimes, which handles an entire batch of requests at once and might be implemented more efficiently (e.g. using batched model forwarding).
4. For all hypotheses that survive pruning, the LabelScorer can compute a new scoring context that extends the previous scoring context of that hypothesis with a given successor token. This new scoring context can then be used as context in subsequent search steps.
5. After all features have been passed, the signalNoMoreFeatures function is called to inform the label scorer that it doesn't need to wait for more features and can score as much as possible. This is especially important when the label scorer internally uses an encoder or window with right context.
6. When all necessary scores for the current segment have been computed, the reset function is called to clean up any internal data (e.g. feature buffer) or reset flags of the LabelScorer. Afterwards it is ready to receive features for the next segment.
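The steps above can be sketched as a small greedy, single-hypothesis loop. This is a self-contained C++17 toy under stated assumptions, not the search code from this PR: `ToyScorer`, `Context`, `Request`, `ScoreWithTime`, `extendedScoringContext` and `greedyDecode` are hypothetical names, the context is a plain value instead of an opaque reference, the transition type is omitted, and lower scores are assumed to be better. The toy scorer needs one frame of right context, which shows why signalNoMoreFeatures matters.

```cpp
#include <cassert>
#include <cstddef>
#include <optional>
#include <vector>

// Hypothetical toy types; a real scoring context is opaque to the search.
struct Context {
    std::size_t timestep;
};
struct Request {
    Context     context;
    std::size_t nextToken;
};
struct ScoreWithTime {
    float       score;
    std::size_t timeframe;
};

// Toy scorer: the score of token k at step t is the k-th value of frame t.
// Frame t can only be scored once frame t+1 has arrived or the end of the
// segment has been signalled (one frame of right context).
class ToyScorer {
public:
    void addInput(std::vector<float> const& frame) { frames_.push_back(frame); }
    void signalNoMoreFeatures() { finished_ = true; }
    void reset() {
        frames_.clear();
        finished_ = false;
    }

    Context getInitialScoringContext() const { return Context{0}; }
    Context extendedScoringContext(Request const& request) const {
        return Context{request.context.timestep + 1};
    }

    std::optional<ScoreWithTime> getScoreWithTime(Request const& request) const {
        std::size_t const t      = request.context.timestep;
        std::size_t const needed = finished_ ? t + 1 : t + 2;
        if (frames_.size() < needed) {
            return std::nullopt;  // not ready yet, more features required
        }
        return ScoreWithTime{frames_[t].at(request.nextToken), t};
    }

private:
    std::vector<std::vector<float>> frames_;
    bool                            finished_ = false;
};

// Greedy decoding of one segment, following steps 1 to 6.
std::vector<std::size_t> greedyDecode(ToyScorer& scorer, std::vector<std::vector<float>> const& features, std::size_t numTokens) {
    assert(numTokens > 0 && "need at least one token to score");
    std::vector<std::size_t> result;
    Context                  context = scorer.getInitialScoringContext();  // step 2

    // Extend the hypothesis for as long as the scorer is able to score.
    auto advance = [&]() {
        while (true) {
            std::optional<ScoreWithTime> best;
            std::size_t                  bestToken = 0;
            for (std::size_t token = 0; token < numTokens; ++token) {
                auto scored = scorer.getScoreWithTime(Request{context, token});  // step 3
                if (!scored) {
                    return;  // scorer is waiting for more features
                }
                if (!best || scored->score < best->score) {
                    best      = scored;
                    bestToken = token;
                }
            }
            result.push_back(bestToken);
            context = scorer.extendedScoringContext(Request{context, bestToken});  // step 4
        }
    };

    for (auto const& frame : features) {
        scorer.addInput(frame);  // step 1
        advance();
    }
    scorer.signalNoMoreFeatures();  // step 5
    advance();
    scorer.reset();  // step 6
    return result;
}
```

Because of the right context, the last frame in this toy is only scored after signalNoMoreFeatures has been called. A real search would keep many hypotheses and could use getScoresWithTimes to score them in one batch.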

Each concrete subclass internally implements a concrete type of scoring context which the outside search algorithm is agnostic to. Depending on the model, this scoring context can consist of things like the current timestep, a label history, a hidden state or other values.
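As an illustration of how different models could carry different scoring contexts behind one base type, here is a minimal C++17 sketch. All names (`ScoringContext`, `isEqual`, `StepContext`, `HistoryContext`, `extend`, `sameContext`) are assumptions made for this example and are not confirmed by the PR. The search-side helper only uses the base-class interface.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical base type that the search handles without knowing its contents.
struct ScoringContext {
    virtual ~ScoringContext() = default;
    virtual bool isEqual(ScoringContext const& other) const = 0;
};
using ScoringContextRef = std::shared_ptr<ScoringContext const>;

// Context for a framewise model (e.g. CTC-like): only the current timestep.
struct StepContext : ScoringContext {
    explicit StepContext(std::size_t t) : timestep(t) {}
    bool isEqual(ScoringContext const& other) const override {
        auto const* o = dynamic_cast<StepContext const*>(&other);
        return o != nullptr && o->timestep == timestep;
    }
    std::size_t timestep;
};

// Context for a label-synchronous model (e.g. AED-like): the label history.
struct HistoryContext : ScoringContext {
    explicit HistoryContext(std::vector<int> labels) : history(std::move(labels)) {}
    bool isEqual(ScoringContext const& other) const override {
        auto const* o = dynamic_cast<HistoryContext const*>(&other);
        return o != nullptr && o->history == history;
    }
    std::vector<int> history;
};

// Extending a context creates a new object, so the previous context stays
// valid for other hypotheses that still refer to it.
ScoringContextRef extend(HistoryContext const& context, int label) {
    assert(label >= 0 && "labels are assumed to be non-negative indices");
    std::vector<int> history = context.history;
    history.push_back(label);
    return std::make_shared<HistoryContext>(std::move(history));
}

// Search-side helper: compares two contexts purely via the base interface.
bool sameContext(ScoringContextRef const& a, ScoringContextRef const& b) {
    return a && b && a->isEqual(*b);
}
```

A hidden state or other model-specific values could be added to a context type in the same way without changing the search code.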

This PR depends on #78.

src/Nn/LabelScorer/LabelScorer.hh

curufinwe

Please write const references as std::vector<int> const& instead of const std::vector<int>& for better consistency with recently written RASR code.
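For illustration, a short C++17 sketch of the requested placement. The function `lastLabel` is a hypothetical example and not part of the PR. The `static_assert` shows that both spellings name the same type, so the change is purely stylistic.

```cpp
#include <cassert>
#include <type_traits>
#include <vector>

// Both spellings denote exactly the same type; only the position of `const` differs.
static_assert(std::is_same_v<std::vector<int> const&, const std::vector<int>&>);

// Requested style: `const` follows the type it qualifies.
int lastLabel(std::vector<int> const& labels) {
    assert(!labels.empty() && "label history must not be empty");
    return labels.back();
}
```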

src/Nn/LabelScorer/LabelScorer.hh

src/Nn/LabelScorer/ScoringContext.cc

src/Search/SearchV2.hh

Add LabelScorer base class

34aa5ca

SimBe195 requested review from curufinwe, Marvin84, larissakl and NurAd-Din October 31, 2024 14:43

larissakl reviewed Nov 1, 2024

src/Nn/LabelScorer/LabelScorer.hh (outdated, resolved)

src/Nn/LabelScorer/LabelScorer.hh (outdated, resolved)

Marvin84 reviewed Nov 7, 2024

src/Nn/LabelScorer/LabelScorer.hh (resolved)

Simon Berger added 3 commits November 8, 2024 13:46

Consistent naming of timestep

85ff2ad

Add batched versions of addInput

8248f0c

Move transition type to search

62b3526

larissakl approved these changes Nov 8, 2024

curufinwe requested changes Nov 11, 2024

Simon Berger added 3 commits December 5, 2024 11:00

Apply suggestions from code review

5c6c8d3

Add more versions of addInput[s] methods

76d57ff

Merge branch 'collapsed-vector' into labelscorer-base

2692e2a

SimBe195 requested a review from curufinwe December 5, 2024 10:17
