Fixes prediction across all architectures #273

kylebgorman · 2024-11-30T20:20:00Z

Two bugs found by testing prediction across all architectures:

Hard attention, some RNN-backed pointer-generator, and transducer models all inherit the beam search implementation in RNNModel, but it is incompatible with them, so it needs to be disabled. Failing to do this results in an inscrutable error instead of a NotImplementedError. This is fixed in all locations. In each case I placed the exception relatively "low" (i.e., more derived) in the class hierarchy, so if someone actually did implement beam_decode on these classes, everything would just work.
The base model implementation of the prediction loop now has what seems like the obvious polymorphic return type, which restores compatibility with hard attention. I suspect this was a minor regression introduced in Added beam search for LSTM #257; it has been fixed.
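
To illustrate the first fix, here is a minimal sketch of disabling an inherited beam search "low" in the hierarchy. The class names, the `beam_width` parameter, the `predict` dispatch, and the string-based decoders are all hypothetical stand-ins, not the actual yoyodyne code; only the names `beam_decode` and `greedy_decode` come from this PR.

```python
class RNNModel:
    """Hypothetical stand-in for a base class that implements beam search."""

    def __init__(self, beam_width: int = 1):
        self.beam_width = beam_width

    def beam_decode(self, source: str) -> list:
        # Placeholder for a real beam search.
        return [f"beam({source})"]

    def greedy_decode(self, source: str) -> list:
        # Placeholder for a real greedy search.
        return [f"greedy({source})"]

    def predict(self, source: str) -> list:
        # Dispatches on beam width (an assumption made for this sketch).
        if self.beam_width > 1:
            return self.beam_decode(source)
        return self.greedy_decode(source)


class HardAttentionModel(RNNModel):
    """Hypothetical derived class for which the inherited beam search is invalid."""

    def beam_decode(self, source: str) -> list:
        # Fail loudly and clearly rather than with an inscrutable error.
        raise NotImplementedError(
            f"Beam search is not implemented for {self.__class__.__name__}"
        )


class CustomHardAttentionModel(HardAttentionModel):
    """Hypothetical subclass that supplies its own compatible beam search."""

    def beam_decode(self, source: str) -> list:
        return [f"custom-beam({source})"]
```

Because the exception lives in the override on the derived class rather than in a check in the base class, a further subclass that overrides `beam_decode` replaces the exception without touching anything else.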

Clean-ups done at the same time:

Standardizes the name of the end-of-sequence symbol: the documentation called it EOS at various points, but the code calls it END and its tag is <E>, so it is now END everywhere.
Uses the names beam_decode and greedy_decode everywhere.

Adamits

Left one nit, but LGTM.

Thanks for taking the time to do this, you're doing the lord's work.

yoyodyne/evaluators.py

yoyodyne/models/hard_attention.py

kylebgorman added 8 commits November 30, 2024 13:20

Expands list of separate features models

8c6a029

Attempts to fix batch prediction support

e9fb0b0

Update indexes.py

8f057c0

Update README.md

33e4bda

Standardizes name: "END" not "EOS".

9689635

Bugfixes.

8e794b3

More bugfixes

4135e04

Late-breaking improvements

0b094fd

kylebgorman marked this pull request as ready for review December 1, 2024 02:17

kylebgorman requested a review from Adamits December 1, 2024 02:17

kylebgorman added the bug Something isn't working label Dec 1, 2024

Adamits approved these changes Dec 2, 2024

fix 1

e32444f

kylebgorman merged commit 0a91f56 into CUNY-CL:master Dec 2, 2024
8 checks passed

kylebgorman deleted the predict2 branch December 2, 2024 16:17
