This is the implementation of the paper Efficient Beam Search for Large Language Models Using Trie-Based Decoding
https://arxiv.org/pdf/2502.00085
Results in the paper can be reproduced in the folder reproduction
.
Data analysis and visualization utilities are located in the analysis
directory. Outputs from these scripts are written to analysis/results
by
default.