-
Notifications
You must be signed in to change notification settings - Fork 319
mcts #45
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@xjdr-alt colour code: current sampling (original): note that the name is wrong scenario 1: static beam search on blue only: note that the name is wrong i think the difference between the two is marginal, albeit i like beam is a little bit better. Scenario 2: apply adaptive beam search to red (high / high): note that the name is right Scenario 3: apply blue, static beam search and adaptive red: note that the name is right i like scenario 3 the most. |
How about also using the base model, since it usually has better logprobs than what instruction turning messes the model up with. |
mcts to low ent / high vent branching.
The text was updated successfully, but these errors were encountered: