This repository has been archived by the owner on Jun 24, 2024. It is now read-only.

Update to llm-samplers v0.0.7 #440

Merged 2 commits on Nov 9, 2023

Conversation

KerfuffleV2
Contributor

See KerfuffleV2/llm-samplers#9 for more information about the changes.

Notable features are adding Top-A and Min-P samplers.
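To illustrate what the new Min-P sampler does (this is a hypothetical sketch of the general Min-P idea, not the actual llm-samplers implementation or API): tokens are kept only if their probability is at least `min_p` times the probability of the most likely token.

```rust
// Hypothetical sketch of Min-P filtering; `min_p_filter` is an
// illustrative helper, not part of llm-samplers.
fn min_p_filter(probs: &[f32], min_p: f32) -> Vec<usize> {
    // The threshold is relative to the most probable token.
    let max_p = probs.iter().cloned().fold(f32::MIN, f32::max);
    let threshold = min_p * max_p;
    probs
        .iter()
        .enumerate()
        .filter(|(_, &p)| p >= threshold)
        .map(|(i, _)| i)
        .collect()
}

fn main() {
    let probs = [0.5, 0.3, 0.15, 0.05];
    // With min_p = 0.2 the cutoff is 0.2 * 0.5 = 0.1,
    // so the last token (p = 0.05) is dropped.
    let kept = min_p_filter(&probs, 0.2);
    assert_eq!(kept, vec![0, 1, 2]);
    println!("{:?}", kept);
}
```

Top-A works similarly but scales the threshold by the square of the top probability, so it prunes more aggressively when the model is confident.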

@KerfuffleV2 KerfuffleV2 marked this pull request as ready for review November 6, 2023 23:55
@KerfuffleV2
Contributor Author

This shouldn't be merged until I release 0.0.7 (likely in the next couple of days), but I think it's ready for review in case there are any changes needed. After the release, I'll update Cargo.toml to use that instead of pointing at the repo.

I added a new way to build the logits that prunes them (like Top-K) and they start out sorted which can be a big performance win. For example, doing logits::try_from_iter_top_k(blah, 1000) only takes the top 1,000. The remainder are not likely to ever be selected by sampling. Let me know if you want me to add a commandline option or something to enable that. There's some discussion here: KerfuffleV2/llm-samplers#9 (comment)
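The pruning idea described above can be sketched in plain Rust (a hypothetical illustration of the technique, not the actual `Logits::try_from_iter_top_k` implementation): keep only the k largest logits, already sorted descending, so downstream samplers operate on a small slice instead of the full vocabulary.

```rust
// Hypothetical sketch of top-k logit pruning; `top_k_prune` is an
// illustrative helper, not part of llm-samplers.
fn top_k_prune(mut logits: Vec<(u32, f32)>, k: usize) -> Vec<(u32, f32)> {
    // Sort by logit value, descending; NaN logits are assumed absent.
    logits.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap());
    // Discard everything past the top k entries.
    logits.truncate(k);
    logits
}

fn main() {
    let logits: Vec<(u32, f32)> = vec![(0, 0.1), (1, 3.2), (2, -1.0), (3, 2.5)];
    let pruned = top_k_prune(logits, 2);
    // Only the two highest-scoring token ids remain, in sorted order.
    assert_eq!(pruned, vec![(1, 3.2), (3, 2.5)]);
    println!("{:?}", pruned);
}
```

The payoff is that later samplers no longer scan the entire vocabulary, and because the survivors are already sorted, samplers that need sorted input can skip their own sort.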

@KerfuffleV2
Contributor Author

@philpax Could you please take a look at this one? (I don't seem to have access to request a review.)

@philpax philpax self-requested a review November 9, 2023 21:25
Collaborator

@philpax philpax left a comment

Seems pretty reasonable - nice work on removing the type parameters from Sampler! Is there anything else you'd like to do, or should I merge it?

@KerfuffleV2
Contributor Author

Thanks for checking. It should be all set to merge as far as I know, as long as you're satisfied with the changes. Note that it passes the various tests, but actual usage wasn't extensively tested, so I'd recommend running it on a model and making sure you get reasonable results. I don't have a lot of old GGML-format models lying around.

@philpax
Collaborator

philpax commented Nov 9, 2023

Tested and it seems to work. Thanks for your work!

@philpax philpax merged commit 23e4b46 into rustformers:main Nov 9, 2023
14 checks passed
@KerfuffleV2
Contributor Author

Not a problem. If you ever find that sampling performance has a measurable impact, you can try the Logits::try_from_iter_top_k method. If you set k to 2,000 or so, it's extremely unlikely to affect results and can increase performance a lot, especially for models with a very large vocabulary (there are a few with 250K+ tokens).
