Support multiple completions for ModelbasedClassify #1484

tom-christie · 2024-03-14T18:15:54Z

Describe the feature or improvement you're requesting

It would be nice to be able to score multiple sample completions using ModelBasedClassify. Even if n>1 is passed into a completion function and multiple samples are returned, only the first is graded because of this line:

https://github.com/openai/evals/blob/main/evals/elsuite/utils.py#L193

Additional context

I would like to be able to raise the temperature, ask a model to produce N completions, and have each completion graded separately using a rubric. This appears to work fine for non-model-based scoring.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support multiple completions for ModelbasedClassify #1484

Support multiple completions for ModelbasedClassify #1484

tom-christie commented Mar 14, 2024 •

edited

Loading

Support multiple completions for ModelbasedClassify #1484

Support multiple completions for ModelbasedClassify #1484

Comments

tom-christie commented Mar 14, 2024 • edited Loading

Describe the feature or improvement you're requesting

Additional context

tom-christie commented Mar 14, 2024 •

edited

Loading