Issues: openai/evals
#1571 · [_common.py:105] Backing off create_retrying(...) for 1.3s (False) · bug · opened Dec 4, 2024 by Khyathianeesha
#1570 · Request for Global Memory Across Different Chats (Across Sessions) · opened Nov 14, 2024 by rodrigoreis
#1569 · Vector mapping, origins, 2-3-4D definitions. · bug · opened Nov 11, 2024 by Really-69
#1567 · Project installation fails · tensorflow, conflicting dependencies, bug · opened Nov 7, 2024 by djbb7
#1566 · ERROR: Failed building wheel for numpy. clang error: compiler does not support 'faltivec' · bug · opened Nov 6, 2024 by jinchi2013
#1564 · AttributeError: module 'openai' has no attribute 'error' · bug · opened Oct 17, 2024 by sahilrajput03
#1562 · Is Evals repo being replaced by the Evaluations feature in the Playground? · opened Oct 1, 2024 by sakher
#1556 · o1 release breaks token usage stats · bug · opened Sep 13, 2024 by lucapericlp
#1543 · Information exposure alert through an exception · bug · opened Aug 8, 2024 by arpitjain099
#1536 · Multiple Unit Test Failures Across OpenAI Assistants, Anthropic, and Google Gemini Libraries · bug · opened Jun 24, 2024 by sakher
#1504 · Setting completion function args via CLI does not work · bug · opened Mar 27, 2024 by LoryPack
#1493 · OpenAIChatCompletionFn __init__ should accept **kwargs · bug · opened Mar 15, 2024 by ezraporter
#1419 · Proposal for Adding a New Evaluation Metric: Sentiment Analysis Accuracy · opened Nov 23, 2023 by Sarfaraz021
#1393 · Using different models in evaluating model-graded eval and in generating the completion · opened Nov 3, 2023 by LoryPack
#1386 · In the task "balance_chemical_equation", many instances have incorrect labels. · bug · opened Oct 19, 2023 by dongZheX
#1384 · Eval-running often hangs on last sample · bug · opened Oct 17, 2023 by sjadler2004