-
Notifications
You must be signed in to change notification settings - Fork 898
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add ChatEvaluator as a composite evaluator for chat scenario evaluation #2562
Add ChatEvaluator as a composite evaluator for chat scenario evaluation #2562
Conversation
src/promptflow-evals/promptflow/evals/evaluators/chat/__init__.py
Outdated
Show resolved
Hide resolved
src/promptflow-evals/promptflow/evals/evaluators/chat/__init__.py
Outdated
Show resolved
Hide resolved
] | ||
score = chat_eval(conversation=conversation) | ||
print(score) | ||
# {'gpt_coherence': 5.0, 'gpt_coherence_per_turn': [5.0, 5.0], 'gpt_fluency': 5.0, 'gpt_fluency_per_turn': [5.0, 5.0], 'gpt_groundedness': 5.0, 'gpt_groundedness_per_turn': [5.0, 5.0], 'gpt_relevance': 5.0, 'gpt_relevance_per_turn': [5.0, 5.0]} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will this be in same order as conversation ? @qusongms how does UI show the per turn score right now ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, it will be in the same order of the conversation.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ninghu are we giving a reason as well for the score ? For chat we do see reasons being presented
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approving with a follow up:
- Turn level scores and visualization
Description
Please add an informative description that covers that changes made by the pull request and link all relevant issues.
All Promptflow Contribution checklist:
General Guidelines and Best Practices
Testing Guidelines