Skip to content

Conversation

@silasvb
Copy link
Collaborator

@silasvb silasvb commented Oct 15, 2025

Description

Rather than creating PNG files for each LLM, which make comparison between LLMs difficult, this changes the ragas_evaluation script to take a new argument --llm <name>, and appends all results to a CSV file - This means that retrospective graph generation can be done easily.

It also fixes LLM usage statistics, which are stored in the dataframe as a dictionary, rather than as individual columns

Changelog

@silasvb silasvb force-pushed the OSAI-101/compare-llms branch from f0b7784 to 03461d0 Compare October 15, 2025 13:51
Copy link
Collaborator

@osouthwell-scottlogic osouthwell-scottlogic left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good

Copy link
Collaborator

@osouthwell-scottlogic osouthwell-scottlogic left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice idea to use a note book

Copy link
Collaborator

@osouthwell-scottlogic osouthwell-scottlogic left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Re-approval

@silasvb silasvb merged commit 135313f into main Nov 5, 2025
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants