Add Data Explorer Tab#3632
Conversation
|
You're amazing! So fast! |
|
Really wonderful job :) will review |
|
Looks wonderful. One bug: Dark mode causes issues because the text doesn't turn white. Can you fix? @ygtangg
|
|
I fixed the dark mode issue on the website. No change to the iframe link is needed. |
|
Where is fix? I don't see another commit or PR. |
|
I changed the html website, whose files are in the arena-leaderboard-v2 repo. Do you think we should move it in here as well? |
|
Oh got it! |
|
LGTM. @infwinston ? |
|
this is suuuuper cool! @aangelopoulos @infwinston lmk if yall need any help to get this merged, this will be an amazing feature |
| model_keys = ['chatgpt-4o-latest', 'gemini-1.5-pro-exp-0827','gpt-4o-mini-2024-07-18','claude-3-5-sonnet-20240620','gemini-1.5-flash-exp-0827','llama-3.1-405b-instruct','gemini-1.5-pro-api-0514','mistral-large-2407','reka-core-20240722','gemini-1.5-flash-api-0514', 'deepseek-coder-v2-0724','yi-large','llama-3-70b-instruct','qwen2-72b-instruct','claude-3-haiku-20240307','llama-3.1-8b-instruct','mistral-large-2402','command-r','mixtral-8x22b-instruct-v0.1','gpt-3.5-turbo-0613'] | ||
| output_tokens_per_USD = [66.66666667000001,200.0,1666.666667,66.66666667000001,3333.333333,333.3333333,200.0,166.6666667,166.6666667,3333.333333,3333.333333,333.3333333,1265.8227849999998,1111.111111,800.0,11111.11111,166.6666667,666.6666667,166.6666667,500.0] | ||
| score=[1316.1559008799543,1300.8583398843484,1273.6004783067303,1270.113546648134,1270.530573909608,1266.244657076764,1259.2844314017723,1249.8268751367714,1229.2148108171098,1226.8769924152105,1214.5634252743123,1212.4668382698005,1206.3236747009742,1186.7832147344182,1178.5484948812955,1167.8793593807711,1157.271872307139,1148.6665817312062,1147.0325504217642,1117.0289441863001] | ||
| fig = px.scatter(x=output_tokens_per_USD, y=score, title="Quality vs. Cost Effectiveness", labels={ | ||
| "output_tokens_per_USD": "# of output tokens per USD (in thousands)", | ||
| "score": "Arena Score"}, log_x=True, text=model_keys) |
There was a problem hiding this comment.
Let's probably not push this info here. it'll be hard to maintain/update.
There was a problem hiding this comment.
yes yes, will upload plotly graph to the google storage and then embed with iframe
There was a problem hiding this comment.
No, I don't think so. It's added when I run the server. I can remove it.
Co-authored-by: Sophie Xie <sxie2@berkeley.edu>

Why are these changes needed?
This Data Explorer provides a visually engaging and interactive tool that allows users to explore and draw insights from the leaderboard (conversation) data. It fosters transparency in the ranking process and enhances users’ trust in our leaderboard.


(link to the explorer website: link)
Related issue number (if applicable)
Checks
format.shto lint the changes in this PR.