-
Notifications
You must be signed in to change notification settings - Fork 852
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AraGen v2 and Arabic IFEval #2786
AraGen v2 and Arabic IFEval #2786
Conversation
fix space display
fix second heatmap image display
fix images display
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As a general comment, and not being familiar with the nomenclature, I find it a bit difficult to navigate through the concepts. For example, AraGen is a leaderboard, which is based on a dataset (also called AraGen, if I'm understanding correctly) and an evaluation method that has been reworked. Not sure if AraGen 2 is the same as AraGen-03-25 or the combination of several things.
Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
@pcuenca, AraGen is first a benchmark, but the previous space was named AraGen as well. To resolve this confusion, we renamed the whole space to "Arabic-Leaderboards" instead of "AraGen-Leaderboard" and we will be adding more tasks into the same space to centralize the efforts. AraGen 2 is the same as AraGen-03-25 and the name was replaced in the blog to resolve the confusion. Thank you @pcuenca, i will resolve the current conflicts in |
@pcuenca I believe all your comments are addressed now and good to go. Thanks again for your help |
change date Co-authored-by: Pedro Cuenca <[email protected]>
* Create leaderboard-3c3h-aragen-ifeval.md * Update _blog.yml * Update leaderboard-3c3h-aragen-ifeval.md fix space display * Update leaderboard-3c3h-aragen-ifeval.md fix second heatmap image display * Update leaderboard-3c3h-aragen-ifeval.md fix images display * Update leaderboard-3c3h-aragen-ifeval.md Co-authored-by: Pedro Cuenca <[email protected]> * Update leaderboard-3c3h-aragen-ifeval.md Co-authored-by: Pedro Cuenca <[email protected]> * Update leaderboard-3c3h-aragen-ifeval.md Co-authored-by: Pedro Cuenca <[email protected]> * Update leaderboard-3c3h-aragen-ifeval.md Co-authored-by: Pedro Cuenca <[email protected]> * Update leaderboard-3c3h-aragen-ifeval.md * Update _blog.yml change date Co-authored-by: Pedro Cuenca <[email protected]> --------- Co-authored-by: Pedro Cuenca <[email protected]>
In this PR we seek to merge the blog introducing the 1st Arabic Instruction Following Bench/Leaderboard
@pcuenca and @ariG23498, you assistance in this PR is much appreciated.
@clefourrier and @albertvillanova, if you have some buffer, your review here is much appreciated 🤗
cc: @Sarah-albarri @samta-kamboj