Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Global MMLU Lite #2567

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

shivalika-singh
Copy link

Hi @baberabb

Reopening the PR for integrating global mmlu with eval harness. I followed the instructions here and made sure pre-commit checks are passing. Hopefully the tests should pass this time.

This PR integrates the "lite" version of global mmlu which contains 200 CS (culturally sensitive) and 200 CA (culturally agnostic) samples across 15 languages with human translations. We recommend using this dataset for evaluating multilingual models and would like to integrate this with eval-harness.

This is the initial version of the PR based on our discussion here. Let me know if any changes are needed before we can merge this.

cc: @marziehf

@CLAassistant
Copy link

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

@baberabb
Copy link
Contributor

@shivalika-singh Thanks for the PR and mostly looks good. Just a couple of nits:

  1. please sign the CLA if you agree. We can't merge without it.
  2. add a readme.
  3. add an entry to tasks/README.md, with a sentence explaining the benchmark like all the other tasks.

I think we can also add a group config. Groups are similar to tags in that they both include multiple tasks, but the former also provides an aggregated metric.

@shivalika-singh
Copy link
Author

shivalika-singh commented Dec 13, 2024

Hi @baberabb , sure I'll update the readme and look into adding group config and update the PR shortly.

Regarding the CLA, I've been trying to sign the CLA since a while. I have agreed to it but somehow it's not getting reflected here. Now when I click the CLA link, it shows me "you have agreed..." and I can't see the button to accept anything anymore (as shown in screenshot).
But still not getting reflected here. I clicked on the "recheck" option quite a few times. Not sure what's the reason. Let me know in case you have any suggestions.

Screenshot 2024-12-13 at 9 46 04 PM

@baberabb
Copy link
Contributor

@shivalika-singh Hey, so the CLA issue is because you pushed from an account different from this one you made the PR on. see: cla-assistant/cla-assistant#661 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants