Skip to content
This repository has been archived by the owner on Oct 2, 2024. It is now read-only.

Make BBQ prompts identical to HELM's version #39

Merged
merged 29 commits into from
Jan 18, 2024

Conversation

brianwgoldman
Copy link
Contributor

In PR #33 I mentioned the only difference was how training examples get sampled. In this PR I'm porting the logic from HELM for how to do sampling.

My goal is to get NewHELM to produce the exact same value for the BBQ stats when using GPT2, as a way to ensure we have a fully functioning replacement.

In HELM this is the most common type of prompt. It is also how BBQ works, so I'll need it in fleshing out that Test.
Before this change the difference was how they sampled in context learning examples. I've updated that to match.
@brianwgoldman brianwgoldman requested a review from a team as a code owner January 11, 2024 20:11
Copy link

github-actions bot commented Jan 11, 2024

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

Copy link
Contributor

@yifanmai yifanmai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should have mentioned this earlier, but an alternative to copy and pasting the sampling algorithm would be to hardcode in the indexes of the sampled training examples.

@yifanmai
Copy link
Contributor

Also, the sampled test items will still be different, right? I think it's sufficient to get close enough to BBQ, without needing to reproduce it exactly.

@brianwgoldman
Copy link
Contributor Author

Also, the sampled test items will still be different, right? I think it's sufficient to get close enough to BBQ, without needing to reproduce it exactly.

The test items are still the same as long as max_eval_instances is 1000 or more. In that situation, HELM does no sampling or shuffling of the eval instances.

@brianwgoldman brianwgoldman merged commit 29626dc into main Jan 18, 2024
2 checks passed
@brianwgoldman brianwgoldman deleted the auxy/rewrite-example-sampling branch January 18, 2024 00:12
@github-actions github-actions bot locked and limited conversation to collaborators Jan 18, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants