Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add Error Handling for Stale Issue Script in GitHub Action
#2258 opened Oct 21, 2024 by Ananya54321 Loading…
2 of 5 tasks
Adjust padding in batch generation 🐛 bug Something isn't working
#2251 opened Oct 18, 2024 by gaetanlop Loading…
3 tasks done
Conversational dataset support for KTOTrainer
#2248 opened Oct 18, 2024 by qgallouedec Loading…
5 tasks
Data mixer Integration
#2240 opened Oct 16, 2024 by August-murr Draft
3 of 5 tasks
[online-DPO] evaluaiton step error 🐛 bug Something isn't working
#2231 opened Oct 15, 2024 by kashif Loading…
Add VAS to TRL ✨ enhancement New feature or request
#2195 opened Oct 7, 2024 by idanshen Loading…
[CGPO] CGPO Trainer (single task single objective) ✨ enhancement New feature or request
#2190 opened Oct 6, 2024 by gaetanlop Draft
9 of 12 tasks
Change KTO tokenization to use DPO's 🏋 KTO Related to KTO
#2187 opened Oct 6, 2024 by kawine Loading…
[CGPO] Mixture of judges 👨‍⚖️ judge Related to judges
#2159 opened Oct 3, 2024 by gaetanlop Loading…
4 tasks done
Remove graph breaks for torch.compile() in padding free branch in DataCollatorForCompletionOnlyLM 🐛 bug Something isn't working 🏋 SFT Related to SFT
#2158 opened Oct 3, 2024 by Abhishek-TAMU Loading…
1 of 5 tasks
populate SUPPORTED_COMMANDS cli
#2157 opened Oct 2, 2024 by grumpyp Loading…
4 of 5 tasks
[Open discusion] Multistep dataset
#2148 opened Oct 1, 2024 by qgallouedec Draft
4 tasks
DPO trainer supports num_logits_to_keep to save memory 🏋 DPO Related to DPO
#2129 opened Sep 26, 2024 by xyangk Loading…
3 of 5 tasks
Process-supervised RM Trainer
#2127 opened Sep 26, 2024 by gaetanlop Draft
5 tasks done
[SCoRE] initial score stage 1
#2115 opened Sep 24, 2024 by kashif Draft
Remove deprecated args in trainers
#2036 opened Sep 8, 2024 by qgallouedec Draft
5 tasks
feat: add support for packing tokenized datasets
#2011 opened Sep 3, 2024 by kmehant Loading…
3 of 5 tasks
allow masking on consecutive messages with same roles
#2000 opened Aug 31, 2024 by lsy641 Loading…
4 of 5 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.