Pull requests: huggingface/trl

Pull requests list

Liger GRPO support
#2926 opened Feb 21, 2025 by SalmanMohammadi Draft
4 tasks
updated DPO default values for alpha and tau
#2918 opened Feb 20, 2025 by Ishan-Kumar2
4 of 5 tasks
Update kto_config.py
#2912 opened Feb 20, 2025 by sileod
5 tasks
Remove CUDA synchronization in mean_token_accuracy
#2902 opened Feb 19, 2025 by cyyever
1 task done
parameterize enable_prefix_caching
#2900 opened Feb 19, 2025 by ji-huazhong
1 of 5 tasks
Fast dataset truncate in SFTTrainer
#2898 opened Feb 18, 2025 by mariosasko
1 of 5 tasks
[GRPO] Log completions at every step
#2893 opened Feb 18, 2025 by lidiya-co
1 of 5 tasks
BCOTrainer version upgrade fixes
#2867 opened Feb 15, 2025 by claralp
3 of 5 tasks
grpo_error (label: 😴 stale; no update from the author, will be closed soon)
#2841 opened Feb 12, 2025 by huihuiustc
3 of 5 tasks
Add GRPO Trainer support for third-party accelerators
#2836 opened Feb 12, 2025 by ji-huazhong
1 of 5 tasks
[WIP] [Liger] Liger KTO support
#2812 opened Feb 10, 2025 by vaibhavjindal Draft
5 tasks
GRPO Environments for custom multi-step rollouts (vLLM-only)
#2810 opened Feb 9, 2025 by willccbb
5 tasks done
[draft] Use vLLM in LogCompletionsCallback
#2797 opened Feb 7, 2025 by tchang1997 Draft
2 of 4 tasks
Remote GRPO ref model
#2763 opened Feb 4, 2025 by edbeeching Draft
WIP: RLOOV2
#2724 opened Jan 31, 2025 by mnoukhov Draft
3 tasks
🔧 Optimize GRPO VRAM Usage
#2669 opened Jan 27, 2025 by andyl98
2 of 5 tasks