Issues: huggingface/trl
[Tracking issue] Integrate native liger-kernel losses
#2495
opened Dec 17, 2024 by
qgallouedec
[Tracking issue] Wrong loss scaling when accumulating gradient
#2617
opened Jan 23, 2025 by
qgallouedec
Is there any problem with GRPOTrainer’s memory usage?
🏋 GRPO
Related to GRPO
❓ question
Seeking clarification or more information
#2927
opened Feb 21, 2025 by
Tuziking
NCCL timeout when GRPO training with vllm
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#2923
opened Feb 21, 2025 by
edwardzjl
How to support multi-device VLLM inference in the GRPO Trainer
✨ enhancement
New feature or request
🏋 GRPO
Related to GRPO
#2922
opened Feb 21, 2025 by
0x404
simple question: SFTTrainer ValueError
🐛 bug
Something isn't working
🏋 SFT
Related to SFT
#2920
opened Feb 21, 2025 by
jbw3016
Fine tuning "thinking"/"reasoning" models
✨ enhancement
New feature or request
🏋 SFT
Related to SFT
#2919
opened Feb 21, 2025 by
GhostDog98
GRPO for VLM models?
✨ enhancement
New feature or request
🏋 GRPO
Related to GRPO
#2917
opened Feb 20, 2025 by
dipta007
Clarification on KL Divergence Computation in GRPOTrainer
🏋 GRPO
Related to GRPO
❓ question
Seeking clarification or more information
#2914
opened Feb 20, 2025 by
zhaopku
Getting an error while using a PEFT model as a reward model in PPO training.
🐛 bug
Something isn't working
🚀 deepspeed
Related to deepspeed
⚡ PEFT
Related to PEFT
🏋 PPO
Related to PPO
#2911
opened Feb 20, 2025 by
Tarak200
L447 of GRPO trainer 'num_return_sequences=self.num_generations'
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#2910
opened Feb 20, 2025 by
zhengqigao
Cannot import name 'shard_checkpoint' (possibly deprecated in transformers)
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
⚡ PEFT
Related to PEFT
#2909
opened Feb 20, 2025 by
anshuln2
DPOTrainer loss goes down to 0.0 during training, but at the end it reports a train_loss of 0.15 - the loss during training and at the end differ substantially
⚡ accelerate
Related to accelerate
🐛 bug
Something isn't working
🏋 DPO
Related to DPO
⚡ PEFT
Related to PEFT
#2907
opened Feb 19, 2025 by
KemalDrop
How to use GRPOTrainer to train an LLM for code generation? What is the format of the dataset?
#2905
opened Feb 19, 2025 by
xiangxinhello
Save memory when layers are shared with ref model?
🐛 bug
Something isn't working
✨ enhancement
New feature or request
#2904
opened Feb 19, 2025 by
raphael-sch
GRPO completions skip special tokens?
🏋 GRPO
Related to GRPO
#2897
opened Feb 18, 2025 by
MohamedAliRashad
[Qwen2.5] LoRA with SFT seems to be stuck forever with DeepSpeed
🚀 deepspeed
Related to deepspeed
⚡ PEFT
Related to PEFT
🏋 SFT
Related to SFT
#2891
opened Feb 18, 2025 by
sayakpaul
Reuse the logits in _prepare_inputs.
✨ enhancement
New feature or request
🏋 GRPO
Related to GRPO
#2888
opened Feb 18, 2025 by
linkedlist771
Bottleneck in GRPO training
✨ enhancement
New feature or request
🏋 GRPO
Related to GRPO
#2887
opened Feb 18, 2025 by
ZYM66
transformers error when using the official GRPO examples with the exact trainer
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#2886
opened Feb 18, 2025 by
vagitablebirdcode
PPOTrainer save_model function gets an error when saving; no attribute 'zero_gather_16bit_weights_on_model_save'
🐛 bug
Something isn't working
🚀 deepspeed
Related to deepspeed
🏋 PPO
Related to PPO
#2885
opened Feb 18, 2025 by
Havefun404
Reward oscillates while reproducing R1-zero with GRPO
🏋 GRPO
Related to GRPO
🏋 Reward
Related to Reward modelling
#2884
opened Feb 18, 2025 by
Dong237
ORPO Shape Mismatches when using Accelerate/Deepspeed
⚡ accelerate
Related to accelerate
🐛 bug
Something isn't working
🚀 deepspeed
Related to deepspeed
🏋 ORPO
Related to ORPO
#2882
opened Feb 17, 2025 by
dannnnthemannnn
tensor shape error occurs when training with GRPO and use_vllm = False
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#2878
opened Feb 17, 2025 by
Saturnoul
AssertionError in GRPO
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#2877
opened Feb 17, 2025 by
GuodongFan
I have this strange error with GRPO Trainer
🐛 bug
Something isn't working
🏋 GRPO
Related to GRPO
#2876
opened Feb 16, 2025 by
MohamedAliRashad