Skip to content

Commit

Permalink
Merge pull request #278 from sergiopaniego/llm-grpo-trl
Browse files Browse the repository at this point in the history
Added new `Post-training an LLM using GRPO with TRL` recipe 🧑‍🍳️
  • Loading branch information
stevhliu authored Feb 5, 2025
2 parents 1c90308 + 0e45d15 commit 063ca3e
Show file tree
Hide file tree
Showing 3 changed files with 895 additions and 3 deletions.
4 changes: 3 additions & 1 deletion notebooks/en/_toctree.yml
Original file line number Diff line number Diff line change
Expand Up @@ -74,7 +74,9 @@
title: Phoenix Observability Dashboard on HF Spaces
- local: search_and_learn
title: Scaling Test-Time Compute for Longer Thinking in LLMs

- local: fine_tuning_llm_grpo_trl
title: Post training an LLM for reasoning with GRPO in TRL

- title: Computer Vision Recipes
isExpanded: false
sections:
Expand Down
891 changes: 891 additions & 0 deletions notebooks/en/fine_tuning_llm_grpo_trl.ipynb

Large diffs are not rendered by default.

3 changes: 1 addition & 2 deletions notebooks/en/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,12 +7,11 @@ applications and solving various machine learning tasks using open-source tools

Check out the recently added notebooks:

- [Post training an LLM for reasoning with GRPO in TRL](fine_tuning_llm_grpo_trl)
- [Evaluating AI Search Engines with `judges` - the open-source library for LLM-as-a-judge evaluators](llm_judge_evaluating_ai_search_engines_with_judges_library)
- [Structured Generation from Images or Documents Using Vision Language Models](structured_generation_vision_language_models)
- [Vector Search on Hugging Face with the Hub as Backend](vector_search_with_hub_as_backend)
- [Multi-Agent Order Management System with MongoDB](mongodb_smolagents_multi_micro_agents)
- [Scaling Test-Time Compute for Longer Thinking in LLMs](search_and_learn)
- [Signature-Aware Model Serving from MLflow with Ray Serve](mlflow_ray_serve)

You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).

Expand Down

0 comments on commit 063ca3e

Please sign in to comment.