[draft] Use vLLM in LogCompletionsCallback #2797

tchang1997 · 2025-02-07T15:41:10Z

What does this PR do?

Implements changes to completion-logging callback suggested in #2786.

Normally, LogCompletionsCallback calls a helper function called _generate_completions. However, this uses the default model.generate, which can be slower than vLLM. This PR:

adds a helper called _generate_completions_vllm with equivalent functionality
adds a switch to on_step_end in LogCompletionsCallback to choose whether to use vLLM or not based on args.use_vllm from the Trainer
adds a utility function to [partially] convert the GenerationConfig normally passed to __init__ in the callback to SamplingParams compatible with vLLM.

I have not fully written tests yet (hence the draft), or fully checked compatibility with non-GRPOTrainer trainers, but this yields identical logging as the original LogCompletionsCallback in my own training scripts. Posting PR draft here for early feedback.

Fixes # (issue)

#2786

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines.
[in progress] Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

tchang1997 added 2 commits February 6, 2025 16:48

use vLLM when possible in LogCompletionsCallback

ce1b47d

merge w/ main and combine trainer/utils.py edits

75b2625

tchang1997 marked this pull request as draft February 7, 2025 15:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[draft] Use vLLM in LogCompletionsCallback #2797

[draft] Use vLLM in LogCompletionsCallback #2797

tchang1997 commented Feb 7, 2025

[draft] Use vLLM in LogCompletionsCallback #2797

Are you sure you want to change the base?

[draft] Use vLLM in LogCompletionsCallback #2797

Conversation

tchang1997 commented Feb 7, 2025

What does this PR do?

Before submitting

Who can review?