Include "time" as option to save_strategy (and log and eval too!) #36310

Open
davidhughhenrymack opened this issue Feb 20, 2025 · 2 comments
Labels
Feature request Request for a new feature

Comments

@davidhughhenrymack

Feature request

When building a training config for the Trainer, I'd love to be able to do something akin to:

training_args = SFTConfig(
    eval_strategy="time",
    eval_minutes=15,
    save_strategy="time",
    save_minutes=30,
)

Motivation

My motivation for this is that I'm always using steps as a proxy for time, but this is painful when I run on different GPUs or, most commonly, run the code locally (MacBook Pro) to check that it runs, then run it remotely on a beefy GPU.

For saving, I want to "not lose significant amounts of work", and for me this is always time-based (e.g. I don't want to wait for 6 hours only to see the model crash and lose all progress at hour 5), since both my waiting and my paying for compute are time-based. Similarly for eval, I don't want to eval every minute (expensive!) but do want to eval often enough to know the model is still progressing. A time-based strategy solves both.
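
As a rough illustration of the behaviour described above, here is a minimal sketch of how time-based saving and evaluation might be approximated today with a custom callback, without any new `save_strategy` value. It relies on `TrainerCallback.on_step_end` and the `should_save` / `should_evaluate` flags on the control object, which exist in transformers; the class name `TimeBasedCallback`, its `save_minutes` / `eval_minutes` / `clock` parameters, and the import fallback are assumptions made for this example, and the sketch has not been tested inside a real training run.

```python
import time

try:
    from transformers import TrainerCallback
except ImportError:  # fallback so the sketch can be exercised without transformers
    TrainerCallback = object


class TimeBasedCallback(TrainerCallback):
    """Hypothetical callback: request a save/eval every N minutes of wall-clock time."""

    def __init__(self, save_minutes=None, eval_minutes=None, clock=time.monotonic):
        # Intervals are stored in seconds; None disables that trigger.
        self.save_seconds = None if save_minutes is None else save_minutes * 60
        self.eval_seconds = None if eval_minutes is None else eval_minutes * 60
        self.clock = clock  # injectable so the logic can be checked with a fake clock
        self.last_save = None
        self.last_eval = None

    def on_train_begin(self, args, state, control, **kwargs):
        # Start both timers when training starts.
        self.last_save = self.last_eval = self.clock()
        return control

    def on_step_end(self, args, state, control, **kwargs):
        now = self.clock()
        if self.save_seconds is not None and now - self.last_save >= self.save_seconds:
            control.should_save = True  # ask the training loop to checkpoint after this step
            self.last_save = now
        if self.eval_seconds is not None and now - self.last_eval >= self.eval_seconds:
            control.should_evaluate = True  # ask the training loop to evaluate after this step
            self.last_eval = now
        return control
```

It would presumably be passed as `callbacks=[TimeBasedCallback(save_minutes=30, eval_minutes=15)]` when constructing the trainer. One caveat a built-in implementation would need to address: in multi-process training each process has its own clock, so the decision to save or evaluate would likely have to be made on one rank and shared with the others.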

Your contribution

I'm down to write code for this! I don't know the deeper architectural considerations, so I'd greatly welcome the community's insights. Thanks :)

davidhughhenrymack added the "Feature request" label on Feb 20, 2025
@Rocketknight1
Member

cc @SunMarc @muellerzr

@SunMarc
Member

SunMarc commented Feb 21, 2025

That could be a nice feature if you feel that steps or epochs are sometimes a bit tricky to set correctly. Feel free to submit a PR for that.

3 participants