Include "time" as option to save_strategy (and log and eval too!) #36310

davidhughhenrymack · 2025-02-20T19:00:54Z

Feature request

When building a training config for the Trainer, I'd love to be able to do something like:

training_args = SFTConfig( eval_strategy="time", eval_minutes=15, save_strategy="time", save_minutes=30 )
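To make the request concrete, here is a minimal sketch of the wall-clock check such a strategy would need. It is not an existing transformers feature: `IntervalTrigger` and its `minutes` and `clock` parameters are hypothetical names chosen for illustration, and the clock is injectable only so the logic can be exercised without waiting.

```python
import time
from typing import Callable, Optional


class IntervalTrigger:
    """Hypothetical helper: reports True at most once per `minutes` of wall-clock time."""

    def __init__(self, minutes: float, clock: Callable[[], float] = time.monotonic):
        if minutes <= 0:
            raise ValueError("minutes must be positive")
        self.interval_s = minutes * 60.0
        self.clock = clock  # monotonic by default, so system clock changes do not matter
        self.last_fired: Optional[float] = None

    def start(self) -> None:
        """Begin timing, e.g. when training starts."""
        self.last_fired = self.clock()

    def due(self) -> bool:
        """Return True if the interval has elapsed since the last firing, then reset."""
        if self.last_fired is None:
            # First call without an explicit start(): begin timing now.
            self.start()
            return False
        now = self.clock()
        if now - self.last_fired >= self.interval_s:
            self.last_fired = now
            return True
        return False
```

One possible way to wire this in, as an assumption rather than a settled design: a `TrainerCallback` could call `due()` in `on_step_end` and set `control.should_save = True` (or `control.should_evaluate = True`) when it returns true. In multi-process training each rank has its own clock, so the decision would probably need to be made on one process and shared with the others.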

Motivation

My motivation is that I always use steps as a proxy for time, but this is painful when I run on different GPUs. Most commonly, I run the code locally (MacBook Pro) to check that it works, then run it remotely on a beefy GPU, and the same step count corresponds to very different wall-clock times on each.

For saving, I want to "not lose significant amounts of work", and for me this is always time-based (e.g. I don't want to wait 6 hours only to see the model crash and lose all progress at hour 5), since both my waiting and my paying for compute are time-based. Similarly for eval, I don't want to eval every minute (expensive!) but I do want to eval often enough to know the model is still progressing. A time-based strategy solves both.

Your contribution

I'm down to write code for this! I don't know the deeper architectural considerations, so I'd greatly welcome the community's insights. Thanks :)


Rocketknight1 · 2025-02-21T15:30:08Z

cc @SunMarc @muellerzr

SunMarc · 2025-02-21T16:56:57Z

That could be a nice feature if you feel that steps or epochs are sometimes a bit tricky to set correctly. Feel free to submit a PR for that.

davidhughhenrymack added the "Feature request" (Request for a new feature) label on Feb 20, 2025
