Skip to content

0.2.1 - Bugfix: resuming mlflow runs

Compare
Choose a tag to compare
@gmertes gmertes released this 24 Oct 14:09
· 64 commits to main since this release
84fd53b

What's Changed

Added

  • Mlflow-sync to include new tag for server to server syncing #83
  • Mlflow-sync to include functionality to resume and fork server2server runs #83
  • Rollout training for Limited Area Models. #79
  • Feature: New Boolean1DMask class. Enables rollout training for limited area models. #79

Fixed

  • Mlflow-sync to handle creation of new experiments in the remote server #83
  • Fix for multi-gpu when using mlflow due to refactoring of _get_mlflow_run_params function #99
  • ci: fix pyshtools install error #100

Changed

  • Update copyright notice

Commit Log

Full Changelog: https://github.com/ecmwf/anemoi-training/blob/develop/CHANGELOG.md#021---bugfix-resuming-mlflow-runs---2024-10-24