EC-Earth3 downscaled precip ssp126 data has nans #581
Comments
It's weird that we're only seeing this in ssp126 and after downscaling... I suppose the first step is to examine the output downscaled data ( Edit:
Okay, after manually searching I was unable to find nulls in the data. I manually reran the quality-control check that failed, and it passed. So it appears that the data is good. It might have been an issue with transferring Zarr data from GCS. The retry-backoff for the QC steps starts at 5 seconds. It might be worth increasing that to 30 seconds if this really is the problem. Other than that, the run and data appear fine.
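To make the 5-second versus 30-second comparison concrete, here is a minimal sketch of how the starting backoff changes the total time a step waits across its retries. The doubling factor and the retry count of 4 are assumptions for illustration; the thread only gives the starting values, and `backoff_delays` is a hypothetical helper, not part of the workflow tooling.

```python
def backoff_delays(initial_s, factor=2, retries=4, max_s=None):
    """Return the wait in seconds before each retry attempt.

    factor, retries, and max_s are illustrative assumptions,
    not values taken from the actual workflow definition.
    """
    delays = []
    delay = initial_s
    for _ in range(retries):
        if max_s is not None:
            delay = min(delay, max_s)  # optional cap on a single wait
        delays.append(delay)
        delay *= factor  # exponential growth between attempts
    return delays


print(backoff_delays(5))        # [5, 10, 20, 40]
print(backoff_delays(30))       # [30, 60, 120, 240]
print(sum(backoff_delays(5)))   # 75
print(sum(backoff_delays(30)))  # 450
```

Under these assumptions, starting at 30 seconds gives the storage backend six times as long to settle before the step finally gives up.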
Thanks for digging, @brews. I'm not sure I see how changing the retry-backoff would have fixed that, though? In general, it sounds like we can retry this run. I can do it if you're not on it already.
@emileten I don't know for sure. This is all speculation on my part. Zarr returns NaNs for missing chunks, so it might be that GCS is slow, busy, or limiting I/O. Waiting may give GCS a better chance to "empty its buffer" and catch up, or for other I/O or network-intensive work to complete. We've generally used a backoff of 15 to 30 seconds in other workflow steps, but this QC step starts at only 5 seconds. First, I'll see if I can get it to pass just by retrying the existing workflow.
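The missing-chunk hypothesis can be illustrated with a toy model. This is plain Python, not the Zarr API: `read_array` and `count_nans` are hypothetical helpers, and the dict stands in for a chunk store where one chunk is not yet visible to the reader. It assumes the array's fill value is NaN, as the comment above describes.

```python
import math


def read_array(chunks, n_chunks, chunk_len, fill_value=math.nan):
    """Toy chunked read: an absent chunk is filled in, not raised as an error."""
    out = []
    for i in range(n_chunks):
        out.extend(chunks.get(i, [fill_value] * chunk_len))
    return out


def count_nans(values):
    """Count NaN entries, similar in spirit to a QC null check."""
    return sum(1 for v in values if isinstance(v, float) and math.isnan(v))


complete = {0: [1.0, 2.0], 1: [3.0, 4.0], 2: [5.0, 6.0]}
# Same data, but chunk 1 is temporarily unavailable to the reader.
partial = {k: v for k, v in complete.items() if k != 1}

print(count_nans(read_array(complete, 3, 2)))  # 0
print(count_nans(read_array(partial, 3, 2)))   # 2
```

In this model the stored data is never wrong; the check fails only while a chunk is unreadable, which would be consistent with a rerun passing on the same data.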
After retrying the same workflow, it succeeded. I'm going to close this issue unless you have other ideas, concerns, or suggestions, @emileten.
Wait, does it resume where it failed last time when using (Edit: don't have anything useful to add of course... Thanks!)
@brews I am surprised we find NaNs after cleaning. Is it possible that we create NaNs during the pipeline? I was tempted to re-run this; maybe something just went wrong in that run...
Workflow: https://argo.cildc6.org/archived-workflows/default/ae0379ac-6f74-47af-bd91-1c4e6266bf11
Log: