-
Notifications
You must be signed in to change notification settings - Fork 39
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New preprocessor: Distance metrics dataset vs. reference #2266
Comments
yes I think @bouweandela plopped a Pearson test in an example recipe once. And I remember I barked at the poor guy that we should probably look at a Kolmogorov-Smirnoff distance plot better than a Pearson one 😁 |
Draft PR is available here: #2299 |
Would |
I actually like having the expected type of the variable in its name (here: |
But what if at some point in the future we would like to make it possible that the reference could be something else too? Using the type in the name was quite popular at some point (just Google Hungarian notation), not it's not considered good practice anymore, mostly because it makes it difficult to change code over time. |
Well, then it seems that almost all of our preprocessor functions use that bad practice (most use the argument Anyway, will change this. |
I know, I couldn't think of anything better at the time. |
Is your feature request related to a problem? Please describe.
To facilitate the comparison of different models against each other or against obersvations, it would be really helpful to have a preprocessor that calculates distance metrics of two datasets. Example for distance metrics can be the RMSE or the Pearson correlation coefficient.
I am thinking of the following signature for the new preprocessor:
When used within a recipe, reference datasets should be marked with
reference_for_metric: true
. This signature also allows an easy usage of the preprocessor outside of ESMValToolWould you be able to help out?
Yes!
@axel-lauer @hb326 @LisaBock
The text was updated successfully, but these errors were encountered: