-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Restart Catalogue #272
Comments
Thanks @charles-turner-1 for making this issue and pointing out the possibilities for how it might work. I'll ping @jo-basevi and @tmcadam here as this is part of the experiment provenance and tracking work. Would it help to have some example restarts to index for testing purposes? We're probably still lacking some important metadata (experiment and run IDs) in the files themselves, see payu-org/payu#510, so that might be a blocker until remedied. |
For my own edification (and the future software peeps who aren't from a climate background), can someone give me a definitive explanation (or link to the same) of what the 'restarts' are, particularly with respect to how they differ from the 'outputs'? |
I believe that they contain the fields, as they were at the very last timestep of the model run, whereas the outputs tend to be averaged over the output period. So for a model with a 6 hour timestep, we might run up until eg. the end of June, giving us a restart that represents the model state on the 30th of June at 18:00, whereas the June output will be the average conditions over the entirety of June (00:00 1st June - 18:00 30th June). I would confirm this with someone who regularly performs the model runs - this is based off my conversation with @aidanheerdegen last week and some model runs I performed about 6 years ago... |
Yeah that's correct. The models have a The checkpoint step dumps to disk all the prognostic fields that that the model uses internally in time-stepping the models and these can then be read back in when the model restarts. If other users wants to use an experiment as a base from which they create new runs, typically with some perturbed physics, then they also need the restart files from the experiment. Usually not all restarts are retained, as it isn't usually necessary to be able to restart the model from any time during the experiment. |
Is your feature request related to a problem? Please describe.
@aidanheerdegen noted yesterday it might be helpful to have a separate catalogue of restarts, so users would be able to easily access restarts for model runs, but not accidentally access it to avoid confusion.
Describe the feature you'd like
Intake allows a single catalog to describe multiple sources: ie, the
access_nri
andrestart
catalogues could be combined aswhich would then be accessible through
Describe alternatives you've considered
This feature would build on the approach described in #245 - see there for potential pitfalls.
Additional context
The text was updated successfully, but these errors were encountered: