-
Notifications
You must be signed in to change notification settings - Fork 15
Conversation
- Split into seperate files - Use list in config to add callbacks - Provide legacy config enabled approach - Fix ruff issues
At the moment, this is the proposed refactor, I am yet to complete an exhaustive test of the changes |
Great work, thank you for taking this on. I was thinking that it might be nice to make this fully configurable through instantiate. For example, no one is really using the stochastic weight averaging as far as I know, so having specific config entries for this is a bit of feature bloat. Then the list of callbacks would just look like this:
This makes it more extensible and actually reduces some of or less used config entries. Additionally, we can keep the standard callbacks, like model checkpoints as "permanent callback" (I don't think we have to make everything optional). One idea I also had is that we could make a special list for "plot_callbacks" in the same style. Then we can easily keep the super convenient "plots.enabled = False" as a shortcut to disable them? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @HCookie, thanks for taking on the callbacks!
It's already much better, great work on that. I think we can take the refactor even further and make the callbacks (almost?) fully modular, which would be incredible for future extensibility.
One comment regarding the file names. So far we haven't been using <xyz>-ing.py
as language. Especially "checkpointing" would be confusing with activation checkpointing (although that is and will stay confusing honestly). Can we rename these please?
for more information, see https://pre-commit.ci
- Prefill config with callbacks - Warn on deprecations for old config - Expand config enabled - Add back SWA - Fix logging callback - Add flag to disable checkpointing - Add testing
[feature] Fix trainable attribute callbacks
Co-authored-by: Sara Hahner <[email protected]>
In general, this looks good to me. The new layout of the config files is intuitive. Thank you for the work that you have put into this. |
That issue with mlflow is addressed in #91. So once that is merged, the config will be accessible in a dump or fully expanded |
a3f7e00
to
30dfd45
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for incorporating the requested changes. This looks good to me now.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me.
* Refactor Callbacks - Split into seperate files - Use list in config to add callbacks - Split out plotting callbacks config * Refactor rollout (#87) - New rollout central function --------- Co-authored-by: Mario Santa Cruz <[email protected]> Co-authored-by: Sara Hahner <[email protected]>
New Usage
Set
config.diagnostics.callbacks
to a list of callback names to includeCloses #59, #45
📚 Documentation preview 📚: https://anemoi-training--60.org.readthedocs.build/en/60/