
Add gradient accumulation functionality to SupervisedTrainer #6100

Open
jak0bw opened this issue Mar 3, 2023 · 1 comment

jak0bw commented Mar 3, 2023

I am sorry if I missed any existing functionality or documentation on this topic, but I could not find anything.

Is your feature request related to a problem? Please describe.
SupervisedTrainer is missing built-in gradient accumulation functionality

Describe the solution you'd like
Add gradient accumulation functionality
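
For context, the behaviour being asked for is roughly the plain-PyTorch pattern below (model, optimizer, loss_fn, train_loader and accum_steps are just placeholder names, not MONAI API):

accum_steps = 4
optimizer.zero_grad()
for i, (inputs, targets) in enumerate(train_loader):
    loss = loss_fn(model(inputs), targets)
    # scale the loss so the accumulated gradient matches one large-batch step
    (loss / accum_steps).backward()
    if (i + 1) % accum_steps == 0:
        optimizer.step()       # apply the accumulated update every accum_steps iterations
        optimizer.zero_grad()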

Describe alternatives you've considered
Passing ignite's supervised_training_step (https://pytorch.org/ignite/generated/ignite.engine.supervised_training_step.html) as the iteration_update parameter:

from ignite.engine import supervised_training_step
from monai.engines import SupervisedTrainer, default_prepare_batch
from monai.inferers import SimpleInferer

# device, max_epochs, train_loader, net, optimizer, loss and train_handlers
# are assumed to be defined as usual
trainer = SupervisedTrainer(
    device=device,
    max_epochs=max_epochs,
    train_data_loader=train_loader,
    network=net,
    optimizer=optimizer,
    loss_function=loss,
    inferer=SimpleInferer(),
    key_train_metric=None,
    train_handlers=train_handlers,
    # replace the default MONAI iteration with ignite's update function,
    # which supports gradient accumulation
    iteration_update=supervised_training_step(
        model=net,
        optimizer=optimizer,
        loss_fn=loss,
        device=device,
        gradient_accumulation_steps=4,
        prepare_batch=default_prepare_batch,
    ),
    amp=False,
    postprocessing=None,
)

This kind of works, but it does not feel like the intended way to do it in MONAI: the ignite update function does not fire the correct MONAI events during the iteration, so handlers attached for training are ignored.
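
One possible shape for a built-in solution (only a rough sketch, not an actual implementation: the class name and constructor argument are made up here, and AMP as well as prepare_batch outputs with extra args are ignored) would be a SupervisedTrainer subclass whose _iteration accumulates gradients but still fires MONAI's IterationEvents, so attached handlers keep working:

from monai.engines import SupervisedTrainer
from monai.engines.utils import CommonKeys as Keys, IterationEvents


class AccumulationSupervisedTrainer(SupervisedTrainer):  # hypothetical, not part of MONAI
    def __init__(self, *args, gradient_accumulation_steps: int = 1, **kwargs):
        super().__init__(*args, **kwargs)
        self.gradient_accumulation_steps = gradient_accumulation_steps

    def _iteration(self, engine, batchdata):
        inputs, targets = engine.prepare_batch(batchdata, engine.state.device, engine.non_blocking)
        engine.state.output = {Keys.IMAGE: inputs, Keys.LABEL: targets}
        engine.network.train()

        # reset gradients only at the start of an accumulation window
        if (engine.state.iteration - 1) % self.gradient_accumulation_steps == 0:
            engine.optimizer.zero_grad(set_to_none=True)

        engine.state.output[Keys.PRED] = engine.inferer(inputs, engine.network)
        engine.fire_event(IterationEvents.FORWARD_COMPLETED)
        loss = engine.loss_function(engine.state.output[Keys.PRED], targets).mean()
        engine.state.output[Keys.LOSS] = loss
        engine.fire_event(IterationEvents.LOSS_COMPLETED)

        # scale so the accumulated gradient matches a single large-batch gradient
        (loss / self.gradient_accumulation_steps).backward()
        engine.fire_event(IterationEvents.BACKWARD_COMPLETED)

        # step the optimizer only every gradient_accumulation_steps iterations
        if engine.state.iteration % self.gradient_accumulation_steps == 0:
            engine.optimizer.step()
        engine.fire_event(IterationEvents.MODEL_COMPLETED)
        return engine.state.output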


jak0bw commented Mar 3, 2023

#6101 would be a first example of what I imagine the solution could look like.

(Source code is strongly (and shamelessly) influenced by https://pytorch.org/ignite/generated/ignite.engine.supervised_training_step.html.)
