[onert] Introduce use-def chains into `TrainableGraph` #13317

ragmani · 2024-06-27T12:56:12Z

What

Let's introduce use-def chains and new index types for training into TrainableGraph and apply it for memory optimization.

Why

There was no use-def chains that could directly know dependencies between operations and operands in a TrainableGraph. So we were not even verifying whether TrainableGraph is a dag graph, and we were also unable to optimize memory for training.

Ways to support new index types for training

New types : TrainingOperationIndex, TrainingOperandIndex

A way of distinguishing whether an operand or an operation is in forwarding and backwarding by using an existing index number and the information on whether it is in forwarding or in backwarding within the new index types.

An example

template <typename Index> class TrainingIndex
class TrainingIndex
{
...
private:
  Index _index;
  bool _is_forward;
}

using TrainingOperationIndex = TrainingIndex<OperationIndex>;
using TrainingOperandIndex = TrainingIndex<OperandIndex>;

A way of distinguishing whether an operand or an operation is in forwarding and backwarding only by using an new index number within the new index types.

An example

using TrainingOperationIndex = ::onert::util::Index<uint32_t, TrainingOperationIndexTag>;
using TrainingOperandIndex = ::onert::util::Index<uint32_t, TrainingOperandIndexTag>;

I'm trying with the first way.

Draft : #13305

The text was updated successfully, but these errors were encountered:

ragmani · 2024-08-08T04:36:14Z

I'm closing this issue because memory usage for training reduces as #13282 (comment) on master branch(08cfdf5) now.

There are still parts that can be further optimized for memory.
For example,

Unifying planning of tensors for gradient, disposable, and extra.
Unifying planning of tensors for non-const and back-prop.

However, I won't proceed them unless there is any request since their effects are not expected to reduce memory usage dramatically.

ragmani changed the title ~~[onert]~~ [onert] Introduce use-def chains into TrainableGraph Jun 27, 2024

ragmani closed this as completed Aug 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[onert] Introduce use-def chains into `TrainableGraph` #13317

[onert] Introduce use-def chains into `TrainableGraph` #13317

ragmani commented Jun 27, 2024 •

edited

Loading

ragmani commented Aug 8, 2024

[onert] Introduce use-def chains into TrainableGraph #13317

[onert] Introduce use-def chains into TrainableGraph #13317

Comments

ragmani commented Jun 27, 2024 • edited Loading

What

Why

Ways to support new index types for training

ragmani commented Aug 8, 2024

[onert] Introduce use-def chains into `TrainableGraph` #13317

[onert] Introduce use-def chains into `TrainableGraph` #13317

ragmani commented Jun 27, 2024 •

edited

Loading