
[onert-micro] Introduce OMTraining entities #13145

Merged

Conversation

BalyshevArtem (Contributor):

This PR introduces the OMTrainingInterpreter, OMTrainingContext, and OMTrainingRuntimeModule entities.

For issue #12873
From draft: #13107

ONE-DCO-1.0-Signed-off-by: Artem Balyshev <a.balyshev@samsung.com>
uint32_t num_of_train_layers = 0;
OMTrainOptimizer optimizer = SGD;
OMLoss loss = MSE;
float learning_rate = 0.f;

Contributor:

How about a default value other than 0?

Contributor Author:

For learning_rate, right? Yes, I agree; I will use 0.001.

Contributor Author:

Done
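
With that applied, the default presumably now reads along these lines (reconstructed from the thread above, not copied from the final diff):

float learning_rate = 0.001f; // non-zero default, per review feedback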

// Close file
out_file.close();
#else
assert(fasle && "Not supported");

Contributor:

Suggested change:
- assert(fasle && "Not supported");
+ assert(false && "Not supported");

Contributor Author:

Thank you, fixed it

{
OMStatus status = Ok;
uint32_t batch_size = config.training_context.batch_size;
config.training_context.num_step += 1;

Contributor:

Thank you.

Contributor Author:

The only problem is where to reset this value to 0. For example, when we start a new epoch, we need to reset it to 0 again.

Contributor:

AFAIK, this num_step is used only for ADAM, so it does not need to be reset at a new epoch.
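
For context, a minimal sketch of an Adam-style update (illustrative only, not the PR's implementation; the hyperparameter names and values here are assumptions) showing why num_step should keep growing across epochs: the bias-correction factors depend on the cumulative number of updates, so resetting the counter each epoch would distort them.

#include <cmath>
#include <cstdint>

// Illustrative Adam update over a flat parameter buffer.
// num_step is the cumulative update counter (starts at 1, never reset).
void adamUpdate(float *param, const float *grad, float *m, float *v,
                uint32_t size, uint32_t num_step, float learning_rate)
{
  const float beta1 = 0.9f, beta2 = 0.999f, epsilon = 1e-7f;
  // Bias correction depends on the total step count, not the epoch.
  const float bc1 = 1.f - std::pow(beta1, static_cast<float>(num_step));
  const float bc2 = 1.f - std::pow(beta2, static_cast<float>(num_step));
  for (uint32_t i = 0; i < size; ++i)
  {
    m[i] = beta1 * m[i] + (1.f - beta1) * grad[i];
    v[i] = beta2 * v[i] + (1.f - beta2) * grad[i] * grad[i];
    param[i] -= learning_rate * (m[i] / bc1) / (std::sqrt(v[i] / bc2) + epsilon);
  }
}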

/*
* OMMetrics - enum to store metrics supported by training with onert-micro
*/
enum OMMetrics

Contributor:

@BalyshevArtem Note that the nnfw API does not expose metric evaluation; that is, the nnfw API will expose a loss evaluation function.

chunseoklee previously approved these changes Jun 11, 2024

chunseoklee (Contributor) left a comment:

LGTM. Thank you

Torrero (Contributor) left a comment:

LGTM

return UnknownError;

// Write data
out_file.write(config.model_ptr, config.model_size);

Contributor:

Maybe it would be better to catch and handle possible failures of the write function and, if any arise, return an error status (the same applies to saveCheckpoint). Just check the ios_base::badbit state.
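
A minimal sketch of what such a check might look like (function and variable names are assumptions; OMStatus, Ok, and UnknownError appear in the snippets above and are stubbed here so the sketch is self-contained):

#include <cstddef>
#include <fstream>

enum OMStatus { Ok, UnknownError }; // stub for illustration

// Sketch: verify the stream state after writing and surface failures
// as an error status instead of returning Ok unconditionally.
OMStatus writeModelToFile(const char *path, const char *model_ptr, std::size_t model_size)
{
  std::ofstream out_file(path, std::ios::binary);
  if (!out_file.is_open())
    return UnknownError;

  // Write data
  out_file.write(model_ptr, model_size);

  // fail() reports failbit/badbit, i.e. a failed write (e.g. disk full).
  if (out_file.fail())
    return UnknownError;

  out_file.close();
  return Ok;
}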

Contributor Author:

I will add this in the next PR, thank you.

BalyshevArtem merged commit a3dbe82 into Samsung:master on Jun 11, 2024. 4 checks passed.