[onert] Introduce DepthwiseConv2d op for training #12382

jyoungyun · 2023-12-28T02:50:23Z

What

Let's introduce DepthwiseConv2d gradient kernel for training.

For #12325

Draft #12392

Task

jyoungyun · 2024-01-11T09:27:06Z

Performance check

input: {100,28,28,1}, kernel: {32,32}

using eigen thread

[ backwardFloat32] backpropInput time = 8482
[ backwardFloat32] backpropFilter time = 2020
[    backward    ] backward time = 10560

tensorflow ref kernel

[ backwardFloat32] backpropInput time = 2082
[ backwardFloat32] backpropFilter time = 1995
[    backward    ] backward time = 4119

input: {10,112,112,32}, kernel: {1,3,3,32}

InputGradExpected time = 40913 <- ref kernel
backpropInput time = 2839 <- using thread
FilterGradExpected time = 37948 <- ref kernel
backpropFilter time = 3540 <- using thread

input gradient	ref kernel	using thread	diff
input(28,28,1)	2082	8482	6400 up
input(112,112,32)	40913	2839	38074 down

filter gradient	ref kernel	using thread	diff
kernel(32,32,1)	1995	2020	25 up
kernel(3,3,32)	37948	3540	33937 down

jyoungyun · 2024-01-22T02:06:31Z

Done!

jyoungyun mentioned this issue Dec 28, 2023

[onert] Support mobilenet_v2 model training #12325

Closed

11 tasks

jyoungyun closed this as completed Jan 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[onert] Introduce DepthwiseConv2d op for training #12382

[onert] Introduce DepthwiseConv2d op for training #12382

jyoungyun commented Dec 28, 2023 •

edited

Loading

jyoungyun commented Jan 11, 2024 •

edited

Loading

jyoungyun commented Jan 22, 2024

[onert] Introduce DepthwiseConv2d op for training #12382

[onert] Introduce DepthwiseConv2d op for training #12382

Comments

jyoungyun commented Dec 28, 2023 • edited Loading

What

Task

jyoungyun commented Jan 11, 2024 • edited Loading

Performance check

jyoungyun commented Jan 22, 2024

jyoungyun commented Dec 28, 2023 •

edited

Loading

jyoungyun commented Jan 11, 2024 •

edited

Loading