
[luci/pass] Introduce FuseMulToFullyConnectedWeightsPass #13439

Merged: 1 commit merged into Samsung:master on Jul 16, 2024

Conversation

@seanshpark (Contributor):

This will introduce FuseMulToFullyConnectedWeightsPass, which fuses a Mul into the weights of the following FullyConnected node where possible.

ONE-DCO-1.0-Signed-off-by: SaeHie Park <[email protected]>
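
For context, the identity behind this fusion can be checked with a small NumPy sketch (array names and shapes below are illustrative only, not the pass's API): multiplying the input of a FullyConnected element-wise by a per-channel scale is equivalent to scaling the matching weight columns.

import numpy as np

# Illustrative check of the fusion identity: FC(Mul(x, s)) == FC'(x),
# where FC' uses weights scaled by s along the input-feature axis.
rng = np.random.default_rng(0)
x = rng.standard_normal((3, 4))  # input      [batch, in_features]
s = rng.standard_normal(4)       # Mul scale, broadcast over in_features
W = rng.standard_normal((6, 4))  # FC weights [out_features, in_features]
b = rng.standard_normal(6)       # FC bias    [out_features]

before = (x * s) @ W.T + b  # Mul followed by FullyConnected
after = x @ (W * s).T + b   # Mul folded into the FC weights
assert np.allclose(before, after)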

@seanshpark requested a review from a team, July 16, 2024 00:02
@mhs4670go (Contributor) left a comment:

LGTM

@seanshpark merged commit 7de74b8 into Samsung:master, Jul 16, 2024
7 checks passed
@seanshpark deleted the luci_pass_fusemultofcwei branch, July 16, 2024 03:15
@shs-park (Contributor) left a comment:

I left some questions, PTAL =)

_mul_s->size<DT>(3);
for (uint32_t i = 0; i < 3; ++i)
{
_mul_s->at<DT>(0) = 1.0f;
Contributor:

The i index is not used in the for statement..!?

Contributor:

Did you intend the code below?

Suggested change
- _mul_s->at<DT>(0) = 1.0f;
+ _mul_s->at<DT>(i) = 1.0f;

_fc_w->size<DT>(4 * 6);
for (uint32_t i = 0; i < 4 * 6; ++i)
{
_fc_w->at<DT>(0) = 1.0f;
Contributor:

ditto..?

_fc_b->size<DT>(6);
for (uint32_t i = 0; i < 6; ++i)
{
_fc_b->at<DT>(0) = 1.0f;
Contributor:

ditto..?

Comment on lines +66 to +70
_fc_w->dim(0) = 3;
_fc_w->dim(1) = 4;
_fc_w->dtype(DT);
_fc_w->size<DT>(4 * 6);
for (uint32_t i = 0; i < 4 * 6; ++i)
Contributor:

The shape of _fc_w is <3x4>.
Doesn't the size have to match the shape..?

Suggested change
- _fc_w->dim(0) = 3;
- _fc_w->dim(1) = 4;
- _fc_w->dtype(DT);
- _fc_w->size<DT>(4 * 6);
- for (uint32_t i = 0; i < 4 * 6; ++i)
+ _fc_w->dim(0) = 3;
+ _fc_w->dim(1) = 4;
+ _fc_w->dtype(DT);
+ _fc_w->size<DT>(3 * 4);
+ for (uint32_t i = 0; i < 3 * 4; ++i)

Contributor:

or

Suggested change
- _fc_w->dim(0) = 3;
- _fc_w->dim(1) = 4;
- _fc_w->dtype(DT);
- _fc_w->size<DT>(4 * 6);
- for (uint32_t i = 0; i < 4 * 6; ++i)
+ _fc_w->dim(0) = 4;
+ _fc_w->dim(1) = 6;
+ _fc_w->dtype(DT);
+ _fc_w->size<DT>(4 * 6);
+ for (uint32_t i = 0; i < 4 * 6; ++i)

@seanshpark (Contributor Author):

Please check how the FullyConnected Op works.

With torch,

import torch

tensor1 = torch.randn(3, 4)
FC = torch.nn.Linear(4, 6)
output = FC(tensor1)
print(output)
print(output.shape)

gives something like

tensor([[-0.8929,  0.2467, -0.1407, -0.9917, -0.3787,  0.6504],
        [-0.2442, -0.0493, -0.1175, -1.0433,  0.0100, -0.3347],
        [ 0.7240,  0.0195, -0.3735, -1.5789, -0.0348, -0.9252]],
       grad_fn=<AddmmBackward0>)
torch.Size([3, 6])

@seanshpark (Contributor Author):

A simpler case is with a vector:

tensor1 = torch.randn(4)
FC = torch.nn.Linear(4, 6)
output = FC(tensor1)
print(output)
print(output.shape)

gives

tensor([ 0.7355, -0.1311,  0.4749, -0.1821, -0.1245, -1.0594],
       grad_fn=<ViewBackward0>)
torch.Size([6])
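
The weight layout can also be checked directly; torch.nn.Linear stores its weight as [out_features, in_features]:

import torch

FC = torch.nn.Linear(4, 6)
print(FC.weight.shape)
# torch.Size([6, 4]) -- weight is [out_features, in_features]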

@seanshpark (Contributor Author):

Ah, it should be

    _fc_w->dim(0) = 6;
    _fc_w->dim(1) = 4;

_mul->x(input());
_mul->y(_mul_s);
_fc->input(_mul);
_fc->weights(_fc_b);
Contributor:

Should it be _fc_w instead of _fc_b..?

Comment on lines +155 to +160
TEST_F(FuseMulToFullyConnectedWeightsPassS32Test, dtype_s32_NEG)
{
_graph.init();

EXPECT_FALSE(_pass.run(_graph.g()));
}
Contributor:

This test seems the same as the fuse_mul_to_fc_weights test, but it expects false.
How could that be..!?

@seanshpark (Contributor Author):

FuseMulToFullyConnectedWeightsPassS32Test is built with S32, a datatype we can't support.

Contributor:

Ah, it was S32.
Thank you!

@shs-park (Contributor):

@seanshpark,
ping..
(It has already been merged, so I mentioned you directly to notify you.)
