
[compute/cker] Fix RmsNorm cker #14218

Merged: 2 commits into Samsung:master on Oct 16, 2024

Conversation

seockho-kim (Contributor):

This commit fixes the RmsNorm cker to accept 3-dimensional input and a single gamma.

ONE-DCO-1.0-Signed-off-by: Seockho Kim [email protected]

issue: #14089
draft: #14088
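For reference, RMSNorm normalizes each vector over its last axis; the formula below restates what the kernel loop in this PR computes, with C the channel count and a single gamma broadcast to every channel:

$$\mathrm{rms}(x) = \sqrt{\frac{1}{C}\sum_{c=1}^{C} x_c^2 + \epsilon}, \qquad y_c = \gamma_c \cdot \frac{x_c}{\mathrm{rms}(x)}$$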

seockho-kim requested a review from a team October 14, 2024 06:49
// Input and output shapes must match; MatchingDim asserts this and returns the size.
const int32_t heights = MatchingDim(input_shape, 1, output_shape, 1);
const int32_t widths = MatchingDim(input_shape, 2, output_shape, 2);
const int32_t channels = MatchingDim(input_shape, 3, output_shape, 3);
// A rank-1, single-element gamma is broadcast across all channels.
bool single_gamma = gamma_shape.DimensionsCount() == 1 && gamma_shape.Dims(0) == 1;
Contributor:
@seockho-kim Could you check whether we need to allow a scalar number (it may have DimensionsCount() == 0)?

seockho-kim (Contributor, Author):

> @seockho-kim Could you check whether we need to allow a scalar number (it may have DimensionsCount() == 0)?

The current model has no gamma (scale), so it is set to the default gamma ([1.0]) after fusing.
So there is no need to allow a scalar number for now, I think.
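For clarity, a summary of the gamma-shape cases implied by the check in the diff above (illustrative comments only, not code from the PR):

```cpp
// Gamma-shape cases against the check above (illustrative):
//   rank-0 scalar: DimensionsCount() == 0               -> single_gamma == false
//                  (the case discussed here; not produced after fusing)
//   shape [1]:     DimensionsCount() == 1, Dims(0) == 1 -> single_gamma, broadcast
//   shape [C]:     DimensionsCount() == 1, Dims(0) == C -> per-channel gamma
bool single_gamma = gamma_shape.DimensionsCount() == 1 && gamma_shape.Dims(0) == 1;
```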

Contributor:

AFAIK, gamma is some real value, not 1.0. Maybe your current model is not the final or actual model. Let's check why gamma is 1.0.

seockho-kim (Contributor, Author):

> AFAIK, gamma is some real value, not 1.0. Maybe your current model is not the final or actual model. Let's check why gamma is 1.0.

The current model's RMSNorm pattern (the one we need to fuse) has no scale.
(#13964 (comment))
So the scale (gamma) is set to 1.0 when the pattern is fused into RMSNorm.

glistening (Contributor), Oct 16, 2024:

@seockho-kim I talked with @jinevening, who provided us the model, and we concluded that gamma is not 1.0. It has actual values, but the gamma values seem to be propagated into each successor's (fully connected) weights during our internal quantization and optimization.
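For intuition, this propagation works because a per-channel scale commutes into the next layer's weight matrix; a sketch of the algebra (not stated in the PR itself), writing $\hat{x}$ for the normalized input:

$$y = W(\gamma \odot \hat{x}) = \big(W \,\mathrm{diag}(\gamma)\big)\,\hat{x} = W'\,\hat{x}, \qquad W' = W\,\mathrm{diag}(\gamma)$$

After $\gamma$ is folded into $W'$, the gamma left in the fused RMSNorm is effectively 1.0, which matches what the fused model shows.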

glistening previously approved these changes Oct 16, 2024
glistening (Contributor) left a comment:

LGTM

glistening dismissed their stale review October 16, 2024 05:45

Let me look at the code some more.

Comment on lines 47 to 66
// The enclosing batch loop and its opening brace are above this excerpt.
for (int32_t height = 0; height < heights; height++)
{
  for (int32_t width = 0; width < widths; width++)
  {
    // First pass: sum of squares over the channel (last) axis.
    double square_sum = 0.0;
    for (int32_t channel = 0; channel < channels; channel++)
    {
      double input_val = input_data[Offset(input_shape, batch, height, width, channel)];
      square_sum += (input_val * input_val);
    }
    double rms = std::sqrt((square_sum / channels) + params.epsilon);
    // Second pass: scale each channel by gamma / rms.
    for (int32_t channel = 0; channel < channels; channel++)
    {
      double gamma = (single_gamma ? gamma_data[0] : gamma_data[channel]);
      output_data[Offset(output_shape, batch, height, width, channel)] =
        gamma * (input_data[Offset(input_shape, batch, height, width, channel)] / rms);
    }
  }
}
}
glistening (Contributor), Oct 16, 2024:

@seockho-kim width must be the innermost. Please refer to compute/cker/include/cker/operation/InstanceNorm.h.

I was wrong.
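For readers who want to experiment with the loop above outside cker, here is a minimal standalone sketch over a flat [rows, channels] view (RmsNormRef is a hypothetical name; the real kernel uses cker's Shape, Offset, and RmsNormParams):

```cpp
#include <cmath>
#include <cstdio>
#include <vector>

// Standalone last-axis RMSNorm over a flat [rows, channels] view.
// A single-element gamma broadcasts to all channels, mirroring `single_gamma`.
void RmsNormRef(const std::vector<float> &input, int rows, int channels,
                const std::vector<float> &gamma, float epsilon,
                std::vector<float> &output)
{
  const bool single_gamma = gamma.size() == 1;
  for (int r = 0; r < rows; ++r)
  {
    // First pass: mean of squares over the last axis.
    double square_sum = 0.0;
    for (int c = 0; c < channels; ++c)
    {
      const double v = input[r * channels + c];
      square_sum += v * v;
    }
    const double rms = std::sqrt(square_sum / channels + epsilon);
    // Second pass: scale by gamma / rms.
    for (int c = 0; c < channels; ++c)
    {
      const double g = single_gamma ? gamma[0] : gamma[c];
      output[r * channels + c] = static_cast<float>(g * (input[r * channels + c] / rms));
    }
  }
}

int main()
{
  // One row of 4 channels with the default fused gamma [1.0] discussed above.
  std::vector<float> input{1.f, 2.f, 3.f, 4.f}, gamma{1.0f}, output(4);
  RmsNormRef(input, 1, 4, gamma, 1e-6f, output);
  for (float v : output)
    std::printf("%f\n", v);
  return 0;
}
```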

glistening requested a review from a team October 16, 2024 05:52
square_sum += (input_val * input_val);
}
double rms = std::sqrt((square_sum / channels) + params.epsilon);
for (int32_t channel = 0; channel < channels; channel++)
Contributor:

Suggested change:
-for (int32_t channel = 0; channel < channels; channel++)
+// normalize over last-axis
+for (int32_t channel = 0; channel < channels; channel++)

seockho-kim (Contributor, Author):

I'll update it :)

glistening requested a review from a team October 16, 2024 07:04
Comment updated to explain that current RMSNorm normalizes over the last axis.

ONE-DCO-1.0-Signed-off-by: Seockho Kim [email protected]
glistening (Contributor) left a comment:

LGTM

glistening requested a review from a team October 16, 2024 07:47
glistening merged commit c5fd64a into Samsung:master Oct 16, 2024
9 checks passed
seockho-kim deleted the compute_cker_rmsnorm branch October 17, 2024 00:41