
[GPU/OpenCL] Broadcasting support added for GPU Addition kernel. #2759

Merged: 1 commit into nnstreamer:main, Nov 22, 2024

Conversation

niket-agarwal (Contributor)

@niket-agarwal niket-agarwal commented Oct 17, 2024

Performs addition where the dimensions of InputA and InputB differ.
Broadcasting support is added only for the case where the number of batches differs while all other dimensions are the same for both inputs.
The number of batches of InputB must be 1.
The output of add_i_cl(A, B) is stored in A in place.

Self evaluation:

Build test: [X]Passed [ ]Failed [ ]Skipped
Run test: [X]Passed [ ]Failed [ ]Skipped
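The broadcast rule described above (all dimensions equal except the batch count, with InputB fixed at one batch) can be sketched as a CPU reference. This is an illustrative sketch only, with std::vector standing in for nntrainer's Tensor and a hypothetical function name, not project code:

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// CPU reference for the broadcast semantics described above (hypothetical
// helper, not the actual nntrainer kernel): A holds `batch` batches of
// `feature_len` elements each, B holds exactly one batch of `feature_len`
// elements. The sum is accumulated into A in place, mirroring add_i_cl(A, B).
void add_i_broadcast_ref(std::vector<float> &A, const std::vector<float> &B,
                         std::size_t batch) {
  const std::size_t feature_len = B.size();
  assert(A.size() == batch * feature_len);
  for (std::size_t b = 0; b < batch; ++b)
    for (std::size_t i = 0; i < feature_len; ++i)
      A[b * feature_len + i] += B[i]; // B is reused for every batch of A
}
```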

@taos-ci

taos-ci commented Oct 17, 2024

📝 TAOS-CI Version: 1.5.20200925. Thank you for submitting PR #2759. Please follow the 1 commit/1 PR (one commit per PR) policy to get comments from reviewers quickly. Your PR must pass all verification processes of cibot before the review process by reviewers can start. If you are a new member joining this project, please read the manuals in the documentation folder and the wiki page. To monitor the progress status of your PR in more detail, visit http://ci.nnstreamer.ai/.

@taos-ci taos-ci left a comment

@niket-agarwal, 💯 All CI checkers are successfully verified. Thanks.


CREATE_IF_EMPTY_DIMS(result, result.getDim());
CREATE_IF_EMPTY_DIMS(inputA, inputA.getDim());
Contributor

I have a question. The result tensor could be empty before this modification, but is it possible for the inputA tensor to be empty with the current modification?

Contributor Author

Right! It shouldn't be. I'll modify it, thanks.

@niket-agarwal niket-agarwal force-pushed the broadcasting branch 2 times, most recently from 237e218 to b60e13f on October 18, 2024 08:05
@taos-ci taos-ci left a comment

@niket-agarwal, 💯 All CI checkers are successfully verified. Thanks.

@niket-agarwal niket-agarwal force-pushed the broadcasting branch 2 times, most recently from 6d063ee to 52c823a on October 29, 2024 08:27
@taos-ci taos-ci left a comment

@niket-agarwal, 💯 All CI checkers are successfully verified. Thanks.

@myungjoo (Member)

PTAL: @baek2sm @skykongkong8 @EunjuYang

addition_cl(data, rdata, size);

} else if (input.getDataType() == ml::train::TensorDim::DataType::FP16) {
void add_i_cl(Tensor &inputA, Tensor const &inputB) {
Contributor

Is there any reason you changed input & result to inputA and inputB?

Contributor

If there is an output with inputA and inputB in the future, this change would make sense. If not, I think it would be better to preserve the naming input and result.

Contributor

I feel the same way. I guess the intention of this naming was to express "a = a + b" more clearly (before, it was "result(b) = input(a) + result(b)"). So the inputA and inputB parameter names are fine as well, but they seem inconsistent with the other functions.

Contributor Author

Okay I'll update with this naming convention.

if (idx < size) {
output[idx] = output[idx] + input[idx];
if (idx < size_res) {
output[idx] = output[idx] + input[idx % size_input];
Contributor

For this kernel, we are assuming size_res is always greater than or equal to size_input, right?

Contributor Author

Yes correct.
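Since size_res covers all batches of the result while size_input covers the single batch of the input, size_res is a whole multiple of size_input, so idx % size_input wraps the input across every batch of the result. A host-side sketch of the same indexing (illustrative only, not the actual OpenCL kernel source):

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Host-side equivalent of the kernel body above: every "work-item" index
// idx below size_res adds the input element at idx % size_input, which
// repeats the single-batch input across each batch of the result. This
// relies on size_res being a whole multiple of size_input.
void broadcast_add_host(std::vector<float> &output,
                        const std::vector<float> &input) {
  const std::size_t size_res = output.size();
  const std::size_t size_input = input.size();
  assert(size_res % size_input == 0);
  for (std::size_t idx = 0; idx < size_res; ++idx)
    output[idx] += input[idx % size_input];
}
```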

*/
void add_i_cl(Tensor const &input, Tensor &result);
void add_i_cl(Tensor &inputA, Tensor const &inputB);
Contributor

I'm assuming inputA is the result and inputB is an input. Is this correct?

Contributor Author

Both are inputs; the addition is performed in place, and inputA is returned as the output.
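The signature itself expresses that contract. A minimal sketch, with std::vector standing in for nntrainer's Tensor and a hypothetical function name: the non-const reference marks the operand that is overwritten in place, and the const reference marks the read-only operand.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Illustrative sketch of the add_i_cl contract (not the real API): inputA
// is both an operand and the destination; inputB is never modified.
void add_i(std::vector<float> &inputA, const std::vector<float> &inputB) {
  assert(inputA.size() == inputB.size());
  for (std::size_t i = 0; i < inputA.size(); ++i)
    inputA[i] += inputB[i]; // result accumulates into inputA in place
}
```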

@EunjuYang EunjuYang left a comment

Thank you for your contribution.
I left my opinions and questions below. Please check them. Thanks.

nntrainer/tensor/cl_operations/blas_kernel_interface.h (outdated; resolved)
nntrainer/tensor/cl_operations/blas_kernels.h (outdated; resolved)
nntrainer/tensor/cl_operations/blas_kernels_fp16.cpp (outdated; resolved)
@niket-agarwal niket-agarwal force-pushed the broadcasting branch 2 times, most recently from ed95ee3 to fcaad63 on November 13, 2024 07:30
@taos-ci taos-ci left a comment

@niket-agarwal, 💯 All CI checkers are successfully verified. Thanks.

@EunjuYang EunjuYang left a comment

In the current main branch, unittest_layers_addition_cl is disabled.

# ../unittest/layers/unittest_layers_addition_cl.cpp \

Please enable it and check that unittest_layers passes all unit test cases with ./tools/android_test.sh. If you find *.nnlayergolden missing errors, please update unittest_layers.tar.gz as well (cf. #2798).

@EunjuYang (Contributor)

Also, since you added the new feature of broadcasting support for the addition kernel, what about adding a unit test for that case?
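Such a unit test could take roughly this shape. Plain checks stand in for the project's gtest harness here, and all names and sizes are hypothetical, not the test that was actually added to unittest_blas_kernels_cl.cpp:

```cpp
#include <cstddef>
#include <vector>

// Hypothetical broadcast-addition test case: A has 2 batches of 6 elements,
// all 1.0f; B has a single batch with B[i] = i. After the kernel-style
// modulo add, every batch of A should have gained B exactly once.
bool broadcast_add_case() {
  const std::size_t batch = 2, feature_len = 6;
  std::vector<float> A(batch * feature_len, 1.0f);
  std::vector<float> B(feature_len);
  for (std::size_t i = 0; i < feature_len; ++i)
    B[i] = static_cast<float>(i);

  // Same indexing as the GPU kernel: idx % size_input broadcasts B.
  for (std::size_t idx = 0; idx < A.size(); ++idx)
    A[idx] += B[idx % feature_len];

  // Verify against the expected per-batch result.
  for (std::size_t b = 0; b < batch; ++b)
    for (std::size_t i = 0; i < feature_len; ++i)
      if (A[b * feature_len + i] != 1.0f + static_cast<float>(i))
        return false;
  return true;
}
```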

@djeong20 djeong20 left a comment

Overall, LGTM!
As @EunjuYang mentioned, please add test cases for the newly added feature.

@niket-agarwal niket-agarwal force-pushed the broadcasting branch 2 times, most recently from 4c30910 to 45e15a2 on November 19, 2024 10:46
@taos-ci taos-ci left a comment

@niket-agarwal, 💯 All CI checkers are successfully verified. Thanks.

Added support for the case where the number of batches differs between input A and input B.
Added unit test case for new feature in unittest_blas_kernels_cl.cpp

Self evaluation:

    Build test: [X]Passed [ ]Failed [ ]Skipped
    Run test: [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Niket Agarwal <[email protected]>
@taos-ci taos-ci left a comment

@niket-agarwal, 💯 All CI checkers are successfully verified. Thanks.

@EunjuYang EunjuYang left a comment

LGTM!

@jijoongmoon jijoongmoon merged commit 7bd8e55 into nnstreamer:main Nov 22, 2024
38 checks passed
7 participants