matrix_multiply_out.txt
Block x:0,y:1 Thread x:0,y:0 => Adding A[8] + B[0] to sum
Block x:0,y:1 Thread x:1,y:0 => Adding A[8] + B[1] to sum
Block x:0,y:1 Thread x:0,y:1 => Adding A[12] + B[0] to sum
Block x:0,y:1 Thread x:1,y:1 => Adding A[12] + B[1] to sum
Block x:1,y:1 Thread x:0,y:0 => Adding A[8] + B[2] to sum
Block x:1,y:1 Thread x:1,y:0 => Adding A[8] + B[3] to sum
Block x:1,y:1 Thread x:0,y:1 => Adding A[12] + B[2] to sum
Block x:1,y:1 Thread x:1,y:1 => Adding A[12] + B[3] to sum
Block x:1,y:0 Thread x:0,y:0 => Adding A[0] + B[2] to sum
Block x:1,y:0 Thread x:1,y:0 => Adding A[0] + B[3] to sum
Block x:1,y:0 Thread x:0,y:1 => Adding A[4] + B[2] to sum
Block x:1,y:0 Thread x:1,y:1 => Adding A[4] + B[3] to sum
Block x:0,y:0 Thread x:0,y:0 => Adding A[0] + B[0] to sum
Block x:0,y:0 Thread x:1,y:0 => Adding A[0] + B[1] to sum
Block x:0,y:0 Thread x:0,y:1 => Adding A[4] + B[0] to sum
Block x:0,y:0 Thread x:1,y:1 => Adding A[4] + B[1] to sum
Block x:0,y:1 Thread x:0,y:0 => Adding A[9] + B[4] to sum
Block x:0,y:1 Thread x:1,y:0 => Adding A[9] + B[5] to sum
Block x:0,y:1 Thread x:0,y:1 => Adding A[13] + B[4] to sum
Block x:0,y:1 Thread x:1,y:1 => Adding A[13] + B[5] to sum
Block x:1,y:1 Thread x:0,y:0 => Adding A[9] + B[6] to sum
Block x:1,y:1 Thread x:1,y:0 => Adding A[9] + B[7] to sum
Block x:1,y:1 Thread x:0,y:1 => Adding A[13] + B[6] to sum
Block x:1,y:1 Thread x:1,y:1 => Adding A[13] + B[7] to sum
Block x:1,y:0 Thread x:0,y:0 => Adding A[1] + B[6] to sum
Block x:1,y:0 Thread x:1,y:0 => Adding A[1] + B[7] to sum
Block x:1,y:0 Thread x:0,y:1 => Adding A[5] + B[6] to sum
Block x:1,y:0 Thread x:1,y:1 => Adding A[5] + B[7] to sum
Block x:0,y:0 Thread x:0,y:0 => Adding A[1] + B[4] to sum
Block x:0,y:0 Thread x:1,y:0 => Adding A[1] + B[5] to sum
Block x:0,y:0 Thread x:0,y:1 => Adding A[5] + B[4] to sum
Block x:0,y:0 Thread x:1,y:1 => Adding A[5] + B[5] to sum
Block x:0,y:1 Thread x:0,y:0 => Adding A[10] + B[8] to sum
Block x:0,y:1 Thread x:1,y:0 => Adding A[10] + B[9] to sum
Block x:0,y:1 Thread x:0,y:1 => Adding A[14] + B[8] to sum
Block x:0,y:1 Thread x:1,y:1 => Adding A[14] + B[9] to sum
Block x:1,y:1 Thread x:0,y:0 => Adding A[10] + B[10] to sum
Block x:1,y:1 Thread x:1,y:0 => Adding A[10] + B[11] to sum
Block x:1,y:1 Thread x:0,y:1 => Adding A[14] + B[10] to sum
Block x:1,y:1 Thread x:1,y:1 => Adding A[14] + B[11] to sum
Block x:1,y:0 Thread x:0,y:0 => Adding A[2] + B[10] to sum
Block x:1,y:0 Thread x:1,y:0 => Adding A[2] + B[11] to sum
Block x:1,y:0 Thread x:0,y:1 => Adding A[6] + B[10] to sum
Block x:1,y:0 Thread x:1,y:1 => Adding A[6] + B[11] to sum
Block x:0,y:0 Thread x:0,y:0 => Adding A[2] + B[8] to sum
Block x:0,y:0 Thread x:1,y:0 => Adding A[2] + B[9] to sum
Block x:0,y:0 Thread x:0,y:1 => Adding A[6] + B[8] to sum
Block x:0,y:0 Thread x:1,y:1 => Adding A[6] + B[9] to sum
Block x:0,y:1 Thread x:0,y:0 => Adding A[11] + B[12] to sum
Block x:0,y:1 Thread x:1,y:0 => Adding A[11] + B[13] to sum
Block x:0,y:1 Thread x:0,y:1 => Adding A[15] + B[12] to sum
Block x:0,y:1 Thread x:1,y:1 => Adding A[15] + B[13] to sum
Block x:1,y:1 Thread x:0,y:0 => Adding A[11] + B[14] to sum
Block x:1,y:1 Thread x:1,y:0 => Adding A[11] + B[15] to sum
Block x:1,y:1 Thread x:0,y:1 => Adding A[15] + B[14] to sum
Block x:1,y:1 Thread x:1,y:1 => Adding A[15] + B[15] to sum
Block x:1,y:0 Thread x:0,y:0 => Adding A[3] + B[14] to sum
Block x:1,y:0 Thread x:1,y:0 => Adding A[3] + B[15] to sum
Block x:1,y:0 Thread x:0,y:1 => Adding A[7] + B[14] to sum
Block x:1,y:0 Thread x:1,y:1 => Adding A[7] + B[15] to sum
Block x:0,y:0 Thread x:0,y:0 => Adding A[3] + B[12] to sum
Block x:0,y:0 Thread x:1,y:0 => Adding A[3] + B[13] to sum
Block x:0,y:0 Thread x:0,y:1 => Adding A[7] + B[12] to sum
Block x:0,y:0 Thread x:1,y:1 => Adding A[7] + B[13] to sum
Block x:0,y:1 Thread x:0,y:0 => Saving sum to C[8]
Block x:0,y:1 Thread x:1,y:0 => Saving sum to C[9]
Block x:0,y:1 Thread x:0,y:1 => Saving sum to C[12]
Block x:0,y:1 Thread x:1,y:1 => Saving sum to C[13]
Block x:1,y:1 Thread x:0,y:0 => Saving sum to C[10]
Block x:1,y:1 Thread x:1,y:0 => Saving sum to C[11]
Block x:1,y:1 Thread x:0,y:1 => Saving sum to C[14]
Block x:1,y:1 Thread x:1,y:1 => Saving sum to C[15]
Block x:1,y:0 Thread x:0,y:0 => Saving sum to C[2]
Block x:1,y:0 Thread x:1,y:0 => Saving sum to C[3]
Block x:1,y:0 Thread x:0,y:1 => Saving sum to C[6]
Block x:1,y:0 Thread x:1,y:1 => Saving sum to C[7]
Block x:0,y:0 Thread x:0,y:0 => Saving sum to C[0]
Block x:0,y:0 Thread x:1,y:0 => Saving sum to C[1]
Block x:0,y:0 Thread x:0,y:1 => Saving sum to C[4]
Block x:0,y:0 Thread x:1,y:1 => Saving sum to C[5]
A
0 1 2 3
4 5 6 7
8 9 10 11
12 13 14 15
B
0 1 2 3
4 5 6 7
8 9 10 11
12 13 14 15
C
56 62 68 74
152 174 196 218
248 286 324 362
344 398 452 506
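
The values printed for C match the ordinary matrix product of A and B. For example, C[0] = 0*0 + 1*4 + 2*8 + 3*12 = 56. So although the trace messages read "A[i] + B[j]", the accumulated term appears to be the product A[i] * B[j]. The sketch below is a sequential Python emulation under that assumption, not the original kernel. It walks the same block and thread coordinates and recomputes C, and the names multiply_like_trace and BLOCK are hypothetical.

```python
N = 4      # matrix width, as printed above
BLOCK = 2  # threads per block along each axis (assumed from the trace)

A = list(range(N * N))  # row-major 0..15, as printed above
B = list(range(N * N))

def multiply_like_trace(a, b):
    """Emulate the one-thread-per-element product seen in the trace."""
    c = [0] * (N * N)
    for by in range(N // BLOCK):
        for bx in range(N // BLOCK):
            for ty in range(BLOCK):
                for tx in range(BLOCK):
                    row = by * BLOCK + ty
                    col = bx * BLOCK + tx
                    total = 0
                    for k in range(N):
                        # assumed accumulation step: product, not sum
                        total += a[row * N + k] * b[k * N + col]
                    c[row * N + col] = total
    return c

C = multiply_like_trace(A, B)
for r in range(N):
    print(" ".join(str(v) for v in C[r * N:(r + 1) * N]))
```

Run as written, this prints the same four rows as the C matrix above.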