Skip to content

Commit

Permalink
[GH-PAGES] Updated website
Browse files Browse the repository at this point in the history
  • Loading branch information
gh-actions-bot authored and gh-actions-bot committed Feb 22, 2025
1 parent 57d7e9e commit b864b4d
Show file tree
Hide file tree
Showing 65 changed files with 455 additions and 455 deletions.
Binary file modified main/.doctrees/environment.pickle
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/01-vector-add.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/05-layer-norm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/08-grouped-gemm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/sg_execution_times.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/_images/sphx_glr_01-vector-add_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_01-vector-add_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_02-fused-softmax_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_02-fused-softmax_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_002.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_05-layer-norm_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_05-layer-norm_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_002.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_003.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_08-grouped-gemm_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_08-grouped-gemm_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
10 changes: 5 additions & 5 deletions main/_sources/getting-started/tutorials/01-vector-add.rst.txt
Original file line number Diff line number Diff line change
Expand Up @@ -232,7 +232,7 @@ We can now run the decorated function above. Pass `print_data=True` to see the p

vector-add-performance:
size Triton Torch
0 4096.0 9.600000 8.000000
0 4096.0 8.000000 9.600000
1 8192.0 15.999999 15.999999
2 16384.0 31.999999 31.999999
3 32768.0 63.999998 63.999998
Expand All @@ -241,12 +241,12 @@ We can now run the decorated function above. Pass `print_data=True` to see the p
6 262144.0 384.000001 384.000001
7 524288.0 614.400016 614.400016
8 1048576.0 819.200021 819.200021
9 2097152.0 1023.999964 1023.999964
10 4194304.0 1228.800031 1228.800031
9 2097152.0 1045.787204 1023.999964
10 4194304.0 1260.307736 1228.800031
11 8388608.0 1424.695621 1424.695621
12 16777216.0 1560.380965 1560.380965
13 33554432.0 1624.859540 1624.859540
14 67108864.0 1669.706983 1662.646960
14 67108864.0 1669.706983 1666.169441
15 134217728.0 1684.008546 1678.616907


Expand All @@ -255,7 +255,7 @@ We can now run the decorated function above. Pass `print_data=True` to see the p

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 7.306 seconds)
**Total running time of the script:** (0 minutes 6.304 seconds)


.. _sphx_glr_download_getting-started_tutorials_01-vector-add.py:
Expand Down
198 changes: 99 additions & 99 deletions main/_sources/getting-started/tutorials/02-fused-softmax.rst.txt
Original file line number Diff line number Diff line change
Expand Up @@ -321,104 +321,104 @@ We will then compare its performance against (1) :code:`torch.softmax` and (2) t
softmax-performance:
N Triton Torch
0 256.0 465.044599 705.085923
1 384.0 616.695669 817.685769
2 512.0 747.366924 922.182090
3 640.0 824.100839 946.771850
4 768.0 882.151663 1026.234825
5 896.0 937.814385 1062.694926
6 1024.0 1007.711556 1122.646528
7 1152.0 1107.226266 611.074438
8 1280.0 1153.292361 669.958194
9 1408.0 1167.426735 721.314161
10 1536.0 1184.655877 780.087937
11 1664.0 1215.957175 811.110045
12 1792.0 1232.509469 854.720856
13 1920.0 1255.053420 907.695478
14 2048.0 1278.202547 953.971428
15 2176.0 1245.390831 978.381129
16 2304.0 1246.421817 1011.119265
17 2432.0 1274.105928 1052.903308
18 2560.0 1289.612947 1086.877454
19 2688.0 1288.519018 1099.261935
20 2816.0 1303.811463 1127.526268
21 2944.0 1314.629501 1166.712966
22 3072.0 1331.431119 1186.331573
23 3200.0 1329.521339 1197.060790
24 3328.0 1340.323524 1221.403606
25 3456.0 1348.004637 1252.099576
26 3584.0 1351.328455 1263.316706
27 3712.0 1371.009649 1266.292238
28 3840.0 1375.734055 1302.556841
29 3968.0 1375.636823 1315.865252
30 4096.0 1378.446933 1323.195881
31 4224.0 1333.002032 1159.397402
32 4352.0 1333.059653 1172.802572
33 4480.0 1355.127797 1185.407362
34 4608.0 1359.585522 1193.364827
35 4736.0 1356.834268 1200.177145
36 4864.0 1377.947633 1221.732348
37 4992.0 1376.310150 1237.508004
38 5120.0 1373.371121 1249.134141
39 5248.0 1375.842094 1258.416150
40 5376.0 1377.374344 1285.263886
41 5504.0 1376.380637 1297.422943
42 5632.0 1384.119000 1311.673950
43 5760.0 1399.134344 1325.566346
44 5888.0 1383.801995 1341.460366
45 6016.0 1403.606203 1352.971500
46 6144.0 1408.678249 1372.510180
47 6272.0 1411.074000 1374.370174
48 6400.0 1417.379870 1386.907428
49 6528.0 1417.652063 1395.593625
50 6656.0 1418.290484 1404.400020
51 6784.0 1414.575550 1413.085335
52 6912.0 1422.889949 1424.817182
53 7040.0 1421.676675 1429.721666
54 7168.0 1424.666541 1434.195692
55 7296.0 1434.073329 1440.171669
56 7424.0 1429.847787 1445.291410
57 7552.0 1427.238320 1452.813146
58 7680.0 1436.445012 1458.646588
59 7808.0 1435.610253 1461.437874
60 7936.0 1434.151543 1467.563053
61 8064.0 1437.509319 1474.531262
62 8192.0 1436.980212 1485.808477
63 8320.0 1384.420314 1401.523798
64 8448.0 1377.119153 1406.040284
65 8576.0 1387.883853 1395.213543
66 8704.0 1382.156479 1400.517925
67 8832.0 1376.828871 1404.837010
68 8960.0 1391.739919 1410.252488
69 9088.0 1402.760151 1414.378598
70 9216.0 1397.060199 1422.933698
71 9344.0 1397.435348 1424.618506
72 9472.0 1395.551105 1435.529116
73 9600.0 1386.163668 1433.677878
74 9728.0 1396.314517 1443.086701
75 9856.0 1410.939464 1441.835542
76 9984.0 1392.746688 1448.925178
77 10112.0 1410.038425 1457.028102
78 10240.0 1411.303162 1465.319578
79 10368.0 1406.758832 1464.620608
80 10496.0 1410.046788 1464.304759
81 10624.0 1405.733310 1470.350057
82 10752.0 1399.829600 1473.042899
83 10880.0 1400.962308 1481.235284
84 11008.0 1416.644570 1476.592898
85 11136.0 1418.938603 1484.775366
86 11264.0 1423.406848 1484.834461
87 11392.0 1411.064251 1492.913550
88 11520.0 1422.549590 1492.717318
89 11648.0 1422.704544 1500.106471
90 11776.0 1422.079107 1498.617659
91 11904.0 1436.276229 1504.934375
92 12032.0 1424.168671 1508.561383
93 12160.0 1416.878791 1510.086114
94 12288.0 1430.237533 1391.144940
95 12416.0 1446.293947 1391.689418
96 12544.0 1436.362233 1393.189436
97 12672.0 1444.526488 1391.688412
0 256.0 469.120677 696.481314
1 384.0 619.191379 819.555941
2 512.0 751.866726 926.985382
3 640.0 809.886303 961.302789
4 768.0 881.873012 1031.111103
5 896.0 943.373480 1064.494144
6 1024.0 1003.694689 1125.410473
7 1152.0 1111.477376 611.736669
8 1280.0 1142.209038 670.749920
9 1408.0 1156.398886 724.512584
10 1536.0 1190.384104 778.604053
11 1664.0 1217.298954 810.700923
12 1792.0 1236.332249 859.045059
13 1920.0 1247.350351 911.705359
14 2048.0 1270.314823 959.033599
15 2176.0 1237.810515 977.335732
16 2304.0 1243.916052 1008.153792
17 2432.0 1271.354445 1058.444158
18 2560.0 1284.860017 1083.476046
19 2688.0 1298.240774 1104.943726
20 2816.0 1300.259216 1132.976504
21 2944.0 1318.002991 1166.471618
22 3072.0 1330.006682 1185.642094
23 3200.0 1331.996009 1191.063889
24 3328.0 1346.118359 1220.833201
25 3456.0 1354.013397 1248.948625
26 3584.0 1350.370443 1261.471420
27 3712.0 1361.196573 1269.933194
28 3840.0 1369.176617 1298.487649
29 3968.0 1370.322817 1317.320837
30 4096.0 1376.168411 1324.028214
31 4224.0 1334.488670 1160.028766
32 4352.0 1337.815799 1172.769547
33 4480.0 1354.995109 1182.945253
34 4608.0 1368.727348 1197.220646
35 4736.0 1358.273684 1197.767432
36 4864.0 1376.430178 1219.484238
37 4992.0 1370.010387 1238.692738
38 5120.0 1376.374647 1250.714882
39 5248.0 1378.852479 1255.935605
40 5376.0 1374.076702 1282.689893
41 5504.0 1382.936423 1300.563926
42 5632.0 1387.184315 1316.406305
43 5760.0 1394.231804 1325.609677
44 5888.0 1389.370461 1345.235087
45 6016.0 1400.023245 1353.805826
46 6144.0 1412.550337 1375.196883
47 6272.0 1409.303795 1372.723714
48 6400.0 1418.194675 1387.834965
49 6528.0 1416.137373 1392.344373
50 6656.0 1420.995917 1400.722765
51 6784.0 1411.549448 1411.507195
52 6912.0 1429.500669 1422.376755
53 7040.0 1420.219428 1434.351656
54 7168.0 1427.378695 1435.609628
55 7296.0 1429.052213 1443.196806
56 7424.0 1429.746191 1444.323067
57 7552.0 1428.946458 1454.898524
58 7680.0 1435.514017 1458.130742
59 7808.0 1431.427976 1462.915200
60 7936.0 1434.612335 1465.337742
61 8064.0 1437.244589 1472.916366
62 8192.0 1438.906451 1484.752011
63 8320.0 1384.842106 1402.958728
64 8448.0 1377.083016 1403.196654
65 8576.0 1386.531945 1395.660474
66 8704.0 1383.301378 1399.907732
67 8832.0 1379.075758 1405.886821
68 8960.0 1391.428995 1412.149292
69 9088.0 1403.430888 1418.411130
70 9216.0 1397.556817 1424.053905
71 9344.0 1395.884096 1422.231435
72 9472.0 1397.101667 1434.109984
73 9600.0 1392.610653 1434.735611
74 9728.0 1396.021617 1439.600734
75 9856.0 1408.394511 1442.183853
76 9984.0 1397.668783 1450.449968
77 10112.0 1406.737169 1454.448931
78 10240.0 1416.291780 1466.403136
79 10368.0 1409.013072 1465.406709
80 10496.0 1411.251392 1466.363507
81 10624.0 1406.027676 1467.589524
82 10752.0 1400.147116 1473.321898
83 10880.0 1397.345593 1480.257108
84 11008.0 1415.872320 1475.455493
85 11136.0 1419.920841 1489.018901
86 11264.0 1428.072523 1485.690178
87 11392.0 1408.835978 1488.156016
88 11520.0 1418.084335 1491.717852
89 11648.0 1421.466701 1496.369859
90 11776.0 1422.544359 1503.955108
91 11904.0 1436.281627 1504.616364
92 12032.0 1418.136280 1508.616202
93 12160.0 1413.437587 1513.737683
94 12288.0 1430.042577 1392.854717
95 12416.0 1444.306777 1391.722782
96 12544.0 1435.316443 1391.734989
97 12672.0 1446.635149 1394.660915
Expand All @@ -433,7 +433,7 @@ In the above plot, we can see that:

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 23.218 seconds)
**Total running time of the script:** (0 minutes 23.191 seconds)


.. _sphx_glr_download_getting-started_tutorials_02-fused-softmax.py:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -572,77 +572,77 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
matmul-performance-fp16:
M N K cuBLAS Triton
0 256.0 256.0 256.0 4.096000 4.096000
1 384.0 384.0 384.0 11.059200 12.288000
1 384.0 384.0 384.0 12.288000 12.288000
2 512.0 512.0 512.0 26.214401 26.214401
3 640.0 640.0 640.0 42.666665 39.384616
3 640.0 640.0 640.0 42.666665 42.666665
4 768.0 768.0 768.0 63.195428 63.195428
5 896.0 896.0 896.0 78.051553 87.808000
5 896.0 896.0 896.0 78.051553 93.661869
6 1024.0 1024.0 1024.0 110.376426 99.864382
7 1152.0 1152.0 1152.0 135.726544 119.439363
8 1280.0 1280.0 1280.0 157.538463 151.703703
9 1408.0 1408.0 1408.0 151.438217 132.970149
9 1408.0 1408.0 1408.0 155.765024 132.970149
10 1536.0 1536.0 1536.0 176.947204 153.867127
11 1664.0 1664.0 1664.0 179.978245 173.056002
11 1664.0 1664.0 1664.0 183.651271 173.056002
12 1792.0 1792.0 1792.0 172.914215 200.703997
13 1920.0 1920.0 1920.0 200.347822 166.554219
14 2048.0 2048.0 2048.0 223.696203 182.361039
15 2176.0 2176.0 2176.0 211.827867 203.269178
16 2304.0 2304.0 2304.0 229.691080 221.184000
17 2432.0 2432.0 2432.0 203.583068 196.464787
18 2560.0 2560.0 2560.0 222.911566 212.779229
19 2688.0 2688.0 2688.0 198.602388 191.581096
13 1920.0 1920.0 1920.0 200.347822 164.571430
14 2048.0 2048.0 2048.0 223.696203 184.365008
15 2176.0 2176.0 2176.0 209.621326 203.269178
16 2304.0 2304.0 2304.0 227.503545 221.184000
17 2432.0 2432.0 2432.0 206.576938 195.100438
18 2560.0 2560.0 2560.0 224.438347 210.051289
19 2688.0 2688.0 2688.0 198.602388 193.536006
20 2816.0 2816.0 2816.0 210.696652 203.804711
21 2944.0 2944.0 2944.0 221.493479 215.740400
22 3072.0 3072.0 3072.0 208.941345 203.680236
23 3200.0 3200.0 3200.0 214.046818 214.046818
24 3328.0 3328.0 3328.0 207.467716 204.520726
25 3456.0 3456.0 3456.0 216.724640 212.721813
26 3584.0 3584.0 3584.0 219.305830 204.818663
27 3712.0 3712.0 3712.0 208.990259 222.488510
28 3840.0 3840.0 3840.0 209.851994 201.442627
29 3968.0 3968.0 3968.0 208.945088 213.889466
30 4096.0 4096.0 4096.0 219.668951 207.126128
21 2944.0 2944.0 2944.0 221.493479 220.513412
22 3072.0 3072.0 3072.0 210.494802 203.680236
23 3200.0 3200.0 3200.0 214.765101 211.920530
24 3328.0 3328.0 3328.0 207.467716 202.792385
25 3456.0 3456.0 3456.0 219.080343 212.721813
26 3584.0 3584.0 3584.0 218.772251 201.604022
27 3712.0 3712.0 3712.0 209.428397 211.646909
28 3840.0 3840.0 3840.0 210.451005 205.944129
29 3968.0 3968.0 3968.0 208.945088 214.453305
30 4096.0 4096.0 4096.0 216.829933 208.736744
matmul-performance-fp8:
M N K Triton
0 256.0 256.0 256.0 4.096000
1 384.0 384.0 384.0 12.288000
0 256.0 256.0 256.0 3.640889
1 384.0 384.0 384.0 11.059200
2 512.0 512.0 512.0 26.214401
3 640.0 640.0 640.0 46.545454
4 768.0 768.0 768.0 58.982401
4 768.0 768.0 768.0 63.195428
5 896.0 896.0 896.0 87.808000
6 1024.0 1024.0 1024.0 95.325090
7 1152.0 1152.0 1152.0 124.415996
7 1152.0 1152.0 1152.0 119.439363
8 1280.0 1280.0 1280.0 141.241376
9 1408.0 1408.0 1408.0 136.294403
9 1408.0 1408.0 1408.0 132.970149
10 1536.0 1536.0 1536.0 150.593357
11 1664.0 1664.0 1664.0 149.981870
12 1792.0 1792.0 1792.0 167.752595
12 1792.0 1792.0 1792.0 170.294302
13 1920.0 1920.0 1920.0 160.744186
14 2048.0 2048.0 2048.0 182.361039
15 2176.0 2176.0 2176.0 173.479720
16 2304.0 2304.0 2304.0 192.644132
17 2432.0 2432.0 2432.0 186.056053
18 2560.0 2560.0 2560.0 188.321838
19 2688.0 2688.0 2688.0 182.370464
17 2432.0 2432.0 2432.0 188.553450
18 2560.0 2560.0 2560.0 185.129949
19 2688.0 2688.0 2688.0 181.497871
20 2816.0 2816.0 2816.0 195.579412
21 2944.0 2944.0 2944.0 192.417117
22 3072.0 3072.0 3072.0 191.294262
23 3200.0 3200.0 3200.0 189.910976
24 3328.0 3328.0 3328.0 189.450773
25 3456.0 3456.0 3456.0 190.145208
21 2944.0 2944.0 2944.0 193.162926
22 3072.0 3072.0 3072.0 190.010417
23 3200.0 3200.0 3200.0 188.790565
24 3328.0 3328.0 3328.0 187.721758
25 3456.0 3456.0 3456.0 187.929060
26 3584.0 3584.0 3584.0 190.095969
27 3712.0 3712.0 3712.0 190.280662
28 3840.0 3840.0 3840.0 190.675864
29 3968.0 3968.0 3968.0 192.770834
30 4096.0 4096.0 4096.0 199.283930
28 3840.0 3840.0 3840.0 188.402048
29 3968.0 3968.0 3968.0 191.861531
30 4096.0 4096.0 4096.0 197.233986
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (2 minutes 13.048 seconds)
**Total running time of the script:** (2 minutes 13.353 seconds)


.. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -246,7 +246,7 @@ References
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 0.701 seconds)
**Total running time of the script:** (0 minutes 0.711 seconds)


.. _sphx_glr_download_getting-started_tutorials_04-low-memory-dropout.py:
Expand Down
Loading

0 comments on commit b864b4d

Please sign in to comment.