[GH-PAGES] Updated website
gh-actions-bot authored and gh-actions-bot committed Aug 30, 2024
1 parent 84b77ce commit 8ca3f77
Showing 61 changed files with 427 additions and 427 deletions.
Binary file modified main/.doctrees/environment.pickle
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/01-vector-add.doctree
Binary file modified main/.doctrees/getting-started/tutorials/05-layer-norm.doctree
Binary file modified main/.doctrees/getting-started/tutorials/08-grouped-gemm.doctree
Binary file modified main/.doctrees/sg_execution_times.doctree
Binary file modified main/_images/sphx_glr_01-vector-add_001.png
Binary file modified main/_images/sphx_glr_01-vector-add_thumb.png
Binary file modified main/_images/sphx_glr_02-fused-softmax_001.png
Binary file modified main/_images/sphx_glr_02-fused-softmax_thumb.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_001.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_002.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_thumb.png
Binary file modified main/_images/sphx_glr_05-layer-norm_001.png
Binary file modified main/_images/sphx_glr_05-layer-norm_thumb.png
Binary file modified main/_images/sphx_glr_06-fused-attention_001.png
Binary file modified main/_images/sphx_glr_06-fused-attention_002.png
Binary file modified main/_images/sphx_glr_06-fused-attention_003.png
Binary file modified main/_images/sphx_glr_06-fused-attention_thumb.png
Binary file modified main/_images/sphx_glr_08-grouped-gemm_001.png
Binary file modified main/_images/sphx_glr_08-grouped-gemm_thumb.png
10 changes: 5 additions & 5 deletions main/_sources/getting-started/tutorials/01-vector-add.rst.txt
@@ -231,7 +231,7 @@ We can now run the decorated function above. Pass `print_data=True` to see the p
vector-add-performance:
size Triton Torch
0 4096.0 8.000000 8.000000
1 8192.0 15.999999 19.200000
1 8192.0 15.999999 15.999999
2 16384.0 31.999999 31.999999
3 32768.0 63.999998 63.999998
4 65536.0 127.999995 127.999995
@@ -240,20 +240,20 @@ We can now run the decorated function above. Pass `print_data=True` to see the p
7 524288.0 614.400016 614.400016
8 1048576.0 819.200021 819.200021
9 2097152.0 1023.999964 1023.999964
10 4194304.0 1260.307736 1228.800031
11 8388608.0 1424.695621 1424.695621
10 4194304.0 1228.800031 1228.800031
11 8388608.0 1424.695621 1404.342820
12 16777216.0 1560.380965 1560.380965
13 33554432.0 1631.601649 1624.859540
14 67108864.0 1669.706983 1662.646960
15 134217728.0 1684.910539 1678.616907
15 134217728.0 1684.008546 1678.616907





.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 15.433 seconds)
**Total running time of the script:** (0 minutes 6.796 seconds)


.. _sphx_glr_download_getting-started_tutorials_01-vector-add.py:
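
The duplicated rows in the vector-add hunks above are old/new pairs whose ``-``/``+`` diff markers were lost. A minimal sketch for quantifying how far each regenerated number moved, assuming the first row of each pair is the removed line and the second is the added line (the usual unified-diff order). The names ``changed_rows``, ``percent_change`` and ``summarize`` are illustrative only and are not part of the tutorial or of Triton:

```python
# Old/new (Triton, Torch) throughput pairs copied from the vector-add hunks above.
# Assumption: within each duplicated row, the first line is the old value and
# the second line is the new value.
changed_rows = {
    8192: ((15.999999, 19.200000), (15.999999, 15.999999)),
    4194304: ((1260.307736, 1228.800031), (1228.800031, 1228.800031)),
    8388608: ((1424.695621, 1424.695621), (1424.695621, 1404.342820)),
    134217728: ((1684.910539, 1678.616907), (1684.008546, 1678.616907)),
}


def percent_change(old, new):
    """Relative change from old to new, in percent."""
    return (new - old) / old * 100.0


def summarize(rows):
    """Map each size to (Triton % change, Torch % change), rounded to 2 places."""
    return {
        size: tuple(round(percent_change(o, n), 2) for o, n in zip(old, new))
        for size, (old, new) in rows.items()
    }


summary = summarize(changed_rows)
for size, (triton_pct, torch_pct) in summary.items():
    print(f"{size:>10}: Triton {triton_pct:+.2f}%  Torch {torch_pct:+.2f}%")
```

Shifts of this kind between two documentation builds are most plausibly run-to-run benchmark noise rather than a code change, although the commit itself does not say so.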
198 changes: 99 additions & 99 deletions main/_sources/getting-started/tutorials/02-fused-softmax.rst.txt
@@ -330,104 +330,104 @@ We will then compare its performance against (1) :code:`torch.softmax` and (2) t
softmax-performance:
N Triton Torch
0 256.0 481.122554 705.606305
1 384.0 610.434702 819.823114
2 512.0 761.656911 924.209827
3 640.0 799.226362 944.360090
4 768.0 867.047611 1027.173003
5 896.0 936.960598 1058.824580
6 1024.0 985.584033 1108.840124
7 1152.0 1107.393637 613.634046
8 1280.0 1146.930717 669.100706
9 1408.0 1162.115329 720.621389
10 1536.0 1185.177342 778.853902
11 1664.0 1217.205082 811.707348
12 1792.0 1241.317424 858.977076
13 1920.0 1248.378487 908.629411
14 2048.0 1278.451899 953.703798
15 2176.0 1255.573137 977.157558
16 2304.0 1268.279024 1008.176684
17 2432.0 1289.185340 1057.863892
18 2560.0 1308.571366 1084.184708
19 2688.0 1307.591343 1099.708516
20 2816.0 1319.874972 1130.874624
21 2944.0 1320.138954 1167.428064
22 3072.0 1351.664889 1185.122139
23 3200.0 1357.768931 1191.520567
24 3328.0 1353.422267 1220.980780
25 3456.0 1367.305367 1245.605859
26 3584.0 1379.735215 1258.157013
27 3712.0 1383.741181 1270.613298
28 3840.0 1386.899622 1302.070677
29 3968.0 1391.527406 1316.700118
30 4096.0 1399.801411 1325.898718
31 4224.0 1330.959459 1160.888959
32 4352.0 1335.577572 1172.599223
33 4480.0 1352.962036 1182.609123
34 4608.0 1362.112363 1193.610373
35 4736.0 1356.598590 1197.166243
36 4864.0 1375.028691 1222.774961
37 4992.0 1366.109616 1239.348747
38 5120.0 1372.128671 1250.058464
39 5248.0 1375.849884 1258.884814
40 5376.0 1374.978428 1286.796113
41 5504.0 1377.846198 1297.942346
42 5632.0 1384.413443 1312.716118
43 5760.0 1391.278564 1325.685551
44 5888.0 1394.194442 1340.336636
45 6016.0 1397.978091 1353.805381
46 6144.0 1406.785353 1372.097123
47 6272.0 1415.245090 1374.033813
48 6400.0 1410.782205 1388.175035
49 6528.0 1415.796932 1394.208904
50 6656.0 1421.096941 1403.630295
51 6784.0 1416.131251 1412.181764
52 6912.0 1427.941472 1425.406941
53 7040.0 1419.496350 1430.216877
54 7168.0 1429.089905 1433.832681
55 7296.0 1429.787556 1442.966669
56 7424.0 1429.375937 1442.943822
57 7552.0 1427.168501 1455.195697
58 7680.0 1437.135199 1460.596589
59 7808.0 1433.640230 1465.474209
60 7936.0 1432.577856 1468.218077
61 8064.0 1436.870645 1476.143488
62 8192.0 1441.160297 1482.565266
63 8320.0 1389.050740 1401.788111
64 8448.0 1378.888407 1404.382591
65 8576.0 1398.095805 1394.096530
66 8704.0 1388.941082 1402.802675
67 8832.0 1381.057924 1403.322637
68 8960.0 1395.218642 1414.527645
69 9088.0 1410.865783 1416.893112
70 9216.0 1404.596222 1423.388398
71 9344.0 1403.696909 1426.976061
72 9472.0 1398.858388 1434.562163
73 9600.0 1397.357600 1433.067199
74 9728.0 1399.107722 1441.921229
75 9856.0 1416.363642 1438.967318
76 9984.0 1402.643748 1451.618158
77 10112.0 1415.973788 1456.324055
78 10240.0 1420.144736 1466.642455
79 10368.0 1411.548536 1462.925131
80 10496.0 1412.824164 1468.937611
81 10624.0 1410.823497 1466.070334
82 10752.0 1405.901365 1474.315660
83 10880.0 1399.632282 1480.103880
84 11008.0 1418.699072 1478.589424
85 11136.0 1424.249125 1487.229175
86 11264.0 1427.982126 1486.433466
87 11392.0 1411.084768 1490.053428
88 11520.0 1424.872316 1494.723538
89 11648.0 1429.668921 1497.942460
90 11776.0 1430.401550 1500.797227
91 11904.0 1443.839914 1508.386152
92 12032.0 1425.162120 1509.201685
93 12160.0 1417.157747 1513.961434
94 12288.0 1435.210340 1393.452263
95 12416.0 1447.425904 1390.445989
96 12544.0 1441.090369 1394.385753
97 12672.0 1447.677043 1394.404387
0 256.0 479.064298 699.181219
1 384.0 605.935470 803.592870
2 512.0 745.815346 915.545165
3 640.0 784.150333 956.447857
4 768.0 880.320379 1014.509091
5 896.0 925.382153 1061.167478
6 1024.0 991.710669 1106.574890
7 1152.0 1096.039409 610.380558
8 1280.0 1139.520251 665.218284
9 1408.0 1165.391530 720.489335
10 1536.0 1194.943334 778.549520
11 1664.0 1210.181909 815.185632
12 1792.0 1230.968329 855.395778
13 1920.0 1248.428873 908.579062
14 2048.0 1280.924153 959.322337
15 2176.0 1259.385511 977.076350
16 2304.0 1264.254197 1009.942264
17 2432.0 1294.160529 1052.837215
18 2560.0 1307.374009 1080.764688
19 2688.0 1311.829354 1099.929421
20 2816.0 1318.948247 1127.749005
21 2944.0 1323.238625 1167.743718
22 3072.0 1351.373542 1180.095910
23 3200.0 1348.534675 1189.752898
24 3328.0 1360.344504 1220.824659
25 3456.0 1376.349668 1244.102144
26 3584.0 1377.704179 1262.922891
27 3712.0 1381.836169 1270.382626
28 3840.0 1392.447728 1297.064618
29 3968.0 1389.325086 1312.152604
30 4096.0 1398.450475 1323.191964
31 4224.0 1334.797686 1161.265364
32 4352.0 1338.772911 1175.420872
33 4480.0 1352.283250 1184.315205
34 4608.0 1365.124471 1193.417080
35 4736.0 1359.987409 1201.622382
36 4864.0 1377.048560 1223.412167
37 4992.0 1369.089609 1233.684450
38 5120.0 1369.713498 1248.899525
39 5248.0 1375.293807 1259.231379
40 5376.0 1376.540058 1282.852374
41 5504.0 1380.738669 1300.560458
42 5632.0 1389.659411 1316.116838
43 5760.0 1400.169920 1327.324114
44 5888.0 1389.808758 1344.885649
45 6016.0 1399.504754 1353.314020
46 6144.0 1411.262028 1378.023978
47 6272.0 1413.077806 1377.260245
48 6400.0 1414.187022 1385.721052
49 6528.0 1409.389343 1391.148905
50 6656.0 1416.721019 1401.034261
51 6784.0 1416.087697 1413.434651
52 6912.0 1421.294888 1424.131638
53 7040.0 1419.996979 1429.518647
54 7168.0 1424.466575 1435.376778
55 7296.0 1432.715729 1440.741962
56 7424.0 1425.848969 1443.345415
57 7552.0 1422.442528 1455.338142
58 7680.0 1431.808213 1461.805074
59 7808.0 1430.691639 1466.835213
60 7936.0 1439.014282 1471.625334
61 8064.0 1435.965760 1471.378689
62 8192.0 1442.490743 1483.499746
63 8320.0 1388.123848 1403.277206
64 8448.0 1381.538337 1406.434064
65 8576.0 1393.914249 1393.727174
66 8704.0 1392.537296 1401.229052
67 8832.0 1383.956238 1405.738522
68 8960.0 1397.579449 1412.947143
69 9088.0 1409.169292 1415.946044
70 9216.0 1404.468122 1426.164803
71 9344.0 1398.240923 1425.917475
72 9472.0 1398.110606 1431.903192
73 9600.0 1394.432321 1435.590164
74 9728.0 1402.505697 1440.086641
75 9856.0 1416.025327 1442.098940
76 9984.0 1403.290230 1448.423484
77 10112.0 1414.684415 1458.335590
78 10240.0 1419.416146 1467.917585
79 10368.0 1414.798734 1465.477163
80 10496.0 1418.796891 1467.609236
81 10624.0 1410.416625 1467.636756
82 10752.0 1408.517627 1472.373422
83 10880.0 1398.327703 1478.112478
84 11008.0 1420.878183 1476.109043
85 11136.0 1419.894600 1486.756216
86 11264.0 1426.367781 1484.467498
87 11392.0 1414.749863 1490.915219
88 11520.0 1423.156103 1495.031398
89 11648.0 1424.672284 1501.151663
90 11776.0 1432.868258 1502.181127
91 11904.0 1441.271418 1507.206922
92 12032.0 1424.872623 1504.367198
93 12160.0 1418.579120 1510.712381
94 12288.0 1432.830460 1390.538075
95 12416.0 1443.833465 1391.043034
96 12544.0 1442.269264 1393.062024
97 12672.0 1443.375437 1392.331793
@@ -442,7 +442,7 @@ In the above plot, we can see that:

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 23.218 seconds)
**Total running time of the script:** (0 minutes 23.282 seconds)


.. _sphx_glr_download_getting-started_tutorials_02-fused-softmax.py:
@@ -570,77 +570,77 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
matmul-performance-fp16:
M N K cuBLAS Triton
0 256.0 256.0 256.0 4.096000 4.096000
1 384.0 384.0 384.0 12.288000 12.288000
1 384.0 384.0 384.0 11.059200 12.288000
2 512.0 512.0 512.0 26.214401 26.214401
3 640.0 640.0 640.0 42.666665 42.666665
4 768.0 768.0 768.0 63.195428 68.056616
5 896.0 896.0 896.0 78.051553 93.661869
5 896.0 896.0 896.0 78.051553 87.808000
6 1024.0 1024.0 1024.0 110.376426 99.864382
7 1152.0 1152.0 1152.0 135.726544 129.825388
8 1280.0 1280.0 1280.0 157.538463 163.840004
9 1408.0 1408.0 1408.0 155.765024 132.970149
9 1408.0 1408.0 1408.0 151.438217 132.970149
10 1536.0 1536.0 1536.0 176.947204 153.867127
11 1664.0 1664.0 1664.0 179.978245 179.978245
12 1792.0 1792.0 1792.0 172.914215 204.353162
11 1664.0 1664.0 1664.0 183.651271 179.978245
12 1792.0 1792.0 1792.0 172.914215 208.137481
13 1920.0 1920.0 1920.0 200.347822 168.585369
14 2048.0 2048.0 2048.0 226.719125 192.841562
15 2176.0 2176.0 2176.0 211.827867 214.081356
16 2304.0 2304.0 2304.0 234.194828 231.921091
17 2432.0 2432.0 2432.0 205.069087 202.118452
18 2560.0 2560.0 2560.0 225.986210 219.919464
19 2688.0 2688.0 2688.0 199.647657 201.771569
20 2816.0 2816.0 2816.0 216.986107 212.752230
21 2944.0 2944.0 2944.0 221.493479 225.502413
22 3072.0 3072.0 3072.0 208.941345 213.672083
23 3200.0 3200.0 3200.0 216.216207 219.178074
24 3328.0 3328.0 3328.0 209.277023 208.670419
25 3456.0 3456.0 3456.0 214.419058 216.143621
26 3584.0 3584.0 3584.0 218.772251 213.575751
27 3712.0 3712.0 3712.0 211.646909 217.641271
28 3840.0 3840.0 3840.0 210.250955 212.268710
29 3968.0 3968.0 3968.0 211.114084 214.453305
30 4096.0 4096.0 4096.0 220.029067 219.310012
15 2176.0 2176.0 2176.0 211.827867 211.827867
16 2304.0 2304.0 2304.0 231.921091 227.503545
17 2432.0 2432.0 2432.0 205.069087 203.583068
18 2560.0 2560.0 2560.0 227.555548 222.911566
19 2688.0 2688.0 2688.0 200.704002 200.704002
20 2816.0 2816.0 2816.0 212.752230 213.795141
21 2944.0 2944.0 2944.0 222.482283 225.502413
22 3072.0 3072.0 3072.0 209.715208 215.296978
23 3200.0 3200.0 3200.0 216.216207 214.046818
24 3328.0 3328.0 3328.0 211.118166 206.871539
25 3456.0 3456.0 3456.0 219.677297 216.143621
26 3584.0 3584.0 3584.0 221.466479 215.108588
27 3712.0 3712.0 3712.0 212.547541 217.641271
28 3840.0 3840.0 3840.0 210.250955 209.851994
29 3968.0 3968.0 3968.0 212.585252 217.511464
30 4096.0 4096.0 4096.0 221.481394 220.390365
matmul-performance-fp8:
M N K Triton
0 256.0 256.0 256.0 3.276800
1 384.0 384.0 384.0 9.216000
2 512.0 512.0 512.0 18.724571
3 640.0 640.0 640.0 32.000000
4 768.0 768.0 768.0 42.130286
5 896.0 896.0 896.0 61.083825
5 896.0 896.0 896.0 58.538665
6 1024.0 1024.0 1024.0 61.680940
7 1152.0 1152.0 1152.0 80.702267
8 1280.0 1280.0 1280.0 102.400003
9 1408.0 1408.0 1408.0 81.369790
9 1408.0 1408.0 1408.0 82.602666
10 1536.0 1536.0 1536.0 98.303997
11 1664.0 1664.0 1664.0 115.370671
12 1792.0 1792.0 1792.0 133.802668
13 1920.0 1920.0 1920.0 100.173911
13 1920.0 1920.0 1920.0 99.453240
14 2048.0 2048.0 2048.0 114.130722
15 2176.0 2176.0 2176.0 119.783620
16 2304.0 2304.0 2304.0 133.451803
17 2432.0 2432.0 2432.0 133.149115
16 2304.0 2304.0 2304.0 134.959733
17 2432.0 2432.0 2432.0 132.521057
18 2560.0 2560.0 2560.0 146.285712
19 2688.0 2688.0 2688.0 118.171514
20 2816.0 2816.0 2816.0 128.277083
21 2944.0 2944.0 2944.0 140.383190
22 3072.0 3072.0 3072.0 143.713461
23 3200.0 3200.0 3200.0 140.350874
24 3328.0 3328.0 3328.0 131.852184
25 3456.0 3456.0 3456.0 139.002705
26 3584.0 3584.0 3584.0 148.620481
27 3712.0 3712.0 3712.0 141.899635
28 3840.0 3840.0 3840.0 137.895263
19 2688.0 2688.0 2688.0 117.439807
20 2816.0 2816.0 2816.0 128.655484
21 2944.0 2944.0 2944.0 139.988852
22 3072.0 3072.0 3072.0 144.079147
23 3200.0 3200.0 3200.0 139.890713
24 3328.0 3328.0 3328.0 131.731564
25 3456.0 3456.0 3456.0 139.725414
26 3584.0 3584.0 3584.0 148.866543
27 3712.0 3712.0 3712.0 141.698358
28 3840.0 3840.0 3840.0 138.240003
29 3968.0 3968.0 3968.0 147.016795
30 4096.0 4096.0 4096.0 157.347868
30 4096.0 4096.0 4096.0 155.524599
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (2 minutes 18.230 seconds)
**Total running time of the script:** (2 minutes 18.375 seconds)


.. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py:
@@ -242,7 +242,7 @@ References
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 0.713 seconds)
**Total running time of the script:** (0 minutes 0.726 seconds)


.. _sphx_glr_download_getting-started_tutorials_04-low-memory-dropout.py: