[GH-PAGES] Updated website
gh-actions-bot authored and gh-actions-bot committed Aug 29, 2024
1 parent 5f23386 commit 84b77ce
Showing 61 changed files with 431 additions and 431 deletions.
Binary file modified main/.doctrees/environment.pickle
Binary file modified main/.doctrees/getting-started/tutorials/01-vector-add.doctree
Binary file modified main/.doctrees/getting-started/tutorials/05-layer-norm.doctree
Binary file modified main/.doctrees/getting-started/tutorials/08-grouped-gemm.doctree
Binary file modified main/.doctrees/sg_execution_times.doctree
Binary file modified main/_images/sphx_glr_01-vector-add_001.png
Binary file modified main/_images/sphx_glr_01-vector-add_thumb.png
Binary file modified main/_images/sphx_glr_02-fused-softmax_001.png
Binary file modified main/_images/sphx_glr_02-fused-softmax_thumb.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_001.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_002.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_thumb.png
Binary file modified main/_images/sphx_glr_05-layer-norm_001.png
Binary file modified main/_images/sphx_glr_05-layer-norm_thumb.png
Binary file modified main/_images/sphx_glr_06-fused-attention_001.png
Binary file modified main/_images/sphx_glr_06-fused-attention_002.png
Binary file modified main/_images/sphx_glr_06-fused-attention_003.png
Binary file modified main/_images/sphx_glr_06-fused-attention_thumb.png
Binary file modified main/_images/sphx_glr_08-grouped-gemm_001.png
Binary file modified main/_images/sphx_glr_08-grouped-gemm_thumb.png
12 changes: 6 additions & 6 deletions main/_sources/getting-started/tutorials/01-vector-add.rst.txt
@@ -231,7 +231,7 @@ We can now run the decorated function above. Pass `print_data=True` to see the p
 vector-add-performance:
 size Triton Torch
 0 4096.0 8.000000 8.000000
-1 8192.0 15.999999 15.999999
+1 8192.0 15.999999 19.200000
 2 16384.0 31.999999 31.999999
 3 32768.0 63.999998 63.999998
 4 65536.0 127.999995 127.999995
@@ -240,20 +240,20 @@ We can now run the decorated function above. Pass `print_data=True` to see the p
 7 524288.0 614.400016 614.400016
 8 1048576.0 819.200021 819.200021
 9 2097152.0 1023.999964 1023.999964
-10 4194304.0 1228.800031 1228.800031
+10 4194304.0 1260.307736 1228.800031
 11 8388608.0 1424.695621 1424.695621
-12 16777216.0 1560.380965 1548.094408
-13 33554432.0 1624.859540 1624.859540
+12 16777216.0 1560.380965 1560.380965
+13 33554432.0 1631.601649 1624.859540
 14 67108864.0 1669.706983 1662.646960
-15 134217728.0 1684.008546 1678.616907
+15 134217728.0 1684.910539 1678.616907





 .. rst-class:: sphx-glr-timing

-**Total running time of the script:** (0 minutes 6.648 seconds)
+**Total running time of the script:** (0 minutes 15.433 seconds)


 .. _sphx_glr_download_getting-started_tutorials_01-vector-add.py:
198 changes: 99 additions & 99 deletions main/_sources/getting-started/tutorials/02-fused-softmax.rst.txt
@@ -330,104 +330,104 @@ We will then compare its performance against (1) :code:`torch.softmax` and (2) t
 softmax-performance:
 N Triton Torch
-0 256.0 464.655795 685.312292
-1 384.0 605.702723 821.072100
-2 512.0 747.707618 926.412113
-3 640.0 798.172340 962.630300
-4 768.0 876.508198 1026.576804
-5 896.0 930.125937 1068.498561
-6 1024.0 984.752608 1108.268532
-7 1152.0 1104.639764 613.984033
-8 1280.0 1140.776199 669.492207
-9 1408.0 1154.330759 724.925621
-10 1536.0 1195.447876 779.985367
-11 1664.0 1220.658703 810.303068
-12 1792.0 1233.745283 855.638253
-13 1920.0 1255.201568 909.391103
-14 2048.0 1280.518243 958.983648
-15 2176.0 1255.478838 975.220036
-16 2304.0 1268.310058 1010.034576
-17 2432.0 1290.423310 1056.615932
-18 2560.0 1304.765212 1087.097169
-19 2688.0 1304.924053 1099.066257
-20 2816.0 1326.798842 1126.802476
-21 2944.0 1330.228157 1166.245931
-22 3072.0 1353.142890 1185.286723
-23 3200.0 1348.280297 1191.344360
-24 3328.0 1355.415173 1220.085028
-25 3456.0 1370.214785 1247.130303
-26 3584.0 1373.509056 1261.282956
-27 3712.0 1386.068031 1266.326327
-28 3840.0 1391.549258 1302.871958
-29 3968.0 1391.263602 1317.632825
-30 4096.0 1399.991133 1322.843237
-31 4224.0 1329.361886 1156.531376
-32 4352.0 1334.436637 1175.462668
-33 4480.0 1350.531680 1183.570708
-34 4608.0 1361.433956 1191.887430
-35 4736.0 1356.852747 1198.971173
-36 4864.0 1376.699129 1223.181040
-37 4992.0 1369.572146 1232.701133
-38 5120.0 1378.573806 1252.593613
-39 5248.0 1373.915844 1261.423562
-40 5376.0 1380.538368 1286.877917
-41 5504.0 1379.060369 1300.632713
-42 5632.0 1385.747925 1310.938799
-43 5760.0 1396.554380 1326.714476
-44 5888.0 1389.139856 1342.376711
-45 6016.0 1400.993359 1353.291731
-46 6144.0 1411.347975 1376.272838
-47 6272.0 1414.525309 1373.483341
-48 6400.0 1416.756361 1386.317006
-49 6528.0 1411.133757 1391.849890
-50 6656.0 1418.385785 1400.297993
-51 6784.0 1414.791997 1411.382678
-52 6912.0 1424.776817 1422.190154
-53 7040.0 1417.909435 1433.007877
-54 7168.0 1426.588354 1435.670801
-55 7296.0 1429.429985 1441.614363
-56 7424.0 1431.539141 1448.093140
-57 7552.0 1427.803116 1455.303708
-58 7680.0 1436.570668 1461.593802
-59 7808.0 1434.912741 1466.864913
-60 7936.0 1435.689695 1468.794046
-61 8064.0 1438.188879 1474.733206
-62 8192.0 1440.995216 1483.853260
-63 8320.0 1389.594804 1400.485786
-64 8448.0 1386.133128 1403.946489
-65 8576.0 1394.156614 1398.044087
-66 8704.0 1389.456917 1398.258444
-67 8832.0 1381.047318 1401.787946
-68 8960.0 1396.741491 1412.905399
-69 9088.0 1409.379478 1417.191329
-70 9216.0 1402.603693 1424.897461
-71 9344.0 1401.320220 1425.675380
-72 9472.0 1403.538576 1435.772821
-73 9600.0 1395.917472 1434.258920
-74 9728.0 1399.851602 1439.266859
-75 9856.0 1412.436358 1442.761130
-76 9984.0 1397.380834 1453.632161
-77 10112.0 1413.054738 1454.209543
-78 10240.0 1425.068715 1465.606832
-79 10368.0 1412.098877 1464.477670
-80 10496.0 1414.101090 1465.726064
-81 10624.0 1410.607725 1467.271652
-82 10752.0 1407.419452 1473.544578
-83 10880.0 1397.054268 1479.765719
-84 11008.0 1419.845086 1481.436729
-85 11136.0 1422.197781 1483.980516
-86 11264.0 1431.691513 1487.313416
-87 11392.0 1416.221814 1487.536557
-88 11520.0 1422.466622 1493.559930
-89 11648.0 1423.752892 1499.300630
-90 11776.0 1431.223422 1501.024800
-91 11904.0 1441.050847 1506.347763
-92 12032.0 1426.524611 1509.849148
-93 12160.0 1416.061152 1512.997219
-94 12288.0 1438.516825 1392.627108
-95 12416.0 1449.973613 1390.809057
-96 12544.0 1440.489984 1393.973958
-97 12672.0 1449.014134 1391.192365
+0 256.0 481.122554 705.606305
+1 384.0 610.434702 819.823114
+2 512.0 761.656911 924.209827
+3 640.0 799.226362 944.360090
+4 768.0 867.047611 1027.173003
+5 896.0 936.960598 1058.824580
+6 1024.0 985.584033 1108.840124
+7 1152.0 1107.393637 613.634046
+8 1280.0 1146.930717 669.100706
+9 1408.0 1162.115329 720.621389
+10 1536.0 1185.177342 778.853902
+11 1664.0 1217.205082 811.707348
+12 1792.0 1241.317424 858.977076
+13 1920.0 1248.378487 908.629411
+14 2048.0 1278.451899 953.703798
+15 2176.0 1255.573137 977.157558
+16 2304.0 1268.279024 1008.176684
+17 2432.0 1289.185340 1057.863892
+18 2560.0 1308.571366 1084.184708
+19 2688.0 1307.591343 1099.708516
+20 2816.0 1319.874972 1130.874624
+21 2944.0 1320.138954 1167.428064
+22 3072.0 1351.664889 1185.122139
+23 3200.0 1357.768931 1191.520567
+24 3328.0 1353.422267 1220.980780
+25 3456.0 1367.305367 1245.605859
+26 3584.0 1379.735215 1258.157013
+27 3712.0 1383.741181 1270.613298
+28 3840.0 1386.899622 1302.070677
+29 3968.0 1391.527406 1316.700118
+30 4096.0 1399.801411 1325.898718
+31 4224.0 1330.959459 1160.888959
+32 4352.0 1335.577572 1172.599223
+33 4480.0 1352.962036 1182.609123
+34 4608.0 1362.112363 1193.610373
+35 4736.0 1356.598590 1197.166243
+36 4864.0 1375.028691 1222.774961
+37 4992.0 1366.109616 1239.348747
+38 5120.0 1372.128671 1250.058464
+39 5248.0 1375.849884 1258.884814
+40 5376.0 1374.978428 1286.796113
+41 5504.0 1377.846198 1297.942346
+42 5632.0 1384.413443 1312.716118
+43 5760.0 1391.278564 1325.685551
+44 5888.0 1394.194442 1340.336636
+45 6016.0 1397.978091 1353.805381
+46 6144.0 1406.785353 1372.097123
+47 6272.0 1415.245090 1374.033813
+48 6400.0 1410.782205 1388.175035
+49 6528.0 1415.796932 1394.208904
+50 6656.0 1421.096941 1403.630295
+51 6784.0 1416.131251 1412.181764
+52 6912.0 1427.941472 1425.406941
+53 7040.0 1419.496350 1430.216877
+54 7168.0 1429.089905 1433.832681
+55 7296.0 1429.787556 1442.966669
+56 7424.0 1429.375937 1442.943822
+57 7552.0 1427.168501 1455.195697
+58 7680.0 1437.135199 1460.596589
+59 7808.0 1433.640230 1465.474209
+60 7936.0 1432.577856 1468.218077
+61 8064.0 1436.870645 1476.143488
+62 8192.0 1441.160297 1482.565266
+63 8320.0 1389.050740 1401.788111
+64 8448.0 1378.888407 1404.382591
+65 8576.0 1398.095805 1394.096530
+66 8704.0 1388.941082 1402.802675
+67 8832.0 1381.057924 1403.322637
+68 8960.0 1395.218642 1414.527645
+69 9088.0 1410.865783 1416.893112
+70 9216.0 1404.596222 1423.388398
+71 9344.0 1403.696909 1426.976061
+72 9472.0 1398.858388 1434.562163
+73 9600.0 1397.357600 1433.067199
+74 9728.0 1399.107722 1441.921229
+75 9856.0 1416.363642 1438.967318
+76 9984.0 1402.643748 1451.618158
+77 10112.0 1415.973788 1456.324055
+78 10240.0 1420.144736 1466.642455
+79 10368.0 1411.548536 1462.925131
+80 10496.0 1412.824164 1468.937611
+81 10624.0 1410.823497 1466.070334
+82 10752.0 1405.901365 1474.315660
+83 10880.0 1399.632282 1480.103880
+84 11008.0 1418.699072 1478.589424
+85 11136.0 1424.249125 1487.229175
+86 11264.0 1427.982126 1486.433466
+87 11392.0 1411.084768 1490.053428
+88 11520.0 1424.872316 1494.723538
+89 11648.0 1429.668921 1497.942460
+90 11776.0 1430.401550 1500.797227
+91 11904.0 1443.839914 1508.386152
+92 12032.0 1425.162120 1509.201685
+93 12160.0 1417.157747 1513.961434
+94 12288.0 1435.210340 1393.452263
+95 12416.0 1447.425904 1390.445989
+96 12544.0 1441.090369 1394.385753
+97 12672.0 1447.677043 1394.404387
@@ -442,7 +442,7 @@ In the above plot, we can see that:

 .. rst-class:: sphx-glr-timing

-**Total running time of the script:** (0 minutes 25.076 seconds)
+**Total running time of the script:** (0 minutes 23.218 seconds)


 .. _sphx_glr_download_getting-started_tutorials_02-fused-softmax.py:
@@ -570,77 +570,77 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
 matmul-performance-fp16:
 M N K cuBLAS Triton
 0 256.0 256.0 256.0 4.096000 4.096000
-1 384.0 384.0 384.0 11.059200 12.288000
+1 384.0 384.0 384.0 12.288000 12.288000
 2 512.0 512.0 512.0 26.214401 26.214401
 3 640.0 640.0 640.0 42.666665 42.666665
 4 768.0 768.0 768.0 63.195428 68.056616
-5 896.0 896.0 896.0 78.051553 87.808000
-6 1024.0 1024.0 1024.0 104.857603 99.864382
+5 896.0 896.0 896.0 78.051553 93.661869
+6 1024.0 1024.0 1024.0 110.376426 99.864382
 7 1152.0 1152.0 1152.0 135.726544 129.825388
 8 1280.0 1280.0 1280.0 157.538463 163.840004
 9 1408.0 1408.0 1408.0 155.765024 132.970149
-10 1536.0 1536.0 1536.0 176.947204 157.286398
+10 1536.0 1536.0 1536.0 176.947204 153.867127
 11 1664.0 1664.0 1664.0 179.978245 179.978245
-12 1792.0 1792.0 1792.0 172.914215 208.137481
+12 1792.0 1792.0 1792.0 172.914215 204.353162
 13 1920.0 1920.0 1920.0 200.347822 168.585369
-14 2048.0 2048.0 2048.0 226.719125 190.650180
+14 2048.0 2048.0 2048.0 226.719125 192.841562
 15 2176.0 2176.0 2176.0 211.827867 214.081356
-16 2304.0 2304.0 2304.0 229.691080 228.592087
-17 2432.0 2432.0 2432.0 206.576938 203.583068
-18 2560.0 2560.0 2560.0 224.438347 222.911566
-19 2688.0 2688.0 2688.0 200.704002 199.647657
-20 2816.0 2816.0 2816.0 214.848312 214.848312
-21 2944.0 2944.0 2944.0 221.493479 222.482283
+16 2304.0 2304.0 2304.0 234.194828 231.921091
+17 2432.0 2432.0 2432.0 205.069087 202.118452
+18 2560.0 2560.0 2560.0 225.986210 219.919464
+19 2688.0 2688.0 2688.0 199.647657 201.771569
+20 2816.0 2816.0 2816.0 216.986107 212.752230
+21 2944.0 2944.0 2944.0 221.493479 225.502413
 22 3072.0 3072.0 3072.0 208.941345 213.672083
 23 3200.0 3200.0 3200.0 216.216207 219.178074
-24 3328.0 3328.0 3328.0 208.067338 207.467716
-25 3456.0 3456.0 3456.0 221.487820 219.080343
-26 3584.0 3584.0 3584.0 216.142772 213.069643
-27 3712.0 3712.0 3712.0 213.000737 217.641271
-28 3840.0 3840.0 3840.0 210.250955 211.456969
-29 3968.0 3968.0 3968.0 211.114084 214.830867
-30 4096.0 4096.0 4096.0 222.214781 220.390365
+24 3328.0 3328.0 3328.0 209.277023 208.670419
+25 3456.0 3456.0 3456.0 214.419058 216.143621
+26 3584.0 3584.0 3584.0 218.772251 213.575751
+27 3712.0 3712.0 3712.0 211.646909 217.641271
+28 3840.0 3840.0 3840.0 210.250955 212.268710
+29 3968.0 3968.0 3968.0 211.114084 214.453305
+30 4096.0 4096.0 4096.0 220.029067 219.310012
 matmul-performance-fp8:
 M N K Triton
 0 256.0 256.0 256.0 3.276800
 1 384.0 384.0 384.0 9.216000
 2 512.0 512.0 512.0 18.724571
 3 640.0 640.0 640.0 32.000000
 4 768.0 768.0 768.0 42.130286
-5 896.0 896.0 896.0 58.538665
+5 896.0 896.0 896.0 61.083825
 6 1024.0 1024.0 1024.0 61.680940
 7 1152.0 1152.0 1152.0 80.702267
-8 1280.0 1280.0 1280.0 99.902441
+8 1280.0 1280.0 1280.0 102.400003
 9 1408.0 1408.0 1408.0 81.369790
 10 1536.0 1536.0 1536.0 98.303997
 11 1664.0 1664.0 1664.0 115.370671
 12 1792.0 1792.0 1792.0 133.802668
-13 1920.0 1920.0 1920.0 99.453240
+13 1920.0 1920.0 1920.0 100.173911
 14 2048.0 2048.0 2048.0 114.130722
-15 2176.0 2176.0 2176.0 120.500882
-16 2304.0 2304.0 2304.0 134.201527
-17 2432.0 2432.0 2432.0 132.521057
-18 2560.0 2560.0 2560.0 145.635558
-19 2688.0 2688.0 2688.0 117.439807
-20 2816.0 2816.0 2816.0 128.655484
-21 2944.0 2944.0 2944.0 139.596724
-22 3072.0 3072.0 3072.0 144.079147
-23 3200.0 3200.0 3200.0 140.043768
-24 3328.0 3328.0 3328.0 132.215416
+15 2176.0 2176.0 2176.0 119.783620
+16 2304.0 2304.0 2304.0 133.451803
+17 2432.0 2432.0 2432.0 133.149115
+18 2560.0 2560.0 2560.0 146.285712
+19 2688.0 2688.0 2688.0 118.171514
+20 2816.0 2816.0 2816.0 128.277083
+21 2944.0 2944.0 2944.0 140.383190
+22 3072.0 3072.0 3072.0 143.713461
+23 3200.0 3200.0 3200.0 140.350874
+24 3328.0 3328.0 3328.0 131.852184
 25 3456.0 3456.0 3456.0 139.002705
-26 3584.0 3584.0 3584.0 148.866543
-27 3712.0 3712.0 3712.0 141.698358
-28 3840.0 3840.0 3840.0 138.240003
+26 3584.0 3584.0 3584.0 148.620481
+27 3712.0 3712.0 3712.0 141.899635
+28 3840.0 3840.0 3840.0 137.895263
 29 3968.0 3968.0 3968.0 147.016795
-30 4096.0 4096.0 4096.0 154.985826
+30 4096.0 4096.0 4096.0 157.347868
 .. rst-class:: sphx-glr-timing

-**Total running time of the script:** (2 minutes 17.837 seconds)
+**Total running time of the script:** (2 minutes 18.230 seconds)


 .. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py:
@@ -242,7 +242,7 @@ References
 .. rst-class:: sphx-glr-timing

-**Total running time of the script:** (0 minutes 0.716 seconds)
+**Total running time of the script:** (0 minutes 0.713 seconds)


 .. _sphx_glr_download_getting-started_tutorials_04-low-memory-dropout.py:
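
Most of the changes in this commit are shifts in regenerated benchmark numbers. One rough way to judge whether such a shift looks like run-to-run noise is to compute the relative change between the old and new value. The following is a minimal sketch and not part of the commit: `relative_change` is a hypothetical helper name, and the sample values are copied from three changed rows of the vector-add hunk.

```python
def relative_change(old: float, new: float) -> float:
    """Return the signed change from old to new as a percentage of old."""
    if old == 0:
        raise ValueError("old value must be non-zero")
    return (new - old) / old * 100.0


# (label, old value, new value), copied from the vector-add hunk in this commit.
# The labels are illustrative; the commit page does not state the units.
changed = [
    ("size=8192 Torch", 15.999999, 19.200000),
    ("size=4194304 Triton", 1228.800031, 1260.307736),
    ("size=16777216 Torch", 1548.094408, 1560.380965),
]

for label, old, new in changed:
    pct = relative_change(old, new)
    print(f"{label}: {old:.3f} -> {new:.3f} ({pct:+.2f}%)")
```

The same helper could be applied to any old/new pair in the softmax or matmul hunks. What counts as a meaningful change is left to the reader, since the commit gives no noise threshold.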