Skip to content

Commit

Permalink
[GH-PAGES] Updated website
Browse files Browse the repository at this point in the history
  • Loading branch information
gh-actions-bot authored and gh-actions-bot committed Sep 9, 2024
1 parent 58f6d9a commit 8e5600a
Show file tree
Hide file tree
Showing 61 changed files with 405 additions and 405 deletions.
Binary file modified main/.doctrees/environment.pickle
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/01-vector-add.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/05-layer-norm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/08-grouped-gemm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/sg_execution_times.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/_images/sphx_glr_01-vector-add_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_01-vector-add_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_02-fused-softmax_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_02-fused-softmax_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_002.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_05-layer-norm_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_05-layer-norm_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_002.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_003.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_08-grouped-gemm_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_08-grouped-gemm_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
10 changes: 5 additions & 5 deletions main/_sources/getting-started/tutorials/01-vector-add.rst.txt
Original file line number Diff line number Diff line change
Expand Up @@ -232,28 +232,28 @@ We can now run the decorated function above. Pass `print_data=True` to see the p
size Triton Torch
0 4096.0 8.000000 8.000000
1 8192.0 15.999999 15.999999
2 16384.0 38.400001 31.999999
2 16384.0 31.999999 31.999999
3 32768.0 63.999998 63.999998
4 65536.0 127.999995 127.999995
5 131072.0 219.428568 219.428568
6 262144.0 384.000001 384.000001
7 524288.0 614.400016 614.400016
8 1048576.0 819.200021 819.200021
9 2097152.0 1023.999964 1023.999964
10 4194304.0 1260.307736 1228.800031
9 2097152.0 1068.521715 1023.999964
10 4194304.0 1228.800031 1260.307736
11 8388608.0 1424.695621 1424.695621
12 16777216.0 1560.380965 1560.380965
13 33554432.0 1631.601649 1624.859540
14 67108864.0 1669.706983 1662.646960
15 134217728.0 1685.813499 1678.616907
15 134217728.0 1684.008546 1680.410210





.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 6.655 seconds)
**Total running time of the script:** (0 minutes 8.085 seconds)


.. _sphx_glr_download_getting-started_tutorials_01-vector-add.py:
Expand Down
198 changes: 99 additions & 99 deletions main/_sources/getting-started/tutorials/02-fused-softmax.rst.txt
Original file line number Diff line number Diff line change
Expand Up @@ -330,104 +330,104 @@ We will then compare its performance against (1) :code:`torch.softmax` and (2) t
softmax-performance:
N Triton Torch
0 256.0 483.504849 705.858619
1 384.0 609.909256 819.825104
2 512.0 752.501069 924.816481
3 640.0 797.672563 963.880055
4 768.0 881.726191 1018.420477
5 896.0 929.768442 1064.831875
6 1024.0 999.494799 1122.001862
7 1152.0 1102.558978 614.441742
8 1280.0 1142.090409 669.986852
9 1408.0 1168.353519 725.768367
10 1536.0 1193.606858 780.036253
11 1664.0 1211.625586 816.613371
12 1792.0 1234.005969 857.585530
13 1920.0 1257.909453 907.932932
14 2048.0 1279.372923 953.199062
15 2176.0 1256.866836 976.941599
16 2304.0 1272.111609 1007.991351
17 2432.0 1293.568044 1053.980286
18 2560.0 1302.443672 1084.044097
19 2688.0 1310.543069 1100.064422
20 2816.0 1329.550121 1132.845900
21 2944.0 1325.106103 1165.101730
22 3072.0 1354.263898 1181.937432
23 3200.0 1356.768765 1196.734630
24 3328.0 1361.569081 1223.640447
25 3456.0 1372.260392 1251.372527
26 3584.0 1377.824967 1263.990813
27 3712.0 1388.452077 1269.160168
28 3840.0 1390.195771 1300.371636
29 3968.0 1392.128272 1316.642004
30 4096.0 1401.440985 1325.993289
31 4224.0 1335.750716 1161.324964
32 4352.0 1341.910848 1174.716639
33 4480.0 1357.279725 1184.106737
34 4608.0 1363.505422 1194.268459
35 4736.0 1355.972089 1202.458558
36 4864.0 1376.545694 1221.469200
37 4992.0 1371.193311 1233.980250
38 5120.0 1372.928619 1249.661093
39 5248.0 1376.524535 1256.566657
40 5376.0 1380.404894 1286.955387
41 5504.0 1376.185871 1296.784498
42 5632.0 1387.308569 1316.781373
43 5760.0 1394.788007 1325.776437
44 5888.0 1393.254701 1340.097197
45 6016.0 1397.522942 1351.778948
46 6144.0 1407.036902 1372.097123
47 6272.0 1415.862255 1374.950602
48 6400.0 1412.979892 1391.500644
49 6528.0 1413.225499 1392.287672
50 6656.0 1419.501659 1401.350961
51 6784.0 1414.210591 1417.375134
52 6912.0 1427.230434 1425.416165
53 7040.0 1415.621263 1433.818007
54 7168.0 1427.284870 1435.603462
55 7296.0 1433.278373 1442.232505
56 7424.0 1429.853412 1448.062699
57 7552.0 1430.825507 1455.662158
58 7680.0 1436.538042 1462.944463
59 7808.0 1432.563937 1465.515737
60 7936.0 1436.234983 1468.974307
61 8064.0 1438.197861 1472.449191
62 8192.0 1438.820876 1484.991319
63 8320.0 1388.828363 1400.923094
64 8448.0 1382.080815 1404.912243
65 8576.0 1397.828756 1398.077460
66 8704.0 1389.103978 1401.588898
67 8832.0 1388.464599 1406.114984
68 8960.0 1394.242119 1411.884544
69 9088.0 1406.154896 1416.931387
70 9216.0 1401.332416 1424.186015
71 9344.0 1399.101498 1425.765896
72 9472.0 1397.400184 1437.613232
73 9600.0 1397.414486 1429.620929
74 9728.0 1404.383086 1443.278039
75 9856.0 1414.338572 1442.781395
76 9984.0 1400.183684 1449.253765
77 10112.0 1413.006160 1455.536657
78 10240.0 1420.935224 1468.628917
79 10368.0 1412.594682 1463.525008
80 10496.0 1412.404248 1466.541052
81 10624.0 1411.253194 1469.755374
82 10752.0 1403.887393 1470.905480
83 10880.0 1401.058090 1481.231802
84 11008.0 1418.032552 1476.656156
85 11136.0 1423.022405 1483.522921
86 11264.0 1426.833981 1487.372540
87 11392.0 1415.976223 1489.733547
88 11520.0 1420.858834 1493.378907
89 11648.0 1428.996384 1496.158752
90 11776.0 1429.987493 1501.891181
91 11904.0 1441.947990 1504.710219
92 12032.0 1420.270029 1508.437638
93 12160.0 1420.123557 1509.548090
94 12288.0 1434.772004 1390.825795
95 12416.0 1447.592587 1389.969032
96 12544.0 1443.655266 1391.978887
97 12672.0 1449.629836 1395.389754
0 256.0 483.648575 704.279422
1 384.0 611.207871 811.310482
2 512.0 760.029946 930.244010
3 640.0 789.112633 962.289683
4 768.0 885.260387 1014.476488
5 896.0 930.769527 1075.646692
6 1024.0 1000.412757 1115.599438
7 1152.0 1109.040519 610.401148
8 1280.0 1149.123110 671.226388
9 1408.0 1154.278795 720.621389
10 1536.0 1193.883514 778.498289
11 1664.0 1210.021635 814.654418
12 1792.0 1242.793808 859.324356
13 1920.0 1254.580283 908.801326
14 2048.0 1275.278633 959.301903
15 2176.0 1263.968962 973.449375
16 2304.0 1266.587365 1008.539223
17 2432.0 1298.087764 1057.024314
18 2560.0 1298.588250 1088.941346
19 2688.0 1318.063968 1100.100149
20 2816.0 1327.522874 1131.238323
21 2944.0 1323.948838 1168.479819
22 3072.0 1354.041176 1183.863854
23 3200.0 1358.514156 1194.711058
24 3328.0 1354.976383 1225.274576
25 3456.0 1371.850435 1244.211621
26 3584.0 1372.984323 1256.138600
27 3712.0 1387.329527 1274.311099
28 3840.0 1388.664938 1303.713374
29 3968.0 1393.204422 1313.355077
30 4096.0 1403.886883 1325.316368
31 4224.0 1337.103991 1158.609778
32 4352.0 1335.716353 1173.812661
33 4480.0 1356.217955 1183.574263
34 4608.0 1362.450812 1193.809073
35 4736.0 1357.293947 1201.733861
36 4864.0 1379.159459 1221.861291
37 4992.0 1373.826137 1234.229426
38 5120.0 1377.862279 1249.951049
39 5248.0 1377.333230 1257.339477
40 5376.0 1379.883175 1285.170192
41 5504.0 1375.907088 1294.483712
42 5632.0 1384.322144 1315.326958
43 5760.0 1389.700975 1323.655303
44 5888.0 1388.358894 1340.965603
45 6016.0 1396.791808 1357.890048
46 6144.0 1407.352454 1376.836366
47 6272.0 1416.510509 1373.925271
48 6400.0 1417.547916 1390.114938
49 6528.0 1412.386919 1394.558616
50 6656.0 1429.171955 1404.118948
51 6784.0 1414.168228 1414.152945
52 6912.0 1428.677389 1423.545837
53 7040.0 1419.135908 1431.027028
54 7168.0 1428.190246 1432.573337
55 7296.0 1430.990750 1442.788896
56 7424.0 1432.261790 1448.247037
57 7552.0 1429.992641 1455.168665
58 7680.0 1434.263134 1461.397513
59 7808.0 1434.109985 1464.693503
60 7936.0 1438.390409 1469.118869
61 8064.0 1441.079679 1476.328321
62 8192.0 1439.713319 1484.101664
63 8320.0 1389.698454 1401.500977
64 8448.0 1379.724473 1404.643597
65 8576.0 1397.012595 1396.208756
66 8704.0 1390.381255 1402.374490
67 8832.0 1381.410375 1402.728397
68 8960.0 1399.763867 1412.439691
69 9088.0 1409.690151 1417.513298
70 9216.0 1401.348792 1425.147763
71 9344.0 1398.078238 1423.023168
72 9472.0 1397.977452 1433.206099
73 9600.0 1393.409483 1431.106833
74 9728.0 1400.878586 1441.134502
75 9856.0 1411.871327 1441.104112
76 9984.0 1399.378512 1453.973620
77 10112.0 1413.104126 1455.984016
78 10240.0 1424.171473 1469.898677
79 10368.0 1412.048415 1465.451919
80 10496.0 1414.760017 1469.418293
81 10624.0 1411.783049 1466.482982
82 10752.0 1408.101565 1473.308784
83 10880.0 1402.392760 1483.867742
84 11008.0 1420.196950 1476.739754
85 11136.0 1422.397196 1485.601988
86 11264.0 1428.979758 1487.306093
87 11392.0 1414.354964 1489.877609
88 11520.0 1423.180249 1496.093741
89 11648.0 1428.865810 1497.329329
90 11776.0 1431.690275 1499.983541
91 11904.0 1442.327917 1506.970568
92 12032.0 1423.804221 1507.745673
93 12160.0 1421.960002 1512.223550
94 12288.0 1436.623032 1393.599156
95 12416.0 1447.158770 1390.654769
96 12544.0 1442.963517 1393.886753
97 12672.0 1445.889735 1394.097025
Expand All @@ -442,7 +442,7 @@ In the above plot, we can see that:

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 23.233 seconds)
**Total running time of the script:** (0 minutes 23.220 seconds)


.. _sphx_glr_download_getting-started_tutorials_02-fused-softmax.py:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -575,31 +575,31 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
3 640.0 640.0 640.0 42.666665 42.666665
4 768.0 768.0 768.0 63.195428 68.056616
5 896.0 896.0 896.0 78.051553 93.661869
6 1024.0 1024.0 1024.0 104.857603 104.857603
6 1024.0 1024.0 1024.0 110.376426 104.857603
7 1152.0 1152.0 1152.0 135.726544 129.825388
8 1280.0 1280.0 1280.0 157.538463 163.840004
9 1408.0 1408.0 1408.0 155.765024 132.970149
10 1536.0 1536.0 1536.0 176.947204 153.867127
11 1664.0 1664.0 1664.0 183.651271 179.978245
10 1536.0 1536.0 1536.0 176.947204 157.286398
11 1664.0 1664.0 1664.0 179.978245 179.978245
12 1792.0 1792.0 1792.0 172.914215 208.137481
13 1920.0 1920.0 1920.0 200.347822 166.554219
13 1920.0 1920.0 1920.0 200.347822 168.585369
14 2048.0 2048.0 2048.0 226.719125 192.841562
15 2176.0 2176.0 2176.0 211.827867 209.621326
16 2304.0 2304.0 2304.0 229.691080 231.921091
17 2432.0 2432.0 2432.0 205.069087 202.118452
18 2560.0 2560.0 2560.0 224.438347 218.453323
19 2688.0 2688.0 2688.0 200.704002 198.602388
20 2816.0 2816.0 2816.0 212.752230 207.686706
21 2944.0 2944.0 2944.0 220.513412 222.482283
22 3072.0 3072.0 3072.0 210.494802 212.868821
23 3200.0 3200.0 3200.0 218.430042 219.178074
24 3328.0 3328.0 3328.0 209.887165 209.887165
25 3456.0 3456.0 3456.0 220.880999 218.486642
26 3584.0 3584.0 3584.0 218.772251 215.108588
27 3712.0 3712.0 3712.0 208.990259 214.833002
15 2176.0 2176.0 2176.0 211.827867 211.827867
16 2304.0 2304.0 2304.0 229.691080 227.503545
17 2432.0 2432.0 2432.0 206.576938 203.583068
18 2560.0 2560.0 2560.0 222.911566 219.919464
19 2688.0 2688.0 2688.0 198.602388 199.647657
20 2816.0 2816.0 2816.0 212.752230 212.752230
21 2944.0 2944.0 2944.0 221.493479 223.479969
22 3072.0 3072.0 3072.0 208.941345 211.280236
23 3200.0 3200.0 3200.0 216.949149 221.453296
24 3328.0 3328.0 3328.0 207.467716 211.118166
25 3456.0 3456.0 3456.0 219.677297 219.080343
26 3584.0 3584.0 3584.0 216.142772 215.624440
27 3712.0 3712.0 3712.0 211.646909 214.833002
28 3840.0 3840.0 3840.0 210.250955 209.851994
29 3968.0 3968.0 3968.0 208.945088 217.899880
30 4096.0 4096.0 4096.0 220.029067 220.029067
29 3968.0 3968.0 3968.0 211.114084 219.467517
30 4096.0 4096.0 4096.0 217.180793 220.029067
matmul-performance-fp8:
M N K Triton
0 256.0 256.0 256.0 3.276800
Expand All @@ -610,7 +610,7 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
5 896.0 896.0 896.0 58.538665
6 1024.0 1024.0 1024.0 61.680940
7 1152.0 1152.0 1152.0 80.702267
8 1280.0 1280.0 1280.0 102.400003
8 1280.0 1280.0 1280.0 99.902441
9 1408.0 1408.0 1408.0 81.369790
10 1536.0 1536.0 1536.0 98.303997
11 1664.0 1664.0 1664.0 115.370671
Expand All @@ -620,27 +620,27 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
15 2176.0 2176.0 2176.0 120.500882
16 2304.0 2304.0 2304.0 134.959733
17 2432.0 2432.0 2432.0 132.521057
18 2560.0 2560.0 2560.0 145.635558
19 2688.0 2688.0 2688.0 118.171514
18 2560.0 2560.0 2560.0 145.959916
19 2688.0 2688.0 2688.0 117.077336
20 2816.0 2816.0 2816.0 128.655484
21 2944.0 2944.0 2944.0 138.819031
22 3072.0 3072.0 3072.0 144.079147
23 3200.0 3200.0 3200.0 139.433550
24 3328.0 3328.0 3328.0 130.893266
25 3456.0 3456.0 3456.0 138.287420
26 3584.0 3584.0 3584.0 149.113421
21 2944.0 2944.0 2944.0 139.988852
22 3072.0 3072.0 3072.0 143.713461
23 3200.0 3200.0 3200.0 139.737993
24 3328.0 3328.0 3328.0 132.336939
25 3456.0 3456.0 3456.0 139.725414
26 3584.0 3584.0 3584.0 148.620481
27 3712.0 3712.0 3712.0 142.303911
28 3840.0 3840.0 3840.0 137.723536
29 3968.0 3968.0 3968.0 147.194128
30 4096.0 4096.0 4096.0 154.807064
28 3840.0 3840.0 3840.0 137.895263
29 3968.0 3968.0 3968.0 147.550102
30 4096.0 4096.0 4096.0 155.165002
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (2 minutes 17.240 seconds)
**Total running time of the script:** (2 minutes 17.681 seconds)


.. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -244,7 +244,7 @@ References
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 0.709 seconds)
**Total running time of the script:** (0 minutes 0.695 seconds)


.. _sphx_glr_download_getting-started_tutorials_04-low-memory-dropout.py:
Expand Down
Loading

0 comments on commit 8e5600a

Please sign in to comment.