Skip to content

Commit

Permalink
[GH-PAGES] Updated website
Browse files Browse the repository at this point in the history
  • Loading branch information
gh-actions-bot authored and gh-actions-bot committed Sep 1, 2024
1 parent 64e4f4f commit 78edcd0
Show file tree
Hide file tree
Showing 61 changed files with 415 additions and 415 deletions.
Binary file modified main/.doctrees/environment.pickle
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/01-vector-add.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/05-layer-norm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/08-grouped-gemm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/sg_execution_times.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/_images/sphx_glr_01-vector-add_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_01-vector-add_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_02-fused-softmax_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_02-fused-softmax_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_002.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_05-layer-norm_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_05-layer-norm_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_002.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_003.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_06-fused-attention_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_08-grouped-gemm_001.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file modified main/_images/sphx_glr_08-grouped-gemm_thumb.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
6 changes: 3 additions & 3 deletions main/_sources/getting-started/tutorials/01-vector-add.rst.txt
Original file line number Diff line number Diff line change
Expand Up @@ -231,7 +231,7 @@ We can now run the decorated function above. Pass `print_data=True` to see the p
vector-add-performance:
size Triton Torch
0 4096.0 8.000000 8.000000
1 8192.0 19.200000 19.200000
1 8192.0 15.999999 19.200000
2 16384.0 31.999999 31.999999
3 32768.0 63.999998 63.999998
4 65536.0 127.999995 127.999995
Expand All @@ -243,7 +243,7 @@ We can now run the decorated function above. Pass `print_data=True` to see the p
10 4194304.0 1228.800031 1228.800031
11 8388608.0 1424.695621 1424.695621
12 16777216.0 1560.380965 1560.380965
13 33554432.0 1631.601649 1624.859540
13 33554432.0 1624.859540 1624.859540
14 67108864.0 1669.706983 1662.646960
15 134217728.0 1684.008546 1678.616907

Expand All @@ -253,7 +253,7 @@ We can now run the decorated function above. Pass `print_data=True` to see the p

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 7.322 seconds)
**Total running time of the script:** (0 minutes 10.546 seconds)


.. _sphx_glr_download_getting-started_tutorials_01-vector-add.py:
Expand Down
198 changes: 99 additions & 99 deletions main/_sources/getting-started/tutorials/02-fused-softmax.rst.txt
Original file line number Diff line number Diff line change
Expand Up @@ -330,104 +330,104 @@ We will then compare its performance against (1) :code:`torch.softmax` and (2) t
softmax-performance:
N Triton Torch
0 256.0 475.850564 695.725408
1 384.0 610.493178 821.522925
2 512.0 750.306068 937.346360
3 640.0 796.733123 960.457050
4 768.0 881.664050 1016.465839
5 896.0 936.324438 1057.657645
6 1024.0 992.861548 1110.077966
7 1152.0 1108.676130 614.233540
8 1280.0 1150.692968 669.514660
9 1408.0 1158.582790 725.746252
10 1536.0 1193.672029 784.340342
11 1664.0 1213.072712 814.439099
12 1792.0 1234.857564 856.123243
13 1920.0 1249.351308 905.956888
14 2048.0 1280.858942 961.046855
15 2176.0 1267.142379 977.060489
16 2304.0 1267.458788 1012.050520
17 2432.0 1302.253867 1057.918195
18 2560.0 1302.488874 1087.968537
19 2688.0 1310.649860 1100.460099
20 2816.0 1324.307118 1126.838051
21 2944.0 1321.761247 1168.429206
22 3072.0 1352.406328 1184.729527
23 3200.0 1349.226461 1197.241110
24 3328.0 1357.948811 1223.987640
25 3456.0 1377.812763 1252.705845
26 3584.0 1371.912656 1263.141392
27 3712.0 1384.016855 1271.378585
28 3840.0 1385.514922 1299.949745
29 3968.0 1391.498780 1317.571943
30 4096.0 1398.962202 1329.606903
31 4224.0 1336.980914 1157.652202
32 4352.0 1335.278951 1178.965055
33 4480.0 1358.394009 1184.344606
34 4608.0 1365.774069 1193.142783
35 4736.0 1355.840221 1195.081975
36 4864.0 1375.647740 1223.188034
37 4992.0 1375.251908 1235.134019
38 5120.0 1375.973954 1250.629710
39 5248.0 1371.192296 1256.835838
40 5376.0 1374.006961 1288.219067
41 5504.0 1377.160620 1298.873347
42 5632.0 1386.031107 1316.616051
43 5760.0 1393.073437 1325.573590
44 5888.0 1384.514443 1344.477591
45 6016.0 1402.229447 1353.904998
46 6144.0 1406.090800 1372.143944
47 6272.0 1411.159383 1377.042845
48 6400.0 1420.363397 1386.946958
49 6528.0 1417.478867 1396.651476
50 6656.0 1422.664975 1402.503745
51 6784.0 1414.297738 1413.491568
52 6912.0 1424.725000 1425.883779
53 7040.0 1418.800094 1431.643453
54 7168.0 1427.013003 1435.753709
55 7296.0 1428.164448 1440.691248
56 7424.0 1429.115300 1446.343748
57 7552.0 1428.217908 1456.068658
58 7680.0 1435.402188 1459.118131
59 7808.0 1432.049687 1464.103369
60 7936.0 1433.225180 1469.306879
61 8064.0 1439.446085 1476.361630
62 8192.0 1438.300708 1486.025328
63 8320.0 1388.962371 1400.338506
64 8448.0 1381.683628 1406.529581
65 8576.0 1394.282364 1393.081452
66 8704.0 1391.900835 1402.063229
67 8832.0 1383.477614 1405.623253
68 8960.0 1395.157292 1411.442257
69 9088.0 1410.804729 1413.481622
70 9216.0 1404.418290 1425.135636
71 9344.0 1395.579370 1425.386794
72 9472.0 1401.021723 1436.639423
73 9600.0 1397.299238 1433.041420
74 9728.0 1404.334592 1439.593652
75 9856.0 1418.089563 1439.920260
76 9984.0 1397.059480 1454.070686
77 10112.0 1410.432129 1455.029779
78 10240.0 1419.508487 1466.724038
79 10368.0 1416.064977 1460.785999
80 10496.0 1415.748779 1468.217737
81 10624.0 1413.552619 1468.242190
82 10752.0 1406.350759 1472.523624
83 10880.0 1399.917277 1482.411502
84 11008.0 1421.219211 1475.153289
85 11136.0 1422.793715 1483.329630
86 11264.0 1429.921991 1487.162682
87 11392.0 1414.617726 1488.746417
88 11520.0 1426.169420 1492.420112
89 11648.0 1425.516381 1497.208548
90 11776.0 1431.093073 1500.518771
91 11904.0 1440.873964 1509.154537
92 12032.0 1424.795279 1504.442472
93 12160.0 1418.936980 1510.449892
94 12288.0 1434.619046 1394.006075
95 12416.0 1450.910886 1388.227788
96 12544.0 1442.122532 1391.546047
97 12672.0 1445.332228 1390.707465
0 256.0 475.119885 695.943816
1 384.0 599.888221 814.497591
2 512.0 742.053950 905.459280
3 640.0 794.116965 957.358723
4 768.0 879.416529 1023.560336
5 896.0 936.762923 1061.441449
6 1024.0 983.253647 1122.508738
7 1152.0 1108.239858 615.398673
8 1280.0 1153.352464 670.071145
9 1408.0 1167.015563 724.837203
10 1536.0 1195.945540 779.626741
11 1664.0 1220.537268 815.326220
12 1792.0 1241.950121 856.177836
13 1920.0 1249.585273 909.664327
14 2048.0 1273.230591 960.698648
15 2176.0 1268.466515 978.609505
16 2304.0 1263.531713 1007.182920
17 2432.0 1297.489790 1058.429065
18 2560.0 1306.562331 1083.032610
19 2688.0 1307.392364 1104.273396
20 2816.0 1322.647017 1131.158457
21 2944.0 1326.012351 1168.911267
22 3072.0 1343.380037 1185.706550
23 3200.0 1348.612079 1193.949893
24 3328.0 1354.787880 1225.840105
25 3456.0 1373.628768 1246.345744
26 3584.0 1378.841844 1258.454542
27 3712.0 1385.044034 1267.715478
28 3840.0 1384.708021 1303.540698
29 3968.0 1391.843593 1317.011436
30 4096.0 1403.647323 1325.326313
31 4224.0 1342.050572 1160.172094
32 4352.0 1337.013188 1171.954908
33 4480.0 1350.924941 1182.126433
34 4608.0 1363.041761 1192.366524
35 4736.0 1361.789798 1200.462100
36 4864.0 1374.719507 1221.040141
37 4992.0 1370.134282 1236.756270
38 5120.0 1371.452347 1254.721400
39 5248.0 1373.996815 1258.531440
40 5376.0 1381.959174 1287.710689
41 5504.0 1382.424216 1301.744382
42 5632.0 1383.669790 1313.890249
43 5760.0 1395.490268 1328.129387
44 5888.0 1393.078866 1341.729457
45 6016.0 1395.468420 1352.132780
46 6144.0 1404.943892 1375.912915
47 6272.0 1412.920018 1376.754185
48 6400.0 1414.621565 1386.229751
49 6528.0 1415.416810 1392.492551
50 6656.0 1419.798806 1402.930427
51 6784.0 1410.589964 1413.886230
52 6912.0 1422.346796 1422.023969
53 7040.0 1417.202669 1429.671257
54 7168.0 1426.162280 1433.066177
55 7296.0 1427.262154 1443.939225
56 7424.0 1428.358829 1448.288150
57 7552.0 1428.796417 1455.921475
58 7680.0 1434.462314 1463.005167
59 7808.0 1432.161610 1467.619209
60 7936.0 1441.399705 1469.825270
61 8064.0 1440.680326 1471.727818
62 8192.0 1438.266258 1484.059239
63 8320.0 1385.761167 1401.849118
64 8448.0 1380.087726 1401.760620
65 8576.0 1394.502172 1397.637315
66 8704.0 1388.602811 1398.065719
67 8832.0 1381.139636 1406.115697
68 8960.0 1397.823159 1412.850284
69 9088.0 1409.352940 1414.398267
70 9216.0 1402.877686 1418.335068
71 9344.0 1398.310289 1426.344682
72 9472.0 1403.407980 1432.377616
73 9600.0 1395.768695 1433.486509
74 9728.0 1398.973953 1438.753724
75 9856.0 1413.121056 1445.897151
76 9984.0 1404.157024 1452.065360
77 10112.0 1413.298932 1456.867818
78 10240.0 1421.081302 1467.577480
79 10368.0 1413.005624 1462.667132
80 10496.0 1419.822223 1465.329561
81 10624.0 1408.313193 1465.703894
82 10752.0 1409.411735 1470.409409
83 10880.0 1399.427289 1481.431927
84 11008.0 1420.316444 1476.508172
85 11136.0 1424.671045 1488.620813
86 11264.0 1427.649123 1486.835959
87 11392.0 1415.113487 1489.200069
88 11520.0 1425.896005 1493.262533
89 11648.0 1423.979897 1498.025510
90 11776.0 1429.213893 1500.226685
91 11904.0 1441.956250 1507.282425
92 12032.0 1421.475467 1506.596440
93 12160.0 1421.558899 1511.507914
94 12288.0 1433.876810 1393.287585
95 12416.0 1448.061992 1390.440324
96 12544.0 1443.524720 1390.573352
97 12672.0 1446.491371 1391.947883
Expand All @@ -442,7 +442,7 @@ In the above plot, we can see that:

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 23.289 seconds)
**Total running time of the script:** (0 minutes 23.226 seconds)


.. _sphx_glr_download_getting-started_tutorials_02-fused-softmax.py:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -570,36 +570,36 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
matmul-performance-fp16:
M N K cuBLAS Triton
0 256.0 256.0 256.0 4.096000 4.096000
1 384.0 384.0 384.0 12.288000 12.288000
1 384.0 384.0 384.0 11.059200 12.288000
2 512.0 512.0 512.0 26.214401 26.214401
3 640.0 640.0 640.0 42.666665 42.666665
4 768.0 768.0 768.0 63.195428 68.056616
4 768.0 768.0 768.0 63.195428 63.195428
5 896.0 896.0 896.0 78.051553 93.661869
6 1024.0 1024.0 1024.0 104.857603 99.864382
7 1152.0 1152.0 1152.0 135.726544 129.825388
8 1280.0 1280.0 1280.0 157.538463 163.840004
9 1408.0 1408.0 1408.0 155.765024 132.970149
9 1408.0 1408.0 1408.0 151.438217 132.970149
10 1536.0 1536.0 1536.0 176.947204 153.867127
11 1664.0 1664.0 1664.0 183.651271 179.978245
12 1792.0 1792.0 1792.0 172.914215 204.353162
11 1664.0 1664.0 1664.0 179.978245 179.978245
12 1792.0 1792.0 1792.0 170.294302 204.353162
13 1920.0 1920.0 1920.0 200.347822 168.585369
14 2048.0 2048.0 2048.0 223.696203 190.650180
15 2176.0 2176.0 2176.0 211.827867 211.827867
16 2304.0 2304.0 2304.0 229.691080 225.357284
17 2432.0 2432.0 2432.0 200.674737 202.118452
18 2560.0 2560.0 2560.0 224.438347 219.919464
19 2688.0 2688.0 2688.0 198.602388 199.647657
20 2816.0 2816.0 2816.0 212.752230 211.719459
21 2944.0 2944.0 2944.0 220.513412 222.482283
22 3072.0 3072.0 3072.0 208.173173 213.672083
23 3200.0 3200.0 3200.0 213.333323 218.430042
24 3328.0 3328.0 3328.0 208.670419 207.467716
25 3456.0 3456.0 3456.0 216.724640 216.724640
26 3584.0 3584.0 3584.0 218.772251 213.069643
27 3712.0 3712.0 3712.0 210.310194 214.371984
28 3840.0 3840.0 3840.0 210.250955 211.456969
29 3968.0 3968.0 3968.0 208.231980 211.847104
30 4096.0 4096.0 4096.0 219.668951 221.481394
15 2176.0 2176.0 2176.0 211.827867 209.621326
16 2304.0 2304.0 2304.0 229.691080 227.503545
17 2432.0 2432.0 2432.0 205.069087 200.674737
18 2560.0 2560.0 2560.0 218.453323 212.779229
19 2688.0 2688.0 2688.0 195.531224 198.602388
20 2816.0 2816.0 2816.0 213.795141 212.752230
21 2944.0 2944.0 2944.0 220.513412 220.513412
22 3072.0 3072.0 3072.0 208.173173 212.868821
23 3200.0 3200.0 3200.0 217.687077 218.430042
24 3328.0 3328.0 3328.0 208.067338 209.887165
25 3456.0 3456.0 3456.0 216.724640 216.433749
26 3584.0 3584.0 3584.0 216.663602 214.595213
27 3712.0 3712.0 3712.0 209.868376 214.602246
28 3840.0 3840.0 3840.0 211.456969 209.454544
29 3968.0 3968.0 3968.0 209.663117 214.830867
30 4096.0 4096.0 4096.0 219.668951 219.668951
matmul-performance-fp8:
M N K Triton
0 256.0 256.0 256.0 3.276800
Expand All @@ -610,37 +610,37 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
5 896.0 896.0 896.0 58.538665
6 1024.0 1024.0 1024.0 61.680940
7 1152.0 1152.0 1152.0 80.702267
8 1280.0 1280.0 1280.0 99.902441
8 1280.0 1280.0 1280.0 102.400003
9 1408.0 1408.0 1408.0 82.602666
10 1536.0 1536.0 1536.0 98.303997
11 1664.0 1664.0 1664.0 115.370671
12 1792.0 1792.0 1792.0 133.802668
13 1920.0 1920.0 1920.0 100.173911
13 1920.0 1920.0 1920.0 99.453240
14 2048.0 2048.0 2048.0 114.130722
15 2176.0 2176.0 2176.0 120.500882
16 2304.0 2304.0 2304.0 133.451803
17 2432.0 2432.0 2432.0 131.898888
18 2560.0 2560.0 2560.0 146.285712
19 2688.0 2688.0 2688.0 117.439807
20 2816.0 2816.0 2816.0 128.655484
21 2944.0 2944.0 2944.0 139.596724
22 3072.0 3072.0 3072.0 144.079147
23 3200.0 3200.0 3200.0 138.228941
24 3328.0 3328.0 3328.0 130.655709
25 3456.0 3456.0 3456.0 138.763456
26 3584.0 3584.0 3584.0 148.375230
27 3712.0 3712.0 3712.0 140.502593
19 2688.0 2688.0 2688.0 117.077336
20 2816.0 2816.0 2816.0 128.277083
21 2944.0 2944.0 2944.0 138.819031
22 3072.0 3072.0 3072.0 143.713461
23 3200.0 3200.0 3200.0 139.433550
24 3328.0 3328.0 3328.0 130.419012
25 3456.0 3456.0 3456.0 138.287420
26 3584.0 3584.0 3584.0 146.920574
27 3712.0 3712.0 3712.0 140.700486
28 3840.0 3840.0 3840.0 137.895263
29 3968.0 3968.0 3968.0 145.439735
30 4096.0 4096.0 4096.0 154.985826
29 3968.0 3968.0 3968.0 145.961642
30 4096.0 4096.0 4096.0 155.524599
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (2 minutes 17.685 seconds)
**Total running time of the script:** (2 minutes 17.857 seconds)


.. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -242,7 +242,7 @@ References
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 0.735 seconds)
**Total running time of the script:** (0 minutes 0.709 seconds)


.. _sphx_glr_download_getting-started_tutorials_04-low-memory-dropout.py:
Expand Down
Loading

0 comments on commit 78edcd0

Please sign in to comment.