[GH-PAGES] Updated website
gh-actions-bot authored and gh-actions-bot committed Sep 8, 2024
1 parent b8d4b68 commit 58f6d9a
Showing 61 changed files with 417 additions and 417 deletions.
Binary file modified main/.doctrees/environment.pickle
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/01-vector-add.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/05-layer-norm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/08-grouped-gemm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/sg_execution_times.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/_images/sphx_glr_01-vector-add_001.png
Binary file modified main/_images/sphx_glr_01-vector-add_thumb.png
Binary file modified main/_images/sphx_glr_02-fused-softmax_001.png
Binary file modified main/_images/sphx_glr_02-fused-softmax_thumb.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_001.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_002.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_thumb.png
Binary file modified main/_images/sphx_glr_05-layer-norm_001.png
Binary file modified main/_images/sphx_glr_05-layer-norm_thumb.png
Binary file modified main/_images/sphx_glr_06-fused-attention_001.png
Binary file modified main/_images/sphx_glr_06-fused-attention_002.png
Binary file modified main/_images/sphx_glr_06-fused-attention_003.png
Binary file modified main/_images/sphx_glr_06-fused-attention_thumb.png
Binary file modified main/_images/sphx_glr_08-grouped-gemm_001.png
Binary file modified main/_images/sphx_glr_08-grouped-gemm_thumb.png
10 changes: 5 additions & 5 deletions main/_sources/getting-started/tutorials/01-vector-add.rst.txt
@@ -232,28 +232,28 @@ We can now run the decorated function above. Pass `print_data=True` to see the p
size Triton Torch
0 4096.0 8.000000 8.000000
1 8192.0 15.999999 15.999999
2 16384.0 31.999999 31.999999
2 16384.0 38.400001 31.999999
3 32768.0 63.999998 63.999998
4 65536.0 127.999995 127.999995
5 131072.0 219.428568 219.428568
6 262144.0 384.000001 384.000001
7 524288.0 614.400016 614.400016
8 1048576.0 819.200021 819.200021
9 2097152.0 1023.999964 1023.999964
10 4194304.0 1228.800031 1228.800031
11 8388608.0 1424.695621 1404.342820
10 4194304.0 1260.307736 1228.800031
11 8388608.0 1424.695621 1424.695621
12 16777216.0 1560.380965 1560.380965
13 33554432.0 1631.601649 1624.859540
14 67108864.0 1669.706983 1662.646960
15 134217728.0 1684.008546 1678.616907
15 134217728.0 1685.813499 1678.616907





.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 6.723 seconds)
**Total running time of the script:** (0 minutes 6.655 seconds)


.. _sphx_glr_download_getting-started_tutorials_01-vector-add.py:
198 changes: 99 additions & 99 deletions main/_sources/getting-started/tutorials/02-fused-softmax.rst.txt
@@ -330,104 +330,104 @@ We will then compare its performance against (1) :code:`torch.softmax` and (2) t
softmax-performance:
N Triton Torch
0 256.0 479.591087 686.780777
1 384.0 606.329385 811.084783
2 512.0 759.148903 917.236325
3 640.0 784.606674 963.285389
4 768.0 878.976622 1012.756900
5 896.0 936.982431 1067.047583
6 1024.0 993.348041 1109.168311
7 1152.0 1102.971819 613.547884
8 1280.0 1138.184560 671.634363
9 1408.0 1155.897226 724.090333
10 1536.0 1191.316485 778.447155
11 1664.0 1216.788990 809.628678
12 1792.0 1238.494639 858.334953
13 1920.0 1253.417042 908.037302
14 2048.0 1269.259338 953.436477
15 2176.0 1256.971745 975.439013
16 2304.0 1273.536301 1011.647880
17 2432.0 1293.589320 1052.346948
18 2560.0 1300.039855 1087.310601
19 2688.0 1313.709434 1098.697721
20 2816.0 1321.950869 1127.038728
21 2944.0 1321.396901 1167.914567
22 3072.0 1352.850905 1180.985293
23 3200.0 1350.322541 1194.869292
24 3328.0 1356.696780 1220.645876
25 3456.0 1370.160685 1247.851511
26 3584.0 1376.749220 1261.037051
27 3712.0 1387.322807 1265.337871
28 3840.0 1384.630822 1298.935824
29 3968.0 1392.280764 1312.133859
30 4096.0 1395.284741 1327.113740
31 4224.0 1333.870054 1157.573503
32 4352.0 1334.753963 1177.066793
33 4480.0 1354.228198 1183.200907
34 4608.0 1361.654589 1196.523793
35 4736.0 1356.343469 1199.647767
36 4864.0 1376.871385 1222.899944
37 4992.0 1368.337785 1238.186815
38 5120.0 1375.114021 1252.122355
39 5248.0 1371.741881 1254.659695
40 5376.0 1377.132400 1286.350160
41 5504.0 1374.163119 1296.429048
42 5632.0 1383.639491 1315.019705
43 5760.0 1394.006823 1323.923430
44 5888.0 1395.175890 1339.341988
45 6016.0 1398.239767 1356.552649
46 6144.0 1407.111343 1375.300442
47 6272.0 1414.286270 1376.879531
48 6400.0 1416.124408 1390.245667
49 6528.0 1413.135628 1396.274773
50 6656.0 1421.462948 1400.787411
51 6784.0 1405.418965 1413.893064
52 6912.0 1429.922242 1425.265663
53 7040.0 1420.268859 1431.970540
54 7168.0 1426.904234 1436.504632
55 7296.0 1428.848471 1439.903310
56 7424.0 1429.889980 1442.603892
57 7552.0 1427.233875 1454.240622
58 7680.0 1433.454680 1460.214321
59 7808.0 1429.409941 1466.211424
60 7936.0 1439.817454 1466.702774
61 8064.0 1435.910645 1471.880026
62 8192.0 1440.488795 1482.469613
63 8320.0 1385.196502 1403.558392
64 8448.0 1381.196296 1405.175146
65 8576.0 1394.654387 1392.750587
66 8704.0 1389.283047 1400.915735
67 8832.0 1381.447123 1404.645393
68 8960.0 1400.210307 1411.632670
69 9088.0 1409.877714 1415.073481
70 9216.0 1406.550586 1426.026518
71 9344.0 1398.570404 1423.363447
72 9472.0 1394.784036 1431.838719
73 9600.0 1396.219370 1432.961948
74 9728.0 1400.115561 1443.202285
75 9856.0 1413.348391 1442.315923
76 9984.0 1401.335040 1451.377582
77 10112.0 1412.219870 1452.109506
78 10240.0 1420.010313 1467.536831
79 10368.0 1413.298436 1463.078903
80 10496.0 1413.837933 1469.054667
81 10624.0 1410.845223 1465.886907
82 10752.0 1401.779613 1470.154689
83 10880.0 1399.075895 1483.569163
84 11008.0 1415.474194 1475.722004
85 11136.0 1422.763130 1482.443433
86 11264.0 1428.716004 1486.668993
87 11392.0 1416.737929 1486.493588
88 11520.0 1421.482944 1496.738310
89 11648.0 1423.347027 1500.292973
90 11776.0 1430.859961 1501.073525
91 11904.0 1443.636709 1505.693946
92 12032.0 1429.536656 1508.197570
93 12160.0 1420.532604 1508.366363
94 12288.0 1439.085584 1392.058470
95 12416.0 1446.501558 1390.692311
96 12544.0 1441.391276 1391.394575
97 12672.0 1446.591091 1393.094678
0 256.0 483.504849 705.858619
1 384.0 609.909256 819.825104
2 512.0 752.501069 924.816481
3 640.0 797.672563 963.880055
4 768.0 881.726191 1018.420477
5 896.0 929.768442 1064.831875
6 1024.0 999.494799 1122.001862
7 1152.0 1102.558978 614.441742
8 1280.0 1142.090409 669.986852
9 1408.0 1168.353519 725.768367
10 1536.0 1193.606858 780.036253
11 1664.0 1211.625586 816.613371
12 1792.0 1234.005969 857.585530
13 1920.0 1257.909453 907.932932
14 2048.0 1279.372923 953.199062
15 2176.0 1256.866836 976.941599
16 2304.0 1272.111609 1007.991351
17 2432.0 1293.568044 1053.980286
18 2560.0 1302.443672 1084.044097
19 2688.0 1310.543069 1100.064422
20 2816.0 1329.550121 1132.845900
21 2944.0 1325.106103 1165.101730
22 3072.0 1354.263898 1181.937432
23 3200.0 1356.768765 1196.734630
24 3328.0 1361.569081 1223.640447
25 3456.0 1372.260392 1251.372527
26 3584.0 1377.824967 1263.990813
27 3712.0 1388.452077 1269.160168
28 3840.0 1390.195771 1300.371636
29 3968.0 1392.128272 1316.642004
30 4096.0 1401.440985 1325.993289
31 4224.0 1335.750716 1161.324964
32 4352.0 1341.910848 1174.716639
33 4480.0 1357.279725 1184.106737
34 4608.0 1363.505422 1194.268459
35 4736.0 1355.972089 1202.458558
36 4864.0 1376.545694 1221.469200
37 4992.0 1371.193311 1233.980250
38 5120.0 1372.928619 1249.661093
39 5248.0 1376.524535 1256.566657
40 5376.0 1380.404894 1286.955387
41 5504.0 1376.185871 1296.784498
42 5632.0 1387.308569 1316.781373
43 5760.0 1394.788007 1325.776437
44 5888.0 1393.254701 1340.097197
45 6016.0 1397.522942 1351.778948
46 6144.0 1407.036902 1372.097123
47 6272.0 1415.862255 1374.950602
48 6400.0 1412.979892 1391.500644
49 6528.0 1413.225499 1392.287672
50 6656.0 1419.501659 1401.350961
51 6784.0 1414.210591 1417.375134
52 6912.0 1427.230434 1425.416165
53 7040.0 1415.621263 1433.818007
54 7168.0 1427.284870 1435.603462
55 7296.0 1433.278373 1442.232505
56 7424.0 1429.853412 1448.062699
57 7552.0 1430.825507 1455.662158
58 7680.0 1436.538042 1462.944463
59 7808.0 1432.563937 1465.515737
60 7936.0 1436.234983 1468.974307
61 8064.0 1438.197861 1472.449191
62 8192.0 1438.820876 1484.991319
63 8320.0 1388.828363 1400.923094
64 8448.0 1382.080815 1404.912243
65 8576.0 1397.828756 1398.077460
66 8704.0 1389.103978 1401.588898
67 8832.0 1388.464599 1406.114984
68 8960.0 1394.242119 1411.884544
69 9088.0 1406.154896 1416.931387
70 9216.0 1401.332416 1424.186015
71 9344.0 1399.101498 1425.765896
72 9472.0 1397.400184 1437.613232
73 9600.0 1397.414486 1429.620929
74 9728.0 1404.383086 1443.278039
75 9856.0 1414.338572 1442.781395
76 9984.0 1400.183684 1449.253765
77 10112.0 1413.006160 1455.536657
78 10240.0 1420.935224 1468.628917
79 10368.0 1412.594682 1463.525008
80 10496.0 1412.404248 1466.541052
81 10624.0 1411.253194 1469.755374
82 10752.0 1403.887393 1470.905480
83 10880.0 1401.058090 1481.231802
84 11008.0 1418.032552 1476.656156
85 11136.0 1423.022405 1483.522921
86 11264.0 1426.833981 1487.372540
87 11392.0 1415.976223 1489.733547
88 11520.0 1420.858834 1493.378907
89 11648.0 1428.996384 1496.158752
90 11776.0 1429.987493 1501.891181
91 11904.0 1441.947990 1504.710219
92 12032.0 1420.270029 1508.437638
93 12160.0 1420.123557 1509.548090
94 12288.0 1434.772004 1390.825795
95 12416.0 1447.592587 1389.969032
96 12544.0 1443.655266 1391.978887
97 12672.0 1449.629836 1395.389754
@@ -442,7 +442,7 @@ In the above plot, we can see that:

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 23.295 seconds)
**Total running time of the script:** (0 minutes 23.233 seconds)


.. _sphx_glr_download_getting-started_tutorials_02-fused-softmax.py:
@@ -573,33 +573,33 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
1 384.0 384.0 384.0 12.288000 12.288000
2 512.0 512.0 512.0 26.214401 26.214401
3 640.0 640.0 640.0 42.666665 42.666665
4 768.0 768.0 768.0 63.195428 63.195428
5 896.0 896.0 896.0 78.051553 87.808000
6 1024.0 1024.0 1024.0 110.376426 99.864382
4 768.0 768.0 768.0 63.195428 68.056616
5 896.0 896.0 896.0 78.051553 93.661869
6 1024.0 1024.0 1024.0 104.857603 104.857603
7 1152.0 1152.0 1152.0 135.726544 129.825388
8 1280.0 1280.0 1280.0 157.538463 163.840004
9 1408.0 1408.0 1408.0 155.765024 132.970149
10 1536.0 1536.0 1536.0 176.947204 157.286398
10 1536.0 1536.0 1536.0 176.947204 153.867127
11 1664.0 1664.0 1664.0 183.651271 179.978245
12 1792.0 1792.0 1792.0 172.914215 204.353162
12 1792.0 1792.0 1792.0 172.914215 208.137481
13 1920.0 1920.0 1920.0 200.347822 166.554219
14 2048.0 2048.0 2048.0 226.719125 192.841562
15 2176.0 2176.0 2176.0 211.827867 211.827867
16 2304.0 2304.0 2304.0 231.921091 229.691080
17 2432.0 2432.0 2432.0 205.069087 205.069087
18 2560.0 2560.0 2560.0 224.438347 222.911566
19 2688.0 2688.0 2688.0 199.647657 200.704002
20 2816.0 2816.0 2816.0 212.752230 210.696652
21 2944.0 2944.0 2944.0 221.493479 223.479969
22 3072.0 3072.0 3072.0 208.941345 212.868821
23 3200.0 3200.0 3200.0 213.333323 220.689658
24 3328.0 3328.0 3328.0 209.277023 209.887165
25 3456.0 3456.0 3456.0 214.419058 220.880999
26 3584.0 3584.0 3584.0 215.624440 213.069643
27 3712.0 3712.0 3712.0 210.310194 217.168134
28 3840.0 3840.0 3840.0 209.851994 209.454544
29 3968.0 3968.0 3968.0 210.749463 217.511464
30 4096.0 4096.0 4096.0 219.668951 220.029067
15 2176.0 2176.0 2176.0 211.827867 209.621326
16 2304.0 2304.0 2304.0 229.691080 231.921091
17 2432.0 2432.0 2432.0 205.069087 202.118452
18 2560.0 2560.0 2560.0 224.438347 218.453323
19 2688.0 2688.0 2688.0 200.704002 198.602388
20 2816.0 2816.0 2816.0 212.752230 207.686706
21 2944.0 2944.0 2944.0 220.513412 222.482283
22 3072.0 3072.0 3072.0 210.494802 212.868821
23 3200.0 3200.0 3200.0 218.430042 219.178074
24 3328.0 3328.0 3328.0 209.887165 209.887165
25 3456.0 3456.0 3456.0 220.880999 218.486642
26 3584.0 3584.0 3584.0 218.772251 215.108588
27 3712.0 3712.0 3712.0 208.990259 214.833002
28 3840.0 3840.0 3840.0 210.250955 209.851994
29 3968.0 3968.0 3968.0 208.945088 217.899880
30 4096.0 4096.0 4096.0 220.029067 220.029067
matmul-performance-fp8:
M N K Triton
0 256.0 256.0 256.0 3.276800
@@ -610,7 +610,7 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
5 896.0 896.0 896.0 58.538665
6 1024.0 1024.0 1024.0 61.680940
7 1152.0 1152.0 1152.0 80.702267
8 1280.0 1280.0 1280.0 99.902441
8 1280.0 1280.0 1280.0 102.400003
9 1408.0 1408.0 1408.0 81.369790
10 1536.0 1536.0 1536.0 98.303997
11 1664.0 1664.0 1664.0 115.370671
@@ -620,27 +620,27 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
15 2176.0 2176.0 2176.0 120.500882
16 2304.0 2304.0 2304.0 134.959733
17 2432.0 2432.0 2432.0 132.521057
18 2560.0 2560.0 2560.0 146.285712
19 2688.0 2688.0 2688.0 117.439807
18 2560.0 2560.0 2560.0 145.635558
19 2688.0 2688.0 2688.0 118.171514
20 2816.0 2816.0 2816.0 128.655484
21 2944.0 2944.0 2944.0 139.988852
21 2944.0 2944.0 2944.0 138.819031
22 3072.0 3072.0 3072.0 144.079147
23 3200.0 3200.0 3200.0 139.737993
24 3328.0 3328.0 3328.0 131.852184
25 3456.0 3456.0 3456.0 139.725414
26 3584.0 3584.0 3584.0 148.620481
27 3712.0 3712.0 3712.0 141.698358
28 3840.0 3840.0 3840.0 137.895263
29 3968.0 3968.0 3968.0 147.016795
30 4096.0 4096.0 4096.0 154.985826
23 3200.0 3200.0 3200.0 139.433550
24 3328.0 3328.0 3328.0 130.893266
25 3456.0 3456.0 3456.0 138.287420
26 3584.0 3584.0 3584.0 149.113421
27 3712.0 3712.0 3712.0 142.303911
28 3840.0 3840.0 3840.0 137.723536
29 3968.0 3968.0 3968.0 147.194128
30 4096.0 4096.0 4096.0 154.807064
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (2 minutes 17.342 seconds)
**Total running time of the script:** (2 minutes 17.240 seconds)


.. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py:
@@ -114,7 +114,7 @@ Let's first take a look at the baseline implementation.

.. code-block:: none
/home/runner/_work/triton/triton/python/triton/language/semantic.py:1502: UserWarning: tl.where with a non-boolean condition is deprecated and will error out in a future triton release. Got int32
/home/runner/_work/triton/triton/python/triton/language/semantic.py:1506: UserWarning: tl.where with a non-boolean condition is deprecated and will error out in a future triton release. Got int32
warnings.warn(
--------- ------- --------- -------- -------- -------- -------- -------- -------- --------- ---------
input 1.541 -0.293429 -2.17879 0.568431 -1.08452 -1.3986 0.403347 0.838026 -0.719258 -0.403344
@@ -244,7 +244,7 @@ References
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 0.703 seconds)
**Total running time of the script:** (0 minutes 0.709 seconds)


.. _sphx_glr_download_getting-started_tutorials_04-low-memory-dropout.py: