[GH-PAGES] Updated website
gh-actions-bot authored and gh-actions-bot committed Sep 2, 2024
1 parent 78edcd0 commit c8e05a7
Showing 61 changed files with 411 additions and 411 deletions.
Binary file modified main/.doctrees/environment.pickle
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/01-vector-add.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/05-layer-norm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/getting-started/tutorials/08-grouped-gemm.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/.doctrees/sg_execution_times.doctree
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file modified main/_images/sphx_glr_01-vector-add_001.png
Binary file modified main/_images/sphx_glr_01-vector-add_thumb.png
Binary file modified main/_images/sphx_glr_02-fused-softmax_001.png
Binary file modified main/_images/sphx_glr_02-fused-softmax_thumb.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_001.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_002.png
Binary file modified main/_images/sphx_glr_03-matrix-multiplication_thumb.png
Binary file modified main/_images/sphx_glr_05-layer-norm_001.png
Binary file modified main/_images/sphx_glr_05-layer-norm_thumb.png
Binary file modified main/_images/sphx_glr_06-fused-attention_001.png
Binary file modified main/_images/sphx_glr_06-fused-attention_002.png
Binary file modified main/_images/sphx_glr_06-fused-attention_003.png
Binary file modified main/_images/sphx_glr_06-fused-attention_thumb.png
Binary file modified main/_images/sphx_glr_08-grouped-gemm_001.png
Binary file modified main/_images/sphx_glr_08-grouped-gemm_thumb.png
8 changes: 4 additions & 4 deletions main/_sources/getting-started/tutorials/01-vector-add.rst.txt
@@ -230,17 +230,17 @@ We can now run the decorated function above. Pass `print_data=True` to see the p

vector-add-performance:
size Triton Torch
0 4096.0 8.000000 8.000000
0 4096.0 9.600000 9.600000
1 8192.0 15.999999 19.200000
2 16384.0 31.999999 31.999999
2 16384.0 31.999999 38.400001
3 32768.0 63.999998 63.999998
4 65536.0 127.999995 127.999995
5 131072.0 219.428568 219.428568
6 262144.0 384.000001 384.000001
7 524288.0 614.400016 614.400016
8 1048576.0 819.200021 819.200021
9 2097152.0 1023.999964 1023.999964
10 4194304.0 1228.800031 1228.800031
10 4194304.0 1260.307736 1260.307736
11 8388608.0 1424.695621 1424.695621
12 16777216.0 1560.380965 1560.380965
13 33554432.0 1624.859540 1624.859540
@@ -253,7 +253,7 @@ We can now run the decorated function above. Pass `print_data=True` to see the p

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 10.546 seconds)
**Total running time of the script:** (0 minutes 9.366 seconds)


.. _sphx_glr_download_getting-started_tutorials_01-vector-add.py:
198 changes: 99 additions & 99 deletions main/_sources/getting-started/tutorials/02-fused-softmax.rst.txt
@@ -330,104 +330,104 @@ We will then compare its performance against (1) :code:`torch.softmax` and (2) t
softmax-performance:
N Triton Torch
0 256.0 475.119885 695.943816
1 384.0 599.888221 814.497591
2 512.0 742.053950 905.459280
3 640.0 794.116965 957.358723
4 768.0 879.416529 1023.560336
5 896.0 936.762923 1061.441449
6 1024.0 983.253647 1122.508738
7 1152.0 1108.239858 615.398673
8 1280.0 1153.352464 670.071145
9 1408.0 1167.015563 724.837203
10 1536.0 1195.945540 779.626741
11 1664.0 1220.537268 815.326220
12 1792.0 1241.950121 856.177836
13 1920.0 1249.585273 909.664327
14 2048.0 1273.230591 960.698648
15 2176.0 1268.466515 978.609505
16 2304.0 1263.531713 1007.182920
17 2432.0 1297.489790 1058.429065
18 2560.0 1306.562331 1083.032610
19 2688.0 1307.392364 1104.273396
20 2816.0 1322.647017 1131.158457
21 2944.0 1326.012351 1168.911267
22 3072.0 1343.380037 1185.706550
23 3200.0 1348.612079 1193.949893
24 3328.0 1354.787880 1225.840105
25 3456.0 1373.628768 1246.345744
26 3584.0 1378.841844 1258.454542
27 3712.0 1385.044034 1267.715478
28 3840.0 1384.708021 1303.540698
29 3968.0 1391.843593 1317.011436
30 4096.0 1403.647323 1325.326313
31 4224.0 1342.050572 1160.172094
32 4352.0 1337.013188 1171.954908
33 4480.0 1350.924941 1182.126433
34 4608.0 1363.041761 1192.366524
35 4736.0 1361.789798 1200.462100
36 4864.0 1374.719507 1221.040141
37 4992.0 1370.134282 1236.756270
38 5120.0 1371.452347 1254.721400
39 5248.0 1373.996815 1258.531440
40 5376.0 1381.959174 1287.710689
41 5504.0 1382.424216 1301.744382
42 5632.0 1383.669790 1313.890249
43 5760.0 1395.490268 1328.129387
44 5888.0 1393.078866 1341.729457
45 6016.0 1395.468420 1352.132780
46 6144.0 1404.943892 1375.912915
47 6272.0 1412.920018 1376.754185
48 6400.0 1414.621565 1386.229751
49 6528.0 1415.416810 1392.492551
50 6656.0 1419.798806 1402.930427
51 6784.0 1410.589964 1413.886230
52 6912.0 1422.346796 1422.023969
53 7040.0 1417.202669 1429.671257
54 7168.0 1426.162280 1433.066177
55 7296.0 1427.262154 1443.939225
56 7424.0 1428.358829 1448.288150
57 7552.0 1428.796417 1455.921475
58 7680.0 1434.462314 1463.005167
59 7808.0 1432.161610 1467.619209
60 7936.0 1441.399705 1469.825270
61 8064.0 1440.680326 1471.727818
62 8192.0 1438.266258 1484.059239
63 8320.0 1385.761167 1401.849118
64 8448.0 1380.087726 1401.760620
65 8576.0 1394.502172 1397.637315
66 8704.0 1388.602811 1398.065719
67 8832.0 1381.139636 1406.115697
68 8960.0 1397.823159 1412.850284
69 9088.0 1409.352940 1414.398267
70 9216.0 1402.877686 1418.335068
71 9344.0 1398.310289 1426.344682
72 9472.0 1403.407980 1432.377616
73 9600.0 1395.768695 1433.486509
74 9728.0 1398.973953 1438.753724
75 9856.0 1413.121056 1445.897151
76 9984.0 1404.157024 1452.065360
77 10112.0 1413.298932 1456.867818
78 10240.0 1421.081302 1467.577480
79 10368.0 1413.005624 1462.667132
80 10496.0 1419.822223 1465.329561
81 10624.0 1408.313193 1465.703894
82 10752.0 1409.411735 1470.409409
83 10880.0 1399.427289 1481.431927
84 11008.0 1420.316444 1476.508172
85 11136.0 1424.671045 1488.620813
86 11264.0 1427.649123 1486.835959
87 11392.0 1415.113487 1489.200069
88 11520.0 1425.896005 1493.262533
89 11648.0 1423.979897 1498.025510
90 11776.0 1429.213893 1500.226685
91 11904.0 1441.956250 1507.282425
92 12032.0 1421.475467 1506.596440
93 12160.0 1421.558899 1511.507914
94 12288.0 1433.876810 1393.287585
95 12416.0 1448.061992 1390.440324
96 12544.0 1443.524720 1390.573352
97 12672.0 1446.491371 1391.947883
0 256.0 475.850513 687.463360
1 384.0 616.598324 825.112804
2 512.0 752.152129 930.062178
3 640.0 797.584582 964.169118
4 768.0 872.367065 1029.463380
5 896.0 934.884339 1074.519017
6 1024.0 985.316544 1124.934713
7 1152.0 1100.938093 614.483101
8 1280.0 1144.145829 666.282666
9 1408.0 1166.470053 725.768367
10 1536.0 1187.488503 783.040887
11 1664.0 1217.560989 813.102953
12 1792.0 1239.594837 858.767981
13 1920.0 1255.130671 908.532628
14 2048.0 1278.701545 958.923718
15 2176.0 1254.419049 975.922840
16 2304.0 1265.354642 1008.972445
17 2432.0 1294.586034 1055.682987
18 2560.0 1308.759678 1087.911769
19 2688.0 1315.911953 1100.549637
20 2816.0 1321.422365 1134.170069
21 2944.0 1329.266873 1170.250995
22 3072.0 1347.321669 1183.111636
23 3200.0 1352.021891 1197.043150
24 3328.0 1355.746493 1228.090240
25 3456.0 1370.843429 1251.315455
26 3584.0 1379.689927 1257.778825
27 3712.0 1381.911262 1269.746611
28 3840.0 1382.284802 1301.266371
29 3968.0 1386.797272 1317.050896
30 4096.0 1400.924938 1327.234495
31 4224.0 1337.627442 1159.593694
32 4352.0 1339.284010 1177.002373
33 4480.0 1353.845012 1187.309380
34 4608.0 1361.356313 1191.749776
35 4736.0 1358.162950 1197.660073
36 4864.0 1381.446585 1222.915464
37 4992.0 1372.271561 1234.020894
38 5120.0 1378.461482 1248.665170
39 5248.0 1377.689456 1260.307868
40 5376.0 1374.849864 1283.708936
41 5504.0 1381.970726 1299.104494
42 5632.0 1388.620927 1313.794357
43 5760.0 1395.120302 1325.953945
44 5888.0 1387.343305 1345.302760
45 6016.0 1397.500649 1355.093919
46 6144.0 1404.645044 1373.124161
47 6272.0 1412.280252 1376.754872
48 6400.0 1419.217401 1389.266764
49 6528.0 1414.823095 1392.347225
50 6656.0 1422.273057 1405.216930
51 6784.0 1412.584923 1413.432776
52 6912.0 1422.823719 1422.035144
53 7040.0 1422.158871 1433.663183
54 7168.0 1426.703144 1436.355406
55 7296.0 1433.546795 1446.248913
56 7424.0 1428.803362 1444.594870
57 7552.0 1425.983852 1456.184970
58 7680.0 1435.346314 1461.436969
59 7808.0 1435.478493 1466.267894
60 7936.0 1438.223739 1469.039474
61 8064.0 1435.894832 1468.802294
62 8192.0 1438.607272 1482.975345
63 8320.0 1386.721060 1400.951414
64 8448.0 1374.077041 1404.988426
65 8576.0 1394.363804 1398.598262
66 8704.0 1387.006891 1397.755040
67 8832.0 1387.124078 1403.296882
68 8960.0 1402.363238 1410.031484
69 9088.0 1410.029811 1416.243861
70 9216.0 1401.633733 1426.089034
71 9344.0 1399.234216 1426.214196
72 9472.0 1400.425838 1435.131292
73 9600.0 1393.387227 1437.205521
74 9728.0 1399.954388 1440.983161
75 9856.0 1415.125637 1442.984461
76 9984.0 1402.043881 1449.643036
77 10112.0 1412.833069 1455.004550
78 10240.0 1417.773187 1465.911287
79 10368.0 1415.281766 1462.021080
80 10496.0 1415.052579 1465.689950
81 10624.0 1414.876713 1467.154076
82 10752.0 1407.639319 1473.342661
83 10880.0 1400.496469 1479.757848
84 11008.0 1417.468412 1477.633613
85 11136.0 1423.261095 1483.761393
86 11264.0 1430.326091 1485.956573
87 11392.0 1418.367390 1488.453275
88 11520.0 1422.428604 1493.777990
89 11648.0 1427.661323 1498.169731
90 11776.0 1428.903804 1500.873947
91 11904.0 1443.563467 1506.566059
92 12032.0 1420.868906 1505.421205
93 12160.0 1421.112554 1511.968572
94 12288.0 1436.313305 1392.213390
95 12416.0 1446.708961 1389.593389
96 12544.0 1438.186113 1394.755195
97 12672.0 1445.136445 1392.092798
@@ -442,7 +442,7 @@ In the above plot, we can see that:

.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 23.226 seconds)
**Total running time of the script:** (0 minutes 23.286 seconds)


.. _sphx_glr_download_getting-started_tutorials_02-fused-softmax.py:
@@ -574,32 +574,32 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
2 512.0 512.0 512.0 26.214401 26.214401
3 640.0 640.0 640.0 42.666665 42.666665
4 768.0 768.0 768.0 63.195428 63.195428
5 896.0 896.0 896.0 78.051553 93.661869
6 1024.0 1024.0 1024.0 104.857603 99.864382
5 896.0 896.0 896.0 78.051553 87.808000
6 1024.0 1024.0 1024.0 104.857603 104.857603
7 1152.0 1152.0 1152.0 135.726544 129.825388
8 1280.0 1280.0 1280.0 157.538463 163.840004
9 1408.0 1408.0 1408.0 151.438217 132.970149
9 1408.0 1408.0 1408.0 155.765024 132.970149
10 1536.0 1536.0 1536.0 176.947204 153.867127
11 1664.0 1664.0 1664.0 179.978245 179.978245
12 1792.0 1792.0 1792.0 170.294302 204.353162
13 1920.0 1920.0 1920.0 200.347822 168.585369
14 2048.0 2048.0 2048.0 223.696203 190.650180
15 2176.0 2176.0 2176.0 211.827867 209.621326
16 2304.0 2304.0 2304.0 229.691080 227.503545
17 2432.0 2432.0 2432.0 205.069087 200.674737
18 2560.0 2560.0 2560.0 218.453323 212.779229
19 2688.0 2688.0 2688.0 195.531224 198.602388
20 2816.0 2816.0 2816.0 213.795141 212.752230
11 1664.0 1664.0 1664.0 183.651271 179.978245
12 1792.0 1792.0 1792.0 172.914215 208.137481
13 1920.0 1920.0 1920.0 197.485709 166.554219
14 2048.0 2048.0 2048.0 223.696203 192.841562
15 2176.0 2176.0 2176.0 209.621326 209.621326
16 2304.0 2304.0 2304.0 229.691080 229.691080
17 2432.0 2432.0 2432.0 206.576938 202.118452
18 2560.0 2560.0 2560.0 221.405396 219.919464
19 2688.0 2688.0 2688.0 197.567993 198.602388
20 2816.0 2816.0 2816.0 208.680416 210.696652
21 2944.0 2944.0 2944.0 220.513412 220.513412
22 3072.0 3072.0 3072.0 208.173173 212.868821
23 3200.0 3200.0 3200.0 217.687077 218.430042
24 3328.0 3328.0 3328.0 208.067338 209.887165
25 3456.0 3456.0 3456.0 216.724640 216.433749
26 3584.0 3584.0 3584.0 216.663602 214.595213
27 3712.0 3712.0 3712.0 209.868376 214.602246
28 3840.0 3840.0 3840.0 211.456969 209.454544
29 3968.0 3968.0 3968.0 209.663117 214.830867
30 4096.0 4096.0 4096.0 219.668951 219.668951
22 3072.0 3072.0 3072.0 206.653671 213.672083
23 3200.0 3200.0 3200.0 213.333323 216.216207
24 3328.0 3328.0 3328.0 208.067338 208.670419
25 3456.0 3456.0 3456.0 214.419058 219.080343
26 3584.0 3584.0 3584.0 216.142772 215.624440
27 3712.0 3712.0 3712.0 208.990259 211.199462
28 3840.0 3840.0 3840.0 207.879708 209.454544
29 3968.0 3968.0 3968.0 212.215536 214.830867
30 4096.0 4096.0 4096.0 221.481394 217.532790
matmul-performance-fp8:
M N K Triton
0 256.0 256.0 256.0 3.276800
@@ -610,28 +610,28 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
5 896.0 896.0 896.0 58.538665
6 1024.0 1024.0 1024.0 61.680940
7 1152.0 1152.0 1152.0 80.702267
8 1280.0 1280.0 1280.0 102.400003
9 1408.0 1408.0 1408.0 82.602666
8 1280.0 1280.0 1280.0 99.902441
9 1408.0 1408.0 1408.0 81.369790
10 1536.0 1536.0 1536.0 98.303997
11 1664.0 1664.0 1664.0 115.370671
12 1792.0 1792.0 1792.0 133.802668
13 1920.0 1920.0 1920.0 99.453240
13 1920.0 1920.0 1920.0 100.173911
14 2048.0 2048.0 2048.0 114.130722
15 2176.0 2176.0 2176.0 120.500882
16 2304.0 2304.0 2304.0 133.451803
16 2304.0 2304.0 2304.0 134.201527
17 2432.0 2432.0 2432.0 131.898888
18 2560.0 2560.0 2560.0 146.285712
19 2688.0 2688.0 2688.0 117.077336
20 2816.0 2816.0 2816.0 128.277083
19 2688.0 2688.0 2688.0 117.439807
20 2816.0 2816.0 2816.0 128.655484
21 2944.0 2944.0 2944.0 138.819031
22 3072.0 3072.0 3072.0 143.713461
23 3200.0 3200.0 3200.0 139.433550
23 3200.0 3200.0 3200.0 138.828637
24 3328.0 3328.0 3328.0 130.419012
25 3456.0 3456.0 3456.0 138.287420
26 3584.0 3584.0 3584.0 146.920574
26 3584.0 3584.0 3584.0 148.620481
27 3712.0 3712.0 3712.0 140.700486
28 3840.0 3840.0 3840.0 137.895263
29 3968.0 3968.0 3968.0 145.961642
28 3840.0 3840.0 3840.0 137.723536
29 3968.0 3968.0 3968.0 145.439735
30 4096.0 4096.0 4096.0 155.524599
@@ -640,7 +640,7 @@ but feel free to arrange this script as you wish to benchmark any other matrix s
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (2 minutes 17.857 seconds)
**Total running time of the script:** (2 minutes 17.535 seconds)


.. _sphx_glr_download_getting-started_tutorials_03-matrix-multiplication.py:
@@ -242,7 +242,7 @@ References
.. rst-class:: sphx-glr-timing

**Total running time of the script:** (0 minutes 0.709 seconds)
**Total running time of the script:** (0 minutes 0.699 seconds)


.. _sphx_glr_download_getting-started_tutorials_04-low-memory-dropout.py: