fix primitive_type for complex (#193)
* fix primitive_type for complex
* add simple complex runtime test
Showing 2 changed files with 9 additions and 2 deletions.
35a51ad
Reactant.jl Benchmarks
| Benchmark suite | Current | Previous | Ratio |
|-|-|-|-|
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1233718266 ns | 1236544173 ns | 1.00 |
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant | 1272290171 ns | 1242881142 ns | 1.02 |
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1414092991 ns | 1235658910 ns | 1.14 |
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2507794592 ns | 2350423218 ns | 1.07 |
| ViT base (256 x 256 x 3 x 32)/forward/CUDA/Lux | 218244448 ns | 217352004 ns | 1.00 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme) | 7057019589 ns | 6794443727 ns | 1.04 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant | 5136395044 ns | 5221674868 ns | 0.98 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme) | 5202106378 ns | 5093314901 ns | 1.02 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme) | 6826278112 ns | 6885416164 ns | 0.99 |
| ViT base (256 x 256 x 3 x 32)/forward/CPU/Lux | 33666785970 ns | 33707808014 ns | 1.00 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1308209141 ns | 1266808372 ns | 1.03 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant | 1354474554.5 ns | 1273495665.5 ns | 1.06 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1312596491 ns | 1333702124.5 ns | 0.98 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2573078535 ns | 2512672037 ns | 1.02 |
| ViT small (256 x 256 x 3 x 4)/forward/CUDA/Lux | 8770241 ns | 8492497 ns | 1.03 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme) | 1571674207.5 ns | 1623142223 ns | 0.97 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant | 1540710272.5 ns | 1600931077 ns | 0.96 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme) | 1548078395 ns | 1598434969 ns | 0.97 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme) | 2744115264 ns | 2912977764 ns | 0.94 |
| ViT small (256 x 256 x 3 x 4)/forward/CPU/Lux | 2213143753 ns | 2178291674 ns | 1.02 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1270044962 ns | 1191252514 ns | 1.07 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant | 1234436938 ns | 1267623530.5 ns | 0.97 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1294952462.5 ns | 1254309112.5 ns | 1.03 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2715588750 ns | 2522324425 ns | 1.08 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Lux | 22668152 ns | 21024085 ns | 1.08 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme) | 2134055837 ns | 2219640487 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant | 2138464342 ns | 2218982964 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme) | 2124865845 ns | 2252600530 ns | 0.94 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme) | 3323398587 ns | 3553615972 ns | 0.94 |
| ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Lux | 6167302164 ns | 5451831944 ns | 1.13 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1255998954 ns | 1219179894 ns | 1.03 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant | 1314721136.5 ns | 1266307789 ns | 1.04 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1279651269 ns | 1292867362.5 ns | 0.99 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2554634529 ns | 2395083528 ns | 1.07 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Lux | 7369398 ns | 7182253.5 ns | 1.03 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme) | 1402852118 ns | 1478653783 ns | 0.95 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant | 1403963317 ns | 1473188659 ns | 0.95 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme) | 1400054223.5 ns | 1458846498 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme) | 2604456962 ns | 2745979768 ns | 0.95 |
| ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Lux | 1231519358 ns | 1076729677.5 ns | 1.14 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1307469859.5 ns | 1307047788 ns | 1.00 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant | 1271107436.5 ns | 1277025925 ns | 1.00 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1331859396 ns | 1226039283.5 ns | 1.09 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2683313614 ns | 2531008275 ns | 1.06 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Lux | 12276652 ns | 11116198 ns | 1.10 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme) | 1688567666 ns | 1763174402 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant | 1688900936 ns | 1762263171 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme) | 1687635701 ns | 1755205670 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme) | 2910962217 ns | 3044952264 ns | 0.96 |
| ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Lux | 2991919710 ns | 3217666637.5 ns | 0.93 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1290428008 ns | 1396484090 ns | 0.92 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant | 1278577051 ns | 1263883093.5 ns | 1.01 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1247880063.5 ns | 1266345521 ns | 0.99 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2595166528 ns | 2519876095 ns | 1.03 |
| ViT small (256 x 256 x 3 x 16)/forward/CUDA/Lux | 27317118 ns | 25484298.5 ns | 1.07 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme) | 2166869626 ns | 2255146771 ns | 0.96 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant | 2153166296 ns | 2293550587 ns | 0.94 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme) | 2169198327 ns | 2308771124 ns | 0.94 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme) | 3389215278 ns | 3541388172 ns | 0.96 |
| ViT small (256 x 256 x 3 x 16)/forward/CPU/Lux | 7315062267 ns | 8606549877 ns | 0.85 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1303537244 ns | 1246919786 ns | 1.05 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant | 1357078784 ns | 1246468994 ns | 1.09 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1310042021 ns | 1242420953 ns | 1.05 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2603025150 ns | 2496367360 ns | 1.04 |
| ViT small (256 x 256 x 3 x 32)/forward/CUDA/Lux | 52794954.5 ns | 50142997 ns | 1.05 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme) | 3030272839 ns | 3110834290 ns | 0.97 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant | 2959738629 ns | 3087052147 ns | 0.96 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme) | 2929830073 ns | 3073238853 ns | 0.95 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme) | 4299042483 ns | 4465557597 ns | 0.96 |
| ViT small (256 x 256 x 3 x 32)/forward/CPU/Lux | 9053699739 ns | 12448453806 ns | 0.73 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1303409186 ns | 1207653927 ns | 1.08 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant | 1299005328 ns | 1252669878.5 ns | 1.04 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1327270461 ns | 1284419122 ns | 1.03 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2424433409 ns | 2435990194 ns | 1.00 |
| ViT base (256 x 256 x 3 x 16)/forward/CUDA/Lux | 71019572 ns | 67827398 ns | 1.05 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme) | 3188826413 ns | 3295855546 ns | 0.97 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant | 3252391187 ns | 3257027916 ns | 1.00 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme) | 3190391621 ns | 3317925170 ns | 0.96 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme) | 4502893678 ns | 4627513897 ns | 0.97 |
| ViT base (256 x 256 x 3 x 16)/forward/CPU/Lux | 13177363170 ns | 13704380374 ns | 0.96 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme) | 1264446460 ns | 1250112499 ns | 1.01 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant | 1275969531 ns | 1225627385 ns | 1.04 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme) | 1236289619.5 ns | 1253439196.5 ns | 0.99 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme) | 2377926514 ns | 2466672353 ns | 0.96 |
| ViT base (256 x 256 x 3 x 4)/forward/CUDA/Lux | 20589563.5 ns | 19404003 ns | 1.06 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme) | 1840153244 ns | 1913014192 ns | 0.96 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant | 1822459856 ns | 1918443489 ns | 0.95 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme) | 1811923850 ns | 1919983927 ns | 0.94 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme) | 3057262107 ns | 3210069008 ns | 0.95 |
| ViT base (256 x 256 x 3 x 4)/forward/CPU/Lux | 5829780775 ns | 3218575500.5 ns | 1.81 |
This comment was automatically generated by workflow using github-action-benchmark.
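
Each benchmark entry above lists two timings in nanoseconds followed by a ratio. The numbers are consistent with the ratio being the first timing divided by the second, rounded to two decimals, so a value above 1 would mean the first measurement was slower. The following is a minimal sketch of that arithmetic, not the tool's actual code; the function name `timing_ratio` is hypothetical, and reading the first value as the current run and the second as the previous run is an assumption.

```python
def timing_ratio(first_ns: float, second_ns: float) -> float:
    """Divide the first timing by the second and round to two decimals.

    Assumption: the first value is the current run and the second the
    previous run; the page itself does not label them.
    """
    return round(first_ns / second_ns, 2)


# Timings copied from the "ViT base (256 x 256 x 3 x 4)/forward/CPU/Lux" entry.
first_ns = 5829780775
second_ns = 3218575500.5

print(timing_ratio(first_ns, second_ns))  # 1.81, matching the listed ratio

# The same timings in seconds, which is easier to read than raw nanoseconds.
print(first_ns / 1e9, second_ns / 1e9)
```

Under this reading, the 1.81 entry would be the largest slowdown in the list and 0.73 (ViT small, batch 32, CPU/Lux) the largest speedup.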