Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Minor] Fix tensorclass prints #1218

Merged
merged 1 commit into from
Feb 13, 2025
Merged

[Minor] Fix tensorclass prints #1218

merged 1 commit into from
Feb 13, 2025

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Feb 13, 2025

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Feb 13, 2025
ghstack-source-id: 46d70d783e59f86f1720cd6bede12049de57d095
Pull Request resolved: #1218
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 13, 2025
@vmoens vmoens merged commit 619a6df into gh/vmoens/47/base Feb 13, 2025
32 of 35 checks passed
vmoens added a commit that referenced this pull request Feb 13, 2025
ghstack-source-id: 46d70d783e59f86f1720cd6bede12049de57d095
Pull Request resolved: #1218
@vmoens vmoens deleted the gh/vmoens/47/head branch February 13, 2025 16:13
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}16$. Worsened: $\large\color{#d91a1a}12$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 43.5110μs 20.7399μs 48.2162 KOps/s 48.9514 KOps/s $\color{#d91a1a}-1.50\%$
test_plain_set_stack_nested 59.4810μs 20.9319μs 47.7740 KOps/s 47.8534 KOps/s $\color{#d91a1a}-0.17\%$
test_plain_set_nested_inplace 87.8640μs 22.5249μs 44.3953 KOps/s 44.1562 KOps/s $\color{#35bf28}+0.54\%$
test_plain_set_stack_nested_inplace 77.0340μs 22.5800μs 44.2870 KOps/s 43.9668 KOps/s $\color{#35bf28}+0.73\%$
test_items 31.8200μs 4.2630μs 234.5766 KOps/s 239.4746 KOps/s $\color{#d91a1a}-2.05\%$
test_items_nested 0.7978ms 0.4078ms 2.4523 KOps/s 2.3994 KOps/s $\color{#35bf28}+2.20\%$
test_items_nested_locked 0.5514ms 0.4097ms 2.4409 KOps/s 2.3998 KOps/s $\color{#35bf28}+1.71\%$
test_items_nested_leaf 0.1429ms 78.4301μs 12.7502 KOps/s 12.8816 KOps/s $\color{#d91a1a}-1.02\%$
test_items_stack_nested 0.7587ms 0.4112ms 2.4320 KOps/s 2.3812 KOps/s $\color{#35bf28}+2.13\%$
test_items_stack_nested_leaf 0.1544ms 81.6204μs 12.2518 KOps/s 12.3959 KOps/s $\color{#d91a1a}-1.16\%$
test_items_stack_nested_locked 0.5279ms 0.4077ms 2.4528 KOps/s 2.3743 KOps/s $\color{#35bf28}+3.30\%$
test_keys 27.6210μs 3.4569μs 289.2729 KOps/s 285.7785 KOps/s $\color{#35bf28}+1.22\%$
test_keys_nested 0.2548ms 0.1647ms 6.0708 KOps/s 6.1111 KOps/s $\color{#d91a1a}-0.66\%$
test_keys_nested_locked 1.8167ms 0.1714ms 5.8336 KOps/s 5.8612 KOps/s $\color{#d91a1a}-0.47\%$
test_keys_nested_leaf 0.2536ms 0.1436ms 6.9661 KOps/s 7.0251 KOps/s $\color{#d91a1a}-0.84\%$
test_keys_stack_nested 0.2722ms 0.1635ms 6.1180 KOps/s 6.1937 KOps/s $\color{#d91a1a}-1.22\%$
test_keys_stack_nested_leaf 0.2771ms 0.1431ms 6.9865 KOps/s 7.3016 KOps/s $\color{#d91a1a}-4.32\%$
test_keys_stack_nested_locked 0.2544ms 0.1704ms 5.8676 KOps/s 5.9778 KOps/s $\color{#d91a1a}-1.84\%$
test_values 10.3854μs 1.0386μs 962.8022 KOps/s 966.4115 KOps/s $\color{#d91a1a}-0.37\%$
test_values_nested 0.1068ms 62.9791μs 15.8783 KOps/s 15.9531 KOps/s $\color{#d91a1a}-0.47\%$
test_values_nested_locked 0.1415ms 62.6013μs 15.9741 KOps/s 15.7403 KOps/s $\color{#35bf28}+1.49\%$
test_values_nested_leaf 0.1435ms 72.4662μs 13.7995 KOps/s 14.2854 KOps/s $\color{#d91a1a}-3.40\%$
test_values_stack_nested 0.1296ms 63.8690μs 15.6570 KOps/s 15.0596 KOps/s $\color{#35bf28}+3.97\%$
test_values_stack_nested_leaf 0.1368ms 73.1779μs 13.6653 KOps/s 14.4006 KOps/s $\textbf{\color{#d91a1a}-5.11\%}$
test_values_stack_nested_locked 0.1190ms 63.7664μs 15.6822 KOps/s 15.5213 KOps/s $\color{#35bf28}+1.04\%$
test_membership 16.9810μs 0.8609μs 1.1616 MOps/s 1.1526 MOps/s $\color{#35bf28}+0.78\%$
test_membership_nested 27.8920μs 2.9564μs 338.2542 KOps/s 338.7347 KOps/s $\color{#d91a1a}-0.14\%$
test_membership_nested_leaf 36.7190μs 2.9845μs 335.0597 KOps/s 325.2229 KOps/s $\color{#35bf28}+3.02\%$
test_membership_stacked_nested 43.2000μs 2.9345μs 340.7764 KOps/s 341.0266 KOps/s $\color{#d91a1a}-0.07\%$
test_membership_stacked_nested_leaf 48.8210μs 2.9714μs 336.5423 KOps/s 342.4397 KOps/s $\color{#d91a1a}-1.72\%$
test_membership_nested_last 33.0020μs 4.4457μs 224.9370 KOps/s 229.5586 KOps/s $\color{#d91a1a}-2.01\%$
test_membership_nested_leaf_last 76.8330μs 4.4324μs 225.6093 KOps/s 227.7016 KOps/s $\color{#d91a1a}-0.92\%$
test_membership_stacked_nested_last 42.0190μs 4.3848μs 228.0598 KOps/s 72.2829 KOps/s $\textbf{\color{#35bf28}+215.51\%}$
test_membership_stacked_nested_leaf_last 24.9660μs 4.4290μs 225.7863 KOps/s 73.0315 KOps/s $\textbf{\color{#35bf28}+209.16\%}$
test_nested_getleaf 62.7870μs 10.7841μs 92.7294 KOps/s 92.3981 KOps/s $\color{#35bf28}+0.36\%$
test_nested_get 58.0070μs 10.0557μs 99.4464 KOps/s 97.1912 KOps/s $\color{#35bf28}+2.32\%$
test_stacked_getleaf 69.8800μs 10.7266μs 93.2263 KOps/s 93.9040 KOps/s $\color{#d91a1a}-0.72\%$
test_stacked_get 42.5890μs 10.1319μs 98.6983 KOps/s 98.0913 KOps/s $\color{#35bf28}+0.62\%$
test_nested_getitemleaf 60.8730μs 11.2752μs 88.6903 KOps/s 87.5402 KOps/s $\color{#35bf28}+1.31\%$
test_nested_getitem 49.9630μs 10.8365μs 92.2810 KOps/s 92.2289 KOps/s $\color{#35bf28}+0.06\%$
test_stacked_getitemleaf 72.6860μs 11.3190μs 88.3470 KOps/s 88.6223 KOps/s $\color{#d91a1a}-0.31\%$
test_stacked_getitem 43.6720μs 10.7911μs 92.6693 KOps/s 92.5059 KOps/s $\color{#35bf28}+0.18\%$
test_lock_nested 0.7269ms 0.4222ms 2.3684 KOps/s 2.4193 KOps/s $\color{#d91a1a}-2.10\%$
test_lock_stack_nested 0.7129ms 0.4310ms 2.3201 KOps/s 2.3929 KOps/s $\color{#d91a1a}-3.04\%$
test_unlock_nested 0.8056ms 0.3437ms 2.9094 KOps/s 2.9070 KOps/s $\color{#35bf28}+0.08\%$
test_unlock_stack_nested 0.7072ms 0.3489ms 2.8664 KOps/s 2.9663 KOps/s $\color{#d91a1a}-3.37\%$
test_flatten_speed 0.2002ms 0.1039ms 9.6232 KOps/s 10.0275 KOps/s $\color{#d91a1a}-4.03\%$
test_unflatten_speed 0.9199ms 0.5332ms 1.8753 KOps/s 1.8916 KOps/s $\color{#d91a1a}-0.86\%$
test_common_ops 1.0315ms 0.8207ms 1.2185 KOps/s 1.2132 KOps/s $\color{#35bf28}+0.43\%$
test_creation 64.2900μs 2.5457μs 392.8132 KOps/s 400.2539 KOps/s $\color{#d91a1a}-1.86\%$
test_creation_empty 43.2710μs 12.6347μs 79.1468 KOps/s 85.4863 KOps/s $\textbf{\color{#d91a1a}-7.42\%}$
test_creation_nested_1 74.7290μs 16.0582μs 62.2736 KOps/s 64.6348 KOps/s $\color{#d91a1a}-3.65\%$
test_creation_nested_2 60.3530μs 20.4380μs 48.9284 KOps/s 50.9887 KOps/s $\color{#d91a1a}-4.04\%$
test_clone 63.2480μs 13.5011μs 74.0683 KOps/s 71.2157 KOps/s $\color{#35bf28}+4.01\%$
test_getitem[int] 0.8374ms 13.0131μs 76.8459 KOps/s 75.1348 KOps/s $\color{#35bf28}+2.28\%$
test_getitem[slice_int] 0.1493ms 25.8461μs 38.6906 KOps/s 38.5115 KOps/s $\color{#35bf28}+0.47\%$
test_getitem[range] 0.1703ms 51.2764μs 19.5021 KOps/s 19.2426 KOps/s $\color{#35bf28}+1.35\%$
test_getitem[tuple] 0.1343ms 20.3660μs 49.1014 KOps/s 46.4289 KOps/s $\textbf{\color{#35bf28}+5.76\%}$
test_getitem[list] 0.2784ms 46.9084μs 21.3182 KOps/s 21.3610 KOps/s $\color{#d91a1a}-0.20\%$
test_setitem_dim[int] 74.1880μs 26.8860μs 37.1940 KOps/s 38.6624 KOps/s $\color{#d91a1a}-3.80\%$
test_setitem_dim[slice_int] 94.1550μs 53.0454μs 18.8518 KOps/s 18.6889 KOps/s $\color{#35bf28}+0.87\%$
test_setitem_dim[range] 0.1431ms 78.6381μs 12.7165 KOps/s 13.0449 KOps/s $\color{#d91a1a}-2.52\%$
test_setitem_dim[tuple] 0.1078ms 42.7052μs 23.4163 KOps/s 23.9867 KOps/s $\color{#d91a1a}-2.38\%$
test_setitem 0.1176ms 20.8852μs 47.8809 KOps/s 47.0690 KOps/s $\color{#35bf28}+1.72\%$
test_set 0.1304ms 20.5303μs 48.7086 KOps/s 48.5491 KOps/s $\color{#35bf28}+0.33\%$
test_set_shared 4.4666ms 0.1866ms 5.3598 KOps/s 5.3083 KOps/s $\color{#35bf28}+0.97\%$
test_update 0.1610ms 23.3650μs 42.7990 KOps/s 42.0150 KOps/s $\color{#35bf28}+1.87\%$
test_update_nested 0.2344ms 35.0615μs 28.5213 KOps/s 27.9604 KOps/s $\color{#35bf28}+2.01\%$
test_update__nested 0.4393ms 33.2671μs 30.0597 KOps/s 28.3968 KOps/s $\textbf{\color{#35bf28}+5.86\%}$
test_set_nested 96.5300μs 23.2759μs 42.9629 KOps/s 42.3727 KOps/s $\color{#35bf28}+1.39\%$
test_set_nested_new 0.1369ms 28.3971μs 35.2149 KOps/s 34.5627 KOps/s $\color{#35bf28}+1.89\%$
test_select 0.1656ms 45.6074μs 21.9263 KOps/s 21.9395 KOps/s $\color{#d91a1a}-0.06\%$
test_select_nested 0.1449ms 62.7544μs 15.9351 KOps/s 16.0393 KOps/s $\color{#d91a1a}-0.65\%$
test_exclude_nested 0.1569ms 81.3445μs 12.2934 KOps/s 12.3360 KOps/s $\color{#d91a1a}-0.35\%$
test_empty[True] 0.7398ms 0.4133ms 2.4193 KOps/s 2.4220 KOps/s $\color{#d91a1a}-0.11\%$
test_empty[False] 8.9040μs 1.3883μs 720.2804 KOps/s 725.4287 KOps/s $\color{#d91a1a}-0.71\%$
test_unbind_speed 0.4711ms 0.2727ms 3.6674 KOps/s 3.6534 KOps/s $\color{#35bf28}+0.38\%$
test_unbind_speed_stack0 0.3981ms 0.2685ms 3.7242 KOps/s 3.8042 KOps/s $\color{#d91a1a}-2.10\%$
test_unbind_speed_stack1 0.1178s 0.7467ms 1.3391 KOps/s 1.3454 KOps/s $\color{#d91a1a}-0.47\%$
test_split 0.1183s 1.7701ms 564.9312 Ops/s 489.5105 Ops/s $\textbf{\color{#35bf28}+15.41\%}$
test_chunk 0.1253s 1.7876ms 559.3960 Ops/s 616.4269 Ops/s $\textbf{\color{#d91a1a}-9.25\%}$
test_consolidate_njt[False-None] 9.4343ms 8.4521ms 118.3135 Ops/s 116.9115 Ops/s $\color{#35bf28}+1.20\%$
test_creation[device0] 0.3111ms 93.0325μs 10.7489 KOps/s 10.4137 KOps/s $\color{#35bf28}+3.22\%$
test_creation_from_tensor 4.1990ms 98.9834μs 10.1027 KOps/s 10.0819 KOps/s $\color{#35bf28}+0.21\%$
test_add_one[memmap_tensor0] 0.1060ms 5.0214μs 199.1462 KOps/s 184.9328 KOps/s $\textbf{\color{#35bf28}+7.69\%}$
test_contiguous[memmap_tensor0] 9.9380μs 0.5024μs 1.9906 MOps/s 1.9748 MOps/s $\color{#35bf28}+0.80\%$
test_stack[memmap_tensor0] 33.8130μs 3.3618μs 297.4560 KOps/s 271.7475 KOps/s $\textbf{\color{#35bf28}+9.46\%}$
test_memmaptd_index 1.3842ms 0.2279ms 4.3875 KOps/s 4.1492 KOps/s $\textbf{\color{#35bf28}+5.74\%}$
test_memmaptd_index_astensor 0.6518ms 0.3161ms 3.1640 KOps/s 3.0426 KOps/s $\color{#35bf28}+3.99\%$
test_memmaptd_index_op 1.0753ms 0.5963ms 1.6769 KOps/s 1.6290 KOps/s $\color{#35bf28}+2.94\%$
test_serialize_model 0.1311s 0.1198s 8.3440 Ops/s 7.2198 Ops/s $\textbf{\color{#35bf28}+15.57\%}$
test_serialize_model_pickle 0.4980s 0.4052s 2.4677 Ops/s 2.4949 Ops/s $\color{#d91a1a}-1.09\%$
test_serialize_weights 0.1303s 0.1184s 8.4456 Ops/s 8.2239 Ops/s $\color{#35bf28}+2.70\%$
test_serialize_weights_returnearly 0.1908s 0.1641s 6.0957 Ops/s 6.1330 Ops/s $\color{#d91a1a}-0.61\%$
test_serialize_weights_pickle 0.6119s 0.4464s 2.2402 Ops/s 2.5591 Ops/s $\textbf{\color{#d91a1a}-12.46\%}$
test_serialize_weights_filesystem 0.1538s 0.1437s 6.9589 Ops/s 6.8784 Ops/s $\color{#35bf28}+1.17\%$
test_serialize_model_filesystem 0.2668s 0.1702s 5.8759 Ops/s 5.5584 Ops/s $\textbf{\color{#35bf28}+5.71\%}$
test_reshape_pytree 75.9310μs 26.1780μs 38.1999 KOps/s 36.5366 KOps/s $\color{#35bf28}+4.55\%$
test_reshape_td 0.1020ms 33.6322μs 29.7334 KOps/s 29.5465 KOps/s $\color{#35bf28}+0.63\%$
test_view_pytree 0.1227ms 27.1262μs 36.8647 KOps/s 37.8423 KOps/s $\color{#d91a1a}-2.58\%$
test_view_td 88.6250μs 41.3596μs 24.1782 KOps/s 24.3777 KOps/s $\color{#d91a1a}-0.82\%$
test_unbind_pytree 66.9650μs 29.7053μs 33.6640 KOps/s 33.3774 KOps/s $\color{#35bf28}+0.86\%$
test_unbind_td 0.3394ms 40.0043μs 24.9973 KOps/s 24.2832 KOps/s $\color{#35bf28}+2.94\%$
test_split_pytree 71.5930μs 28.9826μs 34.5034 KOps/s 33.4949 KOps/s $\color{#35bf28}+3.01\%$
test_split_td 0.5808ms 45.4965μs 21.9797 KOps/s 21.3031 KOps/s $\color{#35bf28}+3.18\%$
test_add_pytree 83.1950μs 35.8342μs 27.9063 KOps/s 27.0730 KOps/s $\color{#35bf28}+3.08\%$
test_add_td 0.1198ms 58.0068μs 17.2394 KOps/s 16.4776 KOps/s $\color{#35bf28}+4.62\%$
test_compile_add_one_nested[tensordict-compile] 0.1699ms 68.0617μs 14.6926 KOps/s 14.8298 KOps/s $\color{#d91a1a}-0.93\%$
test_compile_add_one_nested[tensordict-eager] 0.3803ms 0.1731ms 5.7757 KOps/s 5.7132 KOps/s $\color{#35bf28}+1.09\%$
test_compile_add_one_nested[pytree-compile] 0.1830ms 46.4693μs 21.5196 KOps/s 21.7823 KOps/s $\color{#d91a1a}-1.21\%$
test_compile_add_one_nested[pytree-eager] 0.2492ms 0.1189ms 8.4072 KOps/s 8.2079 KOps/s $\color{#35bf28}+2.43\%$
test_compile_copy_nested[tensordict-compile] 81.5520μs 28.8713μs 34.6364 KOps/s 35.1849 KOps/s $\color{#d91a1a}-1.56\%$
test_compile_copy_nested[tensordict-eager] 0.1150ms 59.1438μs 16.9080 KOps/s 16.8559 KOps/s $\color{#35bf28}+0.31\%$
test_compile_copy_nested[pytree-compile] 0.1570ms 81.2127μs 12.3133 KOps/s 12.3810 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_copy_nested[pytree-eager] 0.1296ms 67.1601μs 14.8898 KOps/s 14.6959 KOps/s $\color{#35bf28}+1.32\%$
test_compile_add_one_flat[tensordict-compile] 0.2004ms 0.1079ms 9.2711 KOps/s 9.3711 KOps/s $\color{#d91a1a}-1.07\%$
test_compile_add_one_flat[tensordict-eager] 0.3961ms 0.2168ms 4.6117 KOps/s 4.6281 KOps/s $\color{#d91a1a}-0.36\%$
test_compile_add_one_flat[tensorclass-compile] 0.1683ms 47.3958μs 21.0989 KOps/s 20.6634 KOps/s $\color{#35bf28}+2.11\%$
test_compile_add_one_flat[tensorclass-eager] 0.1557ms 67.5114μs 14.8123 KOps/s 14.4481 KOps/s $\color{#35bf28}+2.52\%$
test_compile_add_one_flat[pytree-compile] 0.2208ms 0.1010ms 9.9030 KOps/s 9.8635 KOps/s $\color{#35bf28}+0.40\%$
test_compile_add_one_flat[pytree-eager] 0.3418ms 0.2020ms 4.9517 KOps/s 4.8131 KOps/s $\color{#35bf28}+2.88\%$
test_compile_add_self_flat[tensordict-eager] 0.3983ms 0.2340ms 4.2741 KOps/s 4.2209 KOps/s $\color{#35bf28}+1.26\%$
test_compile_add_self_flat[tensordict-compile] 0.2806ms 0.1093ms 9.1473 KOps/s 9.1866 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_add_self_flat[tensorclass-eager] 0.2784ms 64.3523μs 15.5394 KOps/s 15.5621 KOps/s $\color{#d91a1a}-0.15\%$
test_compile_add_self_flat[tensorclass-compile] 0.2770ms 50.2747μs 19.8907 KOps/s 20.6403 KOps/s $\color{#d91a1a}-3.63\%$
test_compile_add_self_flat[pytree-eager] 0.3484ms 0.1584ms 6.3147 KOps/s 6.1279 KOps/s $\color{#35bf28}+3.05\%$
test_compile_add_self_flat[pytree-compile] 0.2218ms 0.1011ms 9.8936 KOps/s 9.7939 KOps/s $\color{#35bf28}+1.02\%$
test_compile_copy_flat[tensordict-compile] 68.7790μs 23.5543μs 42.4552 KOps/s 45.3385 KOps/s $\textbf{\color{#d91a1a}-6.36\%}$
test_compile_copy_flat[tensordict-eager] 0.1375ms 67.0927μs 14.9047 KOps/s 14.6740 KOps/s $\color{#35bf28}+1.57\%$
test_compile_copy_flat[pytree-compile] 0.1654ms 83.4595μs 11.9819 KOps/s 12.0869 KOps/s $\color{#d91a1a}-0.87\%$
test_compile_copy_flat[pytree-eager] 0.1490ms 68.9636μs 14.5004 KOps/s 14.6125 KOps/s $\color{#d91a1a}-0.77\%$
test_compile_assign_and_add[tensordict-compile] 0.4711ms 0.2186ms 4.5739 KOps/s 4.5572 KOps/s $\color{#35bf28}+0.37\%$
test_compile_assign_and_add[tensordict-eager] 1.6710ms 1.4056ms 711.4640 Ops/s 710.7093 Ops/s $\color{#35bf28}+0.11\%$
test_compile_assign_and_add[pytree-compile] 0.3680ms 0.2122ms 4.7117 KOps/s 4.7065 KOps/s $\color{#35bf28}+0.11\%$
test_compile_assign_and_add[pytree-eager] 1.4963ms 0.8348ms 1.1978 KOps/s 1.1750 KOps/s $\color{#35bf28}+1.94\%$
test_compile_assign_and_add_stack[compile] 0.7221ms 0.4668ms 2.1422 KOps/s 2.1520 KOps/s $\color{#d91a1a}-0.45\%$
test_compile_assign_and_add_stack[eager] 3.1537ms 2.7210ms 367.5088 Ops/s 355.9822 Ops/s $\color{#35bf28}+3.24\%$
test_compile_indexing[tensor-tensordict-compile] 0.1178ms 38.5462μs 25.9429 KOps/s 25.2474 KOps/s $\color{#35bf28}+2.75\%$
test_compile_indexing[tensor-tensordict-eager] 0.8013ms 33.6950μs 29.6780 KOps/s 28.5986 KOps/s $\color{#35bf28}+3.77\%$
test_compile_indexing[tensor-tensorclass-compile] 93.4640μs 31.8151μs 31.4316 KOps/s 31.6685 KOps/s $\color{#d91a1a}-0.75\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1018ms 24.1455μs 41.4156 KOps/s 41.5732 KOps/s $\color{#d91a1a}-0.38\%$
test_compile_indexing[tensor-pytree-compile] 96.4710μs 33.2629μs 30.0635 KOps/s 32.2324 KOps/s $\textbf{\color{#d91a1a}-6.73\%}$
test_compile_indexing[tensor-pytree-eager] 0.2258ms 23.7637μs 42.0811 KOps/s 40.9733 KOps/s $\color{#35bf28}+2.70\%$
test_compile_indexing[slice-tensordict-compile] 0.1495ms 55.0439μs 18.1673 KOps/s 18.4546 KOps/s $\color{#d91a1a}-1.56\%$
test_compile_indexing[slice-tensordict-eager] 0.3660ms 20.3234μs 49.2042 KOps/s 48.7622 KOps/s $\color{#35bf28}+0.91\%$
test_compile_indexing[slice-tensorclass-compile] 0.1274ms 46.7871μs 21.3734 KOps/s 21.2036 KOps/s $\color{#35bf28}+0.80\%$
test_compile_indexing[slice-tensorclass-eager] 77.7780μs 19.0401μs 52.5206 KOps/s 49.7512 KOps/s $\textbf{\color{#35bf28}+5.57\%}$
test_compile_indexing[slice-pytree-compile] 0.1159ms 48.1521μs 20.7675 KOps/s 20.8660 KOps/s $\color{#d91a1a}-0.47\%$
test_compile_indexing[slice-pytree-eager] 62.1860μs 18.9219μs 52.8488 KOps/s 51.7424 KOps/s $\color{#35bf28}+2.14\%$
test_compile_indexing[int-tensordict-compile] 0.1395ms 56.4447μs 17.7165 KOps/s 17.6094 KOps/s $\color{#35bf28}+0.61\%$
test_compile_indexing[int-tensordict-eager] 0.9328ms 19.8996μs 50.2522 KOps/s 46.9008 KOps/s $\textbf{\color{#35bf28}+7.15\%}$
test_compile_indexing[int-tensorclass-compile] 0.1301ms 47.4728μs 21.0647 KOps/s 21.1039 KOps/s $\color{#d91a1a}-0.19\%$
test_compile_indexing[int-tensorclass-eager] 60.1320μs 18.8794μs 52.9678 KOps/s 51.7849 KOps/s $\color{#35bf28}+2.28\%$
test_compile_indexing[int-pytree-compile] 0.1500ms 47.6968μs 20.9658 KOps/s 20.7599 KOps/s $\color{#35bf28}+0.99\%$
test_compile_indexing[int-pytree-eager] 84.2670μs 18.7496μs 53.3345 KOps/s 52.3665 KOps/s $\color{#35bf28}+1.85\%$
test_mod_add[eager] 0.1143ms 35.9790μs 27.7940 KOps/s 27.5221 KOps/s $\color{#35bf28}+0.99\%$
test_mod_add[compile] 0.1600ms 66.2418μs 15.0962 KOps/s 14.6981 KOps/s $\color{#35bf28}+2.71\%$
test_mod_add[compile-overhead] 0.1879ms 65.0782μs 15.3661 KOps/s 14.9643 KOps/s $\color{#35bf28}+2.69\%$
test_mod_wrap[eager] 0.4364ms 0.2235ms 4.4750 KOps/s 4.2981 KOps/s $\color{#35bf28}+4.12\%$
test_mod_wrap[compile] 2.0480ms 0.2339ms 4.2746 KOps/s 4.1157 KOps/s $\color{#35bf28}+3.86\%$
test_mod_wrap[compile-overhead] 0.5708ms 0.2289ms 4.3681 KOps/s 4.2565 KOps/s $\color{#35bf28}+2.62\%$
test_mod_wrap_and_backward[eager] 12.8066ms 11.3993ms 87.7245 Ops/s 70.4749 Ops/s $\textbf{\color{#35bf28}+24.48\%}$
test_mod_wrap_and_backward[compile] 13.1286ms 11.6169ms 86.0818 Ops/s 81.0688 Ops/s $\textbf{\color{#35bf28}+6.18\%}$
test_mod_wrap_and_backward[compile-overhead] 12.6164ms 11.4291ms 87.4960 Ops/s 82.2672 Ops/s $\textbf{\color{#35bf28}+6.36\%}$
test_seq_add[eager] 0.2188ms 0.1175ms 8.5104 KOps/s 8.1959 KOps/s $\color{#35bf28}+3.84\%$
test_seq_add[compile] 0.2579ms 79.4939μs 12.5796 KOps/s 12.7747 KOps/s $\color{#d91a1a}-1.53\%$
test_seq_add[compile-overhead] 0.2076ms 77.2605μs 12.9432 KOps/s 13.0652 KOps/s $\color{#d91a1a}-0.93\%$
test_seq_wrap[eager] 0.7475ms 0.4456ms 2.2444 KOps/s 2.1608 KOps/s $\color{#35bf28}+3.87\%$
test_seq_wrap[compile] 0.5000ms 0.2534ms 3.9462 KOps/s 3.9348 KOps/s $\color{#35bf28}+0.29\%$
test_seq_wrap[compile-overhead] 0.5003ms 0.2517ms 3.9724 KOps/s 3.9944 KOps/s $\color{#d91a1a}-0.55\%$
test_func_call_runtime[False-eager] 0.9298ms 0.5499ms 1.8184 KOps/s 1.8151 KOps/s $\color{#35bf28}+0.18\%$
test_func_call_runtime[False-compile] 0.6495ms 0.4513ms 2.2158 KOps/s 2.1708 KOps/s $\color{#35bf28}+2.07\%$
test_func_call_runtime[False-compile-overhead] 0.6652ms 0.4519ms 2.2126 KOps/s 2.1723 KOps/s $\color{#35bf28}+1.86\%$
test_func_call_runtime[True-eager] 1.0784ms 0.7690ms 1.3004 KOps/s 1.2988 KOps/s $\color{#35bf28}+0.12\%$
test_func_call_runtime[True-compile] 0.6337ms 0.4756ms 2.1025 KOps/s 2.0804 KOps/s $\color{#35bf28}+1.06\%$
test_func_call_runtime[True-compile-overhead] 0.6664ms 0.4739ms 2.1103 KOps/s 2.0663 KOps/s $\color{#35bf28}+2.13\%$
test_func_call_cm_runtime[False-eager] 0.9694ms 0.5441ms 1.8379 KOps/s 1.8145 KOps/s $\color{#35bf28}+1.29\%$
test_func_call_cm_runtime[False-compile] 0.6043ms 0.4528ms 2.2084 KOps/s 2.1960 KOps/s $\color{#35bf28}+0.57\%$
test_func_call_cm_runtime[False-compile-overhead] 0.7988ms 0.4513ms 2.2157 KOps/s 2.1934 KOps/s $\color{#35bf28}+1.02\%$
test_func_call_cm_runtime[True-eager] 1.9589ms 0.9321ms 1.0728 KOps/s 1.0875 KOps/s $\color{#d91a1a}-1.35\%$
test_func_call_cm_runtime[True-compile] 1.1938ms 0.8155ms 1.2263 KOps/s 1.2179 KOps/s $\color{#35bf28}+0.69\%$
test_func_call_cm_runtime[True-compile-overhead] 1.1832ms 0.8233ms 1.2147 KOps/s 1.2132 KOps/s $\color{#35bf28}+0.12\%$
test_vmap_func_call_cm_runtime[eager] 3.0304ms 1.9994ms 500.1518 Ops/s 496.6516 Ops/s $\color{#35bf28}+0.70\%$
test_vmap_func_call_cm_runtime[compile] 0.9535ms 0.5516ms 1.8128 KOps/s 1.7699 KOps/s $\color{#35bf28}+2.42\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.7300ms 0.5474ms 1.8267 KOps/s 1.8061 KOps/s $\color{#35bf28}+1.14\%$
test_distributed 0.3797ms 0.1285ms 7.7813 KOps/s 7.6012 KOps/s $\color{#35bf28}+2.37\%$
test_tdmodule 65.0910μs 26.5371μs 37.6831 KOps/s 37.6261 KOps/s $\color{#35bf28}+0.15\%$
test_tdmodule_dispatch 0.1353ms 49.7269μs 20.1098 KOps/s 20.6253 KOps/s $\color{#d91a1a}-2.50\%$
test_tdseq 57.4070μs 29.7673μs 33.5939 KOps/s 32.9453 KOps/s $\color{#35bf28}+1.97\%$
test_tdseq_dispatch 0.1067ms 56.4228μs 17.7233 KOps/s 18.1770 KOps/s $\color{#d91a1a}-2.50\%$
test_instantiation_functorch 1.8800ms 1.5779ms 633.7665 Ops/s 631.4339 Ops/s $\color{#35bf28}+0.37\%$
test_exec_functorch 0.3340ms 0.1850ms 5.4063 KOps/s 5.4359 KOps/s $\color{#d91a1a}-0.54\%$
test_exec_functional_call 0.3431ms 0.1742ms 5.7391 KOps/s 5.7151 KOps/s $\color{#35bf28}+0.42\%$
test_exec_td_decorator 0.5114ms 0.2399ms 4.1689 KOps/s 4.1209 KOps/s $\color{#35bf28}+1.16\%$
test_vmap_mlp_speed_decorator[True-True] 0.9270ms 0.6778ms 1.4754 KOps/s 1.4174 KOps/s $\color{#35bf28}+4.09\%$
test_vmap_mlp_speed_decorator[True-False] 1.0115ms 0.6813ms 1.4678 KOps/s 1.4689 KOps/s $\color{#d91a1a}-0.07\%$
test_vmap_mlp_speed_decorator[False-True] 1.4166ms 0.5531ms 1.8081 KOps/s 1.8166 KOps/s $\color{#d91a1a}-0.47\%$
test_vmap_mlp_speed_decorator[False-False] 0.9789ms 0.5500ms 1.8183 KOps/s 1.8197 KOps/s $\color{#d91a1a}-0.08\%$
test_to_module_speed[True] 2.2132ms 1.3849ms 722.0675 Ops/s 732.6107 Ops/s $\color{#d91a1a}-1.44\%$
test_to_module_speed[False] 1.9657ms 1.3356ms 748.7108 Ops/s 752.2878 Ops/s $\color{#d91a1a}-0.48\%$
test_tc_init 89.5670μs 49.2901μs 20.2880 KOps/s 21.4014 KOps/s $\textbf{\color{#d91a1a}-5.20\%}$
test_tc_init_nested 0.1723ms 98.8640μs 10.1149 KOps/s 10.9395 KOps/s $\textbf{\color{#d91a1a}-7.54\%}$
test_tc_first_layer_tensor 24.4550μs 1.6100μs 621.1069 KOps/s 614.0215 KOps/s $\color{#35bf28}+1.15\%$
test_tc_first_layer_nontensor 20.2880μs 4.7497μs 210.5402 KOps/s 209.5067 KOps/s $\color{#35bf28}+0.49\%$
test_tc_second_layer_tensor 45.0340μs 2.8974μs 345.1384 KOps/s 346.1637 KOps/s $\color{#d91a1a}-0.30\%$
test_tc_second_layer_nontensor 43.4320μs 6.1524μs 162.5371 KOps/s 163.7864 KOps/s $\color{#d91a1a}-0.76\%$
test_unbind 0.2810s 15.3510ms 65.1422 Ops/s 71.7735 Ops/s $\textbf{\color{#d91a1a}-9.24\%}$
test_full_like 13.6577ms 11.0542ms 90.4632 Ops/s 105.0875 Ops/s $\textbf{\color{#d91a1a}-13.92\%}$
test_zeros_like 6.0950ms 3.8084ms 262.5801 Ops/s 268.3161 Ops/s $\color{#d91a1a}-2.14\%$
test_ones_like 6.1373ms 4.1016ms 243.8068 Ops/s 227.7247 Ops/s $\textbf{\color{#35bf28}+7.06\%}$
test_clone 9.2285ms 6.3450ms 157.6049 Ops/s 151.7601 Ops/s $\color{#35bf28}+3.85\%$
test_squeeze 91.9020μs 12.8704μs 77.6977 KOps/s 79.9209 KOps/s $\color{#d91a1a}-2.78\%$
test_unsqueeze 0.2938ms 94.7268μs 10.5567 KOps/s 10.4274 KOps/s $\color{#35bf28}+1.24\%$
test_split 0.4217ms 0.1993ms 5.0168 KOps/s 5.0318 KOps/s $\color{#d91a1a}-0.30\%$
test_permute 0.4326ms 0.2091ms 4.7834 KOps/s 4.8638 KOps/s $\color{#d91a1a}-1.65\%$
test_stack 35.6618ms 28.4037ms 35.2066 Ops/s 37.8296 Ops/s $\textbf{\color{#d91a1a}-6.93\%}$
test_cat 39.5340ms 28.8088ms 34.7116 Ops/s 36.8266 Ops/s $\textbf{\color{#d91a1a}-5.74\%}$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}10$. Worsened: $\large\color{#d91a1a}16$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 46.6910μs 12.8197μs 78.0049 KOps/s 77.6266 KOps/s $\color{#35bf28}+0.49\%$
test_plain_set_stack_nested 32.7900μs 13.0066μs 76.8842 KOps/s 77.9717 KOps/s $\color{#d91a1a}-1.39\%$
test_plain_set_nested_inplace 41.7800μs 14.0059μs 71.3983 KOps/s 71.1922 KOps/s $\color{#35bf28}+0.29\%$
test_plain_set_stack_nested_inplace 46.8010μs 14.0055μs 71.4005 KOps/s 71.9786 KOps/s $\color{#d91a1a}-0.80\%$
test_items 31.2110μs 2.8687μs 348.5843 KOps/s 342.9265 KOps/s $\color{#35bf28}+1.65\%$
test_items_nested 0.3905ms 0.3627ms 2.7573 KOps/s 2.7307 KOps/s $\color{#35bf28}+0.97\%$
test_items_nested_locked 0.4186ms 0.3672ms 2.7235 KOps/s 2.7374 KOps/s $\color{#d91a1a}-0.51\%$
test_items_nested_leaf 85.4610μs 60.4981μs 16.5295 KOps/s 16.5614 KOps/s $\color{#d91a1a}-0.19\%$
test_items_stack_nested 0.4110ms 0.3631ms 2.7542 KOps/s 2.7422 KOps/s $\color{#35bf28}+0.44\%$
test_items_stack_nested_leaf 0.1028ms 61.7171μs 16.2030 KOps/s 16.1809 KOps/s $\color{#35bf28}+0.14\%$
test_items_stack_nested_locked 0.4065ms 0.3664ms 2.7292 KOps/s 2.7703 KOps/s $\color{#d91a1a}-1.48\%$
test_keys 31.0500μs 3.4401μs 290.6855 KOps/s 291.2843 KOps/s $\color{#d91a1a}-0.21\%$
test_keys_nested 0.1239ms 88.1443μs 11.3450 KOps/s 11.5001 KOps/s $\color{#d91a1a}-1.35\%$
test_keys_nested_locked 0.7383ms 93.5999μs 10.6838 KOps/s 10.7913 KOps/s $\color{#d91a1a}-1.00\%$
test_keys_nested_leaf 0.1202ms 79.1114μs 12.6404 KOps/s 12.7969 KOps/s $\color{#d91a1a}-1.22\%$
test_keys_stack_nested 0.1116ms 87.3500μs 11.4482 KOps/s 11.4506 KOps/s $\color{#d91a1a}-0.02\%$
test_keys_stack_nested_leaf 0.1093ms 78.8113μs 12.6885 KOps/s 12.6285 KOps/s $\color{#35bf28}+0.48\%$
test_keys_stack_nested_locked 0.1411ms 91.9918μs 10.8705 KOps/s 10.7076 KOps/s $\color{#35bf28}+1.52\%$
test_values 5.5383μs 0.8499μs 1.1766 MOps/s 1.1711 MOps/s $\color{#35bf28}+0.47\%$
test_values_nested 64.9510μs 37.3570μs 26.7688 KOps/s 26.7874 KOps/s $\color{#d91a1a}-0.07\%$
test_values_nested_locked 65.1210μs 39.1621μs 25.5349 KOps/s 25.5285 KOps/s $\color{#35bf28}+0.03\%$
test_values_nested_leaf 68.1610μs 42.4447μs 23.5601 KOps/s 23.4273 KOps/s $\color{#35bf28}+0.57\%$
test_values_stack_nested 72.7610μs 37.9922μs 26.3212 KOps/s 26.4806 KOps/s $\color{#d91a1a}-0.60\%$
test_values_stack_nested_leaf 82.2910μs 42.7062μs 23.4158 KOps/s 23.4554 KOps/s $\color{#d91a1a}-0.17\%$
test_values_stack_nested_locked 66.8610μs 39.7887μs 25.1328 KOps/s 25.4041 KOps/s $\color{#d91a1a}-1.07\%$
test_membership 1.8020μs 0.5001μs 1.9998 MOps/s 1.9855 MOps/s $\color{#35bf28}+0.72\%$
test_membership_nested 16.9050μs 2.0380μs 490.6659 KOps/s 477.5626 KOps/s $\color{#35bf28}+2.74\%$
test_membership_nested_leaf 15.5405μs 2.0460μs 488.7492 KOps/s 469.8962 KOps/s $\color{#35bf28}+4.01\%$
test_membership_stacked_nested 28.6500μs 2.1036μs 475.3654 KOps/s 478.6299 KOps/s $\color{#d91a1a}-0.68\%$
test_membership_stacked_nested_leaf 37.3600μs 2.1206μs 471.5688 KOps/s 473.0125 KOps/s $\color{#d91a1a}-0.31\%$
test_membership_nested_last 29.0800μs 3.1143μs 321.0946 KOps/s 320.2583 KOps/s $\color{#35bf28}+0.26\%$
test_membership_nested_leaf_last 39.7110μs 3.1279μs 319.7076 KOps/s 320.1354 KOps/s $\color{#d91a1a}-0.13\%$
test_membership_stacked_nested_last 33.1110μs 8.2467μs 121.2604 KOps/s 324.2907 KOps/s $\textbf{\color{#d91a1a}-62.61\%}$
test_membership_stacked_nested_leaf_last 42.3900μs 8.2558μs 121.1275 KOps/s 322.9279 KOps/s $\textbf{\color{#d91a1a}-62.49\%}$
test_nested_getleaf 31.2110μs 6.1401μs 162.8643 KOps/s 162.3877 KOps/s $\color{#35bf28}+0.29\%$
test_nested_get 40.7110μs 5.9638μs 167.6771 KOps/s 168.5078 KOps/s $\color{#d91a1a}-0.49\%$
test_stacked_getleaf 37.0300μs 6.1760μs 161.9160 KOps/s 161.3323 KOps/s $\color{#35bf28}+0.36\%$
test_stacked_get 47.1500μs 5.8143μs 171.9889 KOps/s 171.1380 KOps/s $\color{#35bf28}+0.50\%$
test_nested_getitemleaf 28.8710μs 6.4294μs 155.5360 KOps/s 154.8652 KOps/s $\color{#35bf28}+0.43\%$
test_nested_getitem 44.0400μs 6.1075μs 163.7341 KOps/s 164.6926 KOps/s $\color{#d91a1a}-0.58\%$
test_stacked_getitemleaf 39.9100μs 6.4178μs 155.8158 KOps/s 156.3829 KOps/s $\color{#d91a1a}-0.36\%$
test_stacked_getitem 33.2500μs 6.0063μs 166.4924 KOps/s 165.6157 KOps/s $\color{#35bf28}+0.53\%$
test_lock_nested 9.0842ms 0.3439ms 2.9080 KOps/s 3.0054 KOps/s $\color{#d91a1a}-3.24\%$
test_lock_stack_nested 0.3751ms 0.3360ms 2.9764 KOps/s 2.9620 KOps/s $\color{#35bf28}+0.49\%$
test_unlock_nested 0.4716ms 0.2783ms 3.5930 KOps/s 3.6283 KOps/s $\color{#d91a1a}-0.97\%$
test_unlock_stack_nested 0.3137ms 0.2739ms 3.6508 KOps/s 3.6384 KOps/s $\color{#35bf28}+0.34\%$
test_flatten_speed 0.1260ms 77.4830μs 12.9061 KOps/s 13.1444 KOps/s $\color{#d91a1a}-1.81\%$
test_unflatten_speed 0.3611ms 0.3183ms 3.1415 KOps/s 3.1649 KOps/s $\color{#d91a1a}-0.74\%$
test_common_ops 0.9229ms 0.6238ms 1.6030 KOps/s 1.6260 KOps/s $\color{#d91a1a}-1.41\%$
test_creation 97.4510μs 1.7722μs 564.2782 KOps/s 577.8826 KOps/s $\color{#d91a1a}-2.35\%$
test_creation_empty 34.5000μs 9.2647μs 107.9371 KOps/s 107.3996 KOps/s $\color{#35bf28}+0.50\%$
test_creation_nested_1 40.7700μs 10.9157μs 91.6107 KOps/s 90.7857 KOps/s $\color{#35bf28}+0.91\%$
test_creation_nested_2 36.1400μs 13.6680μs 73.1634 KOps/s 73.2221 KOps/s $\color{#d91a1a}-0.08\%$
test_clone 42.5410μs 10.3707μs 96.4259 KOps/s 99.5023 KOps/s $\color{#d91a1a}-3.09\%$
test_getitem[int] 1.2541ms 10.5132μs 95.1185 KOps/s 96.0944 KOps/s $\color{#d91a1a}-1.02\%$
test_getitem[slice_int] 0.1071ms 20.2499μs 49.3830 KOps/s 49.7169 KOps/s $\color{#d91a1a}-0.67\%$
test_getitem[range] 0.1264ms 37.5711μs 26.6162 KOps/s 27.0442 KOps/s $\color{#d91a1a}-1.58\%$
test_getitem[tuple] 0.1064ms 17.6363μs 56.7013 KOps/s 56.8018 KOps/s $\color{#d91a1a}-0.18\%$
test_getitem[list] 0.1986ms 32.6527μs 30.6254 KOps/s 30.3593 KOps/s $\color{#35bf28}+0.88\%$
test_setitem_dim[int] 40.1500μs 18.4766μs 54.1225 KOps/s 53.2497 KOps/s $\color{#35bf28}+1.64\%$
test_setitem_dim[slice_int] 0.1300ms 37.6816μs 26.5381 KOps/s 26.4219 KOps/s $\color{#35bf28}+0.44\%$
test_setitem_dim[range] 89.4700μs 53.1458μs 18.8162 KOps/s 18.9863 KOps/s $\color{#d91a1a}-0.90\%$
test_setitem_dim[tuple] 60.2900μs 30.9145μs 32.3473 KOps/s 31.0964 KOps/s $\color{#35bf28}+4.02\%$
test_setitem 71.3110μs 14.9975μs 66.6778 KOps/s 65.1013 KOps/s $\color{#35bf28}+2.42\%$
test_set 74.0210μs 14.7248μs 67.9124 KOps/s 68.8269 KOps/s $\color{#d91a1a}-1.33\%$
test_set_shared 0.5283ms 0.1556ms 6.4274 KOps/s 6.3747 KOps/s $\color{#35bf28}+0.83\%$
test_update 0.4441ms 18.3971μs 54.3565 KOps/s 55.7534 KOps/s $\color{#d91a1a}-2.51\%$
test_update_nested 78.3310μs 24.6617μs 40.5488 KOps/s 42.7758 KOps/s $\textbf{\color{#d91a1a}-5.21\%}$
test_update__nested 0.5426ms 24.6018μs 40.6475 KOps/s 41.9866 KOps/s $\color{#d91a1a}-3.19\%$
test_set_nested 58.3510μs 15.7360μs 63.5486 KOps/s 63.2127 KOps/s $\color{#35bf28}+0.53\%$
test_set_nested_new 72.9800μs 18.0592μs 55.3736 KOps/s 54.6831 KOps/s $\color{#35bf28}+1.26\%$
test_select 71.9300μs 29.1502μs 34.3051 KOps/s 32.8960 KOps/s $\color{#35bf28}+4.28\%$
test_select_nested 73.0510μs 43.4908μs 22.9934 KOps/s 22.9850 KOps/s $\color{#35bf28}+0.04\%$
test_exclude_nested 0.1164ms 61.2723μs 16.3206 KOps/s 15.8147 KOps/s $\color{#35bf28}+3.20\%$
test_empty[True] 0.3796ms 0.2922ms 3.4219 KOps/s 3.3920 KOps/s $\color{#35bf28}+0.88\%$
test_empty[False] 2.8320μs 0.8163μs 1.2250 MOps/s 1.2281 MOps/s $\color{#d91a1a}-0.25\%$
test_to 98.5110μs 56.4591μs 17.7119 KOps/s 18.0457 KOps/s $\color{#d91a1a}-1.85\%$
test_to_nonblocking 94.2510μs 45.8922μs 21.7902 KOps/s 21.7184 KOps/s $\color{#35bf28}+0.33\%$
test_unbind_speed 0.2832ms 0.2376ms 4.2091 KOps/s 4.2416 KOps/s $\color{#d91a1a}-0.77\%$
test_unbind_speed_stack0 0.3077ms 0.2315ms 4.3203 KOps/s 4.2868 KOps/s $\color{#35bf28}+0.78\%$
test_unbind_speed_stack1 93.9816ms 0.8160ms 1.2255 KOps/s 1.3884 KOps/s $\textbf{\color{#d91a1a}-11.73\%}$
test_split 1.5602ms 1.4319ms 698.3579 Ops/s 637.2861 Ops/s $\textbf{\color{#35bf28}+9.58\%}$
test_chunk 96.8348ms 1.7120ms 584.1280 Ops/s 634.7362 Ops/s $\textbf{\color{#d91a1a}-7.97\%}$
test_consolidate[False-None] 3.3610ms 2.6599ms 375.9478 Ops/s 376.3507 Ops/s $\color{#d91a1a}-0.11\%$
test_consolidate[default-None] 1.8301ms 1.7007ms 587.9896 Ops/s 590.9999 Ops/s $\color{#d91a1a}-0.51\%$
test_consolidate[reduce-overhead-None] 1.8243ms 1.7226ms 580.5137 Ops/s 585.6753 Ops/s $\color{#d91a1a}-0.88\%$
test_consolidate_njt[False-None] 6.7027ms 6.4383ms 155.3201 Ops/s 158.6316 Ops/s $\color{#d91a1a}-2.09\%$
test_to[False-False-None] 1.7179ms 1.6461ms 607.5092 Ops/s 616.0839 Ops/s $\color{#d91a1a}-1.39\%$
test_to[True-False-None] 1.5544ms 1.3148ms 760.5678 Ops/s 758.0677 Ops/s $\color{#35bf28}+0.33\%$
test_to[within-False-None] 4.3359ms 4.1322ms 242.0034 Ops/s 180.4394 Ops/s $\textbf{\color{#35bf28}+34.12\%}$
test_to[True-default-None] 5.3481ms 5.0809ms 196.8147 Ops/s 191.8695 Ops/s $\color{#35bf28}+2.58\%$
test_to_njt[False-False-None] 7.0061ms 6.8347ms 146.3124 Ops/s 140.0859 Ops/s $\color{#35bf28}+4.44\%$
test_to_njt[True-False-None] 5.4991ms 5.3425ms 187.1770 Ops/s 179.0879 Ops/s $\color{#35bf28}+4.52\%$
test_to_njt[within-False-None] 11.9047ms 11.8188ms 84.6107 Ops/s 75.7825 Ops/s $\textbf{\color{#35bf28}+11.65\%}$
test_creation[device0] 0.4613ms 78.8775μs 12.6779 KOps/s 12.1032 KOps/s $\color{#35bf28}+4.75\%$
test_creation_from_tensor 0.4534ms 83.3281μs 12.0008 KOps/s 11.5392 KOps/s $\color{#35bf28}+4.00\%$
test_add_one[memmap_tensor0] 0.5673ms 6.4422μs 155.2269 KOps/s 155.1310 KOps/s $\color{#35bf28}+0.06\%$
test_contiguous[memmap_tensor0] 1.8220μs 0.4260μs 2.3475 MOps/s 2.3845 MOps/s $\color{#d91a1a}-1.56\%$
test_stack[memmap_tensor0] 38.6500μs 4.2067μs 237.7186 KOps/s 237.8749 KOps/s $\color{#d91a1a}-0.07\%$
test_memmaptd_index 1.5161ms 0.2356ms 4.2445 KOps/s 4.1825 KOps/s $\color{#35bf28}+1.48\%$
test_memmaptd_index_astensor 0.4297ms 0.2955ms 3.3845 KOps/s 3.3673 KOps/s $\color{#35bf28}+0.51\%$
test_memmaptd_index_op 0.7018ms 0.5701ms 1.7541 KOps/s 1.7038 KOps/s $\color{#35bf28}+2.96\%$
test_serialize_model 0.4389s 0.1741s 5.7429 Ops/s 7.6821 Ops/s $\textbf{\color{#d91a1a}-25.24\%}$
test_serialize_model_pickle 1.3486s 1.1865s 0.8428 Ops/s 0.8222 Ops/s $\color{#35bf28}+2.51\%$
test_serialize_weights 0.1307s 0.1295s 7.7248 Ops/s 7.7217 Ops/s $\color{#35bf28}+0.04\%$
test_serialize_weights_returnearly 0.3176s 54.8938ms 18.2170 Ops/s 11.3965 Ops/s $\textbf{\color{#35bf28}+59.85\%}$
test_serialize_weights_pickle 1.3608s 1.2169s 0.8218 Ops/s 0.8223 Ops/s $\color{#d91a1a}-0.07\%$
test_reshape_pytree 52.5100μs 22.4259μs 44.5913 KOps/s 45.1960 KOps/s $\color{#d91a1a}-1.34\%$
test_reshape_td 66.9910μs 26.6512μs 37.5218 KOps/s 37.0855 KOps/s $\color{#35bf28}+1.18\%$
test_view_pytree 58.3810μs 23.1660μs 43.1667 KOps/s 45.9772 KOps/s $\textbf{\color{#d91a1a}-6.11\%}$
test_view_td 59.7110μs 31.8349μs 31.4121 KOps/s 31.0110 KOps/s $\color{#35bf28}+1.29\%$
test_unbind_pytree 56.9300μs 29.3854μs 34.0305 KOps/s 36.0780 KOps/s $\textbf{\color{#d91a1a}-5.68\%}$
test_unbind_td 0.7977ms 36.9769μs 27.0439 KOps/s 27.5110 KOps/s $\color{#d91a1a}-1.70\%$
test_split_pytree 69.8310μs 30.7637μs 32.5058 KOps/s 33.3172 KOps/s $\color{#d91a1a}-2.44\%$
test_split_td 0.7814ms 38.6584μs 25.8676 KOps/s 25.1133 KOps/s $\color{#35bf28}+3.00\%$
test_add_pytree 81.7510μs 34.3711μs 29.0942 KOps/s 29.3666 KOps/s $\color{#d91a1a}-0.93\%$
test_add_td 83.4010μs 49.0194μs 20.4001 KOps/s 19.8770 KOps/s $\color{#35bf28}+2.63\%$
test_compile_add_one_nested[tensordict-compile] 0.2590ms 0.1240ms 8.0652 KOps/s 7.9965 KOps/s $\color{#35bf28}+0.86\%$
test_compile_add_one_nested[tensordict-eager] 0.2618ms 0.1357ms 7.3690 KOps/s 7.3616 KOps/s $\color{#35bf28}+0.10\%$
test_compile_add_one_nested[pytree-compile] 0.2330ms 96.7780μs 10.3329 KOps/s 10.4852 KOps/s $\color{#d91a1a}-1.45\%$
test_compile_add_one_nested[pytree-eager] 1.3535ms 0.1494ms 6.6951 KOps/s 6.6517 KOps/s $\color{#35bf28}+0.65\%$
test_compile_copy_nested[tensordict-compile] 48.3610μs 24.6590μs 40.5531 KOps/s 42.2364 KOps/s $\color{#d91a1a}-3.99\%$
test_compile_copy_nested[tensordict-eager] 62.4710μs 29.1890μs 34.2594 KOps/s 34.4074 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_copy_nested[pytree-compile] 0.4636ms 63.8292μs 15.6668 KOps/s 15.7686 KOps/s $\color{#d91a1a}-0.65\%$
test_compile_copy_nested[pytree-eager] 86.6310μs 48.8225μs 20.4824 KOps/s 20.4516 KOps/s $\color{#35bf28}+0.15\%$
test_compile_add_one_flat[tensordict-compile] 0.2820ms 0.1449ms 6.9022 KOps/s 7.0254 KOps/s $\color{#d91a1a}-1.75\%$
test_compile_add_one_flat[tensordict-eager] 0.3287ms 0.2186ms 4.5744 KOps/s 4.6617 KOps/s $\color{#d91a1a}-1.87\%$
test_compile_add_one_flat[tensorclass-compile] 0.2181ms 99.6630μs 10.0338 KOps/s 10.2020 KOps/s $\color{#d91a1a}-1.65\%$
test_compile_add_one_flat[tensorclass-eager] 0.1226ms 55.9198μs 17.8827 KOps/s 17.8563 KOps/s $\color{#35bf28}+0.15\%$
test_compile_add_one_flat[pytree-compile] 0.1785ms 0.1366ms 7.3230 KOps/s 7.4825 KOps/s $\color{#d91a1a}-2.13\%$
test_compile_add_one_flat[pytree-eager] 0.5657ms 0.4817ms 2.0761 KOps/s 2.0971 KOps/s $\color{#d91a1a}-1.00\%$
test_compile_add_self_flat[tensordict-eager] 0.3909ms 0.2620ms 3.8165 KOps/s 3.8486 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_add_self_flat[tensordict-compile] 0.2076ms 0.1481ms 6.7537 KOps/s 6.9986 KOps/s $\color{#d91a1a}-3.50\%$
test_compile_add_self_flat[tensorclass-eager] 0.1646ms 71.5820μs 13.9700 KOps/s 14.5332 KOps/s $\color{#d91a1a}-3.88\%$
test_compile_add_self_flat[tensorclass-compile] 0.1398ms 98.0856μs 10.1952 KOps/s 10.1248 KOps/s $\color{#35bf28}+0.70\%$
test_compile_add_self_flat[pytree-eager] 0.4715ms 0.4058ms 2.4642 KOps/s 2.5089 KOps/s $\color{#d91a1a}-1.78\%$
test_compile_add_self_flat[pytree-compile] 0.1868ms 0.1392ms 7.1859 KOps/s 7.5236 KOps/s $\color{#d91a1a}-4.49\%$
test_compile_copy_flat[tensordict-compile] 50.0400μs 19.3967μs 51.5551 KOps/s 57.8099 KOps/s $\textbf{\color{#d91a1a}-10.82\%}$
test_compile_copy_flat[tensordict-eager] 61.4500μs 31.6369μs 31.6086 KOps/s 32.0859 KOps/s $\color{#d91a1a}-1.49\%$
test_compile_copy_flat[pytree-compile] 0.2083ms 70.0115μs 14.2834 KOps/s 14.4031 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_copy_flat[pytree-eager] 81.3010μs 52.1833μs 19.1632 KOps/s 19.2648 KOps/s $\color{#d91a1a}-0.53\%$
test_compile_assign_and_add[tensordict-compile] 1.6113ms 0.3913ms 2.5555 KOps/s 2.2042 KOps/s $\textbf{\color{#35bf28}+15.94\%}$
test_compile_assign_and_add[tensordict-eager] 2.9061ms 2.5296ms 395.3241 Ops/s 388.6302 Ops/s $\color{#35bf28}+1.72\%$
test_compile_assign_and_add[pytree-compile] 1.6176ms 0.4376ms 2.2851 KOps/s 2.2817 KOps/s $\color{#35bf28}+0.15\%$
test_compile_assign_and_add[pytree-eager] 2.6919ms 2.5373ms 394.1180 Ops/s 392.4757 Ops/s $\color{#35bf28}+0.42\%$
test_compile_indexing[tensor-tensordict-compile] 0.1801ms 0.1129ms 8.8547 KOps/s 8.4819 KOps/s $\color{#35bf28}+4.40\%$
test_compile_indexing[tensor-tensordict-eager] 0.5924ms 82.2059μs 12.1646 KOps/s 12.7448 KOps/s $\color{#d91a1a}-4.55\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2463ms 0.1054ms 9.4834 KOps/s 9.4262 KOps/s $\color{#35bf28}+0.61\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2113ms 66.6735μs 14.9985 KOps/s 14.6797 KOps/s $\color{#35bf28}+2.17\%$
test_compile_indexing[tensor-pytree-compile] 0.2551ms 0.1125ms 8.8875 KOps/s 9.3231 KOps/s $\color{#d91a1a}-4.67\%$
test_compile_indexing[tensor-pytree-eager] 0.2231ms 70.9828μs 14.0879 KOps/s 14.9035 KOps/s $\textbf{\color{#d91a1a}-5.47\%}$
test_compile_indexing[slice-tensordict-compile] 0.1444ms 0.1024ms 9.7637 KOps/s 10.1251 KOps/s $\color{#d91a1a}-3.57\%$
test_compile_indexing[slice-tensordict-eager] 0.1431ms 17.3680μs 57.5772 KOps/s 57.8091 KOps/s $\color{#d91a1a}-0.40\%$
test_compile_indexing[slice-tensorclass-compile] 0.2044ms 0.1006ms 9.9439 KOps/s 10.4678 KOps/s $\textbf{\color{#d91a1a}-5.00\%}$
test_compile_indexing[slice-tensorclass-eager] 47.6910μs 15.5909μs 64.1398 KOps/s 64.4175 KOps/s $\color{#d91a1a}-0.43\%$
test_compile_indexing[slice-pytree-compile] 0.1564ms 96.2287μs 10.3919 KOps/s 10.3663 KOps/s $\color{#35bf28}+0.25\%$
test_compile_indexing[slice-pytree-eager] 56.6910μs 15.6559μs 63.8739 KOps/s 64.6105 KOps/s $\color{#d91a1a}-1.14\%$
test_compile_indexing[int-tensordict-compile] 0.1707ms 0.1000ms 9.9955 KOps/s 10.0194 KOps/s $\color{#d91a1a}-0.24\%$
test_compile_indexing[int-tensordict-eager] 0.6026ms 17.2182μs 58.0781 KOps/s 59.2036 KOps/s $\color{#d91a1a}-1.90\%$
test_compile_indexing[int-tensorclass-compile] 0.2507ms 0.1000ms 9.9981 KOps/s 10.3815 KOps/s $\color{#d91a1a}-3.69\%$
test_compile_indexing[int-tensorclass-eager] 44.5110μs 15.6289μs 63.9840 KOps/s 64.9556 KOps/s $\color{#d91a1a}-1.50\%$
test_compile_indexing[int-pytree-compile] 0.1841ms 95.8028μs 10.4381 KOps/s 10.3212 KOps/s $\color{#35bf28}+1.13\%$
test_compile_indexing[int-pytree-eager] 0.1392ms 16.5930μs 60.2662 KOps/s 65.2166 KOps/s $\textbf{\color{#d91a1a}-7.59\%}$
test_mod_add[eager] 77.7310μs 38.0688μs 26.2682 KOps/s 26.0857 KOps/s $\color{#35bf28}+0.70\%$
test_mod_add[compile] 0.4469ms 80.9989μs 12.3459 KOps/s 12.2826 KOps/s $\color{#35bf28}+0.52\%$
test_mod_add[compile-overhead] 0.3261ms 0.1673ms 5.9777 KOps/s 5.7093 KOps/s $\color{#35bf28}+4.70\%$
test_mod_wrap[eager] 0.3953ms 0.2463ms 4.0606 KOps/s 4.0044 KOps/s $\color{#35bf28}+1.40\%$
test_mod_wrap[compile] 0.6903ms 0.2857ms 3.5000 KOps/s 3.4801 KOps/s $\color{#35bf28}+0.57\%$
test_mod_wrap[compile-overhead] 7.3085ms 3.9376ms 253.9602 Ops/s 268.0857 Ops/s $\textbf{\color{#d91a1a}-5.27\%}$
test_mod_wrap_and_backward[eager] 1.8313ms 1.3320ms 750.7510 Ops/s 678.5650 Ops/s $\textbf{\color{#35bf28}+10.64\%}$
test_mod_wrap_and_backward[compile] 1.4458ms 1.2482ms 801.1468 Ops/s 728.8561 Ops/s $\textbf{\color{#35bf28}+9.92\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3595ms 0.9182ms 1.0891 KOps/s 950.9055 Ops/s $\textbf{\color{#35bf28}+14.53\%}$
test_seq_add[eager] 0.2366ms 0.1127ms 8.8701 KOps/s 8.4282 KOps/s $\textbf{\color{#35bf28}+5.24\%}$
test_seq_add[compile] 0.5105ms 87.5527μs 11.4217 KOps/s 11.0398 KOps/s $\color{#35bf28}+3.46\%$
test_seq_add[compile-overhead] 0.5478ms 0.1295ms 7.7212 KOps/s 7.4352 KOps/s $\color{#35bf28}+3.85\%$
test_seq_wrap[eager] 0.8383ms 0.4223ms 2.3678 KOps/s 2.2601 KOps/s $\color{#35bf28}+4.77\%$
test_seq_wrap[compile] 0.7511ms 0.3030ms 3.3004 KOps/s 3.3074 KOps/s $\color{#d91a1a}-0.21\%$
test_seq_wrap[compile-overhead] 0.6509ms 0.2250ms 4.4451 KOps/s 4.4331 KOps/s $\color{#35bf28}+0.27\%$
test_func_call_runtime[False-eager] 1.1486ms 0.7192ms 1.3904 KOps/s 1.3644 KOps/s $\color{#35bf28}+1.90\%$
test_func_call_runtime[False-compile] 3.5699ms 0.7611ms 1.3139 KOps/s 1.3362 KOps/s $\color{#d91a1a}-1.67\%$
test_func_call_runtime[False-compile-overhead] 0.5189ms 0.3677ms 2.7198 KOps/s 2.7470 KOps/s $\color{#d91a1a}-0.99\%$
test_func_call_runtime[True-eager] 1.0882ms 0.9153ms 1.0926 KOps/s 1.0821 KOps/s $\color{#35bf28}+0.97\%$
test_func_call_runtime[True-compile] 0.8978ms 0.7770ms 1.2871 KOps/s 1.3153 KOps/s $\color{#d91a1a}-2.15\%$
test_func_call_runtime[True-compile-overhead] 0.4563ms 0.3953ms 2.5296 KOps/s 2.5762 KOps/s $\color{#d91a1a}-1.81\%$
test_func_call_cm_runtime[False-eager] 0.8708ms 0.7524ms 1.3290 KOps/s 1.3079 KOps/s $\color{#35bf28}+1.61\%$
test_func_call_cm_runtime[False-compile] 0.8794ms 0.7363ms 1.3581 KOps/s 1.3515 KOps/s $\color{#35bf28}+0.48\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4804ms 0.3650ms 2.7395 KOps/s 2.7229 KOps/s $\color{#35bf28}+0.61\%$
test_func_call_cm_runtime[True-eager] 1.1344ms 0.9873ms 1.0129 KOps/s 1.0005 KOps/s $\color{#35bf28}+1.24\%$
test_func_call_cm_runtime[True-compile] 1.0890ms 0.9796ms 1.0208 KOps/s 1.0149 KOps/s $\color{#35bf28}+0.58\%$
test_func_call_cm_runtime[True-compile-overhead] 1.2040ms 0.9837ms 1.0165 KOps/s 1.0079 KOps/s $\color{#35bf28}+0.86\%$
test_vmap_func_call_cm_runtime[eager] 2.4430ms 2.0652ms 484.2205 Ops/s 478.5293 Ops/s $\color{#35bf28}+1.19\%$
test_vmap_func_call_cm_runtime[compile] 0.9130ms 0.8077ms 1.2380 KOps/s 1.2234 KOps/s $\color{#35bf28}+1.20\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.5274ms 0.4206ms 2.3773 KOps/s 2.3114 KOps/s $\color{#35bf28}+2.85\%$
test_distributed 2.4701ms 0.1691ms 5.9145 KOps/s 8.5396 KOps/s $\textbf{\color{#d91a1a}-30.74\%}$
test_tdmodule 58.5810μs 19.9284μs 50.1795 KOps/s 50.5840 KOps/s $\color{#d91a1a}-0.80\%$
test_tdmodule_dispatch 80.6910μs 35.1020μs 28.4884 KOps/s 27.6773 KOps/s $\color{#35bf28}+2.93\%$
test_tdseq 0.1211ms 20.8710μs 47.9133 KOps/s 48.2753 KOps/s $\color{#d91a1a}-0.75\%$
test_tdseq_dispatch 71.5010μs 38.3947μs 26.0453 KOps/s 25.5496 KOps/s $\color{#35bf28}+1.94\%$
test_instantiation_functorch 1.6184ms 1.5208ms 657.5612 Ops/s 653.9239 Ops/s $\color{#35bf28}+0.56\%$
test_exec_functorch 0.1855ms 0.1377ms 7.2598 KOps/s 7.0061 KOps/s $\color{#35bf28}+3.62\%$
test_exec_functional_call 0.2213ms 0.1303ms 7.6775 KOps/s 7.3426 KOps/s $\color{#35bf28}+4.56\%$
test_exec_td_decorator 0.3765ms 0.1808ms 5.5307 KOps/s 5.3439 KOps/s $\color{#35bf28}+3.50\%$
test_vmap_mlp_speed_decorator[True-True] 0.7913ms 0.6772ms 1.4766 KOps/s 1.4347 KOps/s $\color{#35bf28}+2.92\%$
test_vmap_mlp_speed_decorator[True-False] 0.7920ms 0.6778ms 1.4754 KOps/s 1.4328 KOps/s $\color{#35bf28}+2.97\%$
test_vmap_mlp_speed_decorator[False-True] 0.7320ms 0.5900ms 1.6949 KOps/s 1.6583 KOps/s $\color{#35bf28}+2.21\%$
test_vmap_mlp_speed_decorator[False-False] 0.7085ms 0.5874ms 1.7024 KOps/s 1.6471 KOps/s $\color{#35bf28}+3.35\%$
test_vmap_transformer_speed_decorator[True-True] 19.8486ms 18.8582ms 53.0274 Ops/s 52.3684 Ops/s $\color{#35bf28}+1.26\%$
test_vmap_transformer_speed_decorator[True-False] 19.6845ms 18.9995ms 52.6329 Ops/s 52.9531 Ops/s $\color{#d91a1a}-0.60\%$
test_vmap_transformer_speed_decorator[False-True] 18.8016ms 18.6714ms 53.5580 Ops/s 52.0808 Ops/s $\color{#35bf28}+2.84\%$
test_vmap_transformer_speed_decorator[False-False] 18.7920ms 18.6848ms 53.5194 Ops/s 52.8394 Ops/s $\color{#35bf28}+1.29\%$
test_to_module_speed[True] 1.0795ms 0.9556ms 1.0465 KOps/s 1.0447 KOps/s $\color{#35bf28}+0.17\%$
test_to_module_speed[False] 1.0554ms 0.9292ms 1.0761 KOps/s 1.0693 KOps/s $\color{#35bf28}+0.64\%$
test_tc_init 68.8310μs 36.1841μs 27.6365 KOps/s 27.8556 KOps/s $\color{#d91a1a}-0.79\%$
test_tc_init_nested 0.1621ms 73.1469μs 13.6711 KOps/s 13.5332 KOps/s $\color{#35bf28}+1.02\%$
test_tc_first_layer_tensor 28.0300μs 0.7960μs 1.2563 MOps/s 1.4267 MOps/s $\textbf{\color{#d91a1a}-11.95\%}$
test_tc_first_layer_nontensor 43.5800μs 2.2155μs 451.3723 KOps/s 447.9325 KOps/s $\color{#35bf28}+0.77\%$
test_tc_second_layer_tensor 27.1153μs 1.3974μs 715.5952 KOps/s 707.6609 KOps/s $\color{#35bf28}+1.12\%$
test_tc_second_layer_nontensor 91.9310μs 2.9448μs 339.5852 KOps/s 342.5835 KOps/s $\color{#d91a1a}-0.88\%$
test_unbind 0.2221s 11.5116ms 86.8687 Ops/s 142.8503 Ops/s $\textbf{\color{#d91a1a}-39.19\%}$
test_full_like 9.7514ms 9.3941ms 106.4493 Ops/s 105.2902 Ops/s $\color{#35bf28}+1.10\%$
test_zeros_like 5.0032ms 4.3619ms 229.2583 Ops/s 235.1976 Ops/s $\color{#d91a1a}-2.53\%$
test_ones_like 10.4475ms 4.3759ms 228.5250 Ops/s 233.6699 Ops/s $\color{#d91a1a}-2.20\%$
test_clone 7.2087ms 6.6677ms 149.9777 Ops/s 106.0417 Ops/s $\textbf{\color{#35bf28}+41.43\%}$
test_squeeze 70.6000μs 9.4852μs 105.4269 KOps/s 106.6199 KOps/s $\color{#d91a1a}-1.12\%$
test_unsqueeze 0.1679ms 70.5200μs 14.1804 KOps/s 14.1903 KOps/s $\color{#d91a1a}-0.07\%$
test_split 0.2885ms 0.1560ms 6.4091 KOps/s 6.3855 KOps/s $\color{#35bf28}+0.37\%$
test_permute 0.2887ms 0.1783ms 5.6072 KOps/s 5.5623 KOps/s $\color{#35bf28}+0.81\%$
test_stack 51.8384ms 51.0572ms 19.5859 Ops/s 19.6141 Ops/s $\color{#d91a1a}-0.14\%$
test_cat 51.4483ms 50.9571ms 19.6244 Ops/s 19.5819 Ops/s $\color{#35bf28}+0.22\%$

vmoens added a commit that referenced this pull request Feb 17, 2025
ghstack-source-id: 46d70d783e59f86f1720cd6bede12049de57d095
Pull Request resolved: #1218

(cherry picked from commit 9eba0c1)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants