-
Notifications
You must be signed in to change notification settings - Fork 79
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Minor] Fix tensorclass prints #1218
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Feb 13, 2025
ghstack-source-id: 46d70d783e59f86f1720cd6bede12049de57d095 Pull Request resolved: #1218
vmoens
added a commit
that referenced
this pull request
Feb 13, 2025
ghstack-source-id: 46d70d783e59f86f1720cd6bede12049de57d095 Pull Request resolved: #1218
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 43.5110μs | 20.7399μs | 48.2162 KOps/s | 48.9514 KOps/s | |
test_plain_set_stack_nested | 59.4810μs | 20.9319μs | 47.7740 KOps/s | 47.8534 KOps/s | |
test_plain_set_nested_inplace | 87.8640μs | 22.5249μs | 44.3953 KOps/s | 44.1562 KOps/s | |
test_plain_set_stack_nested_inplace | 77.0340μs | 22.5800μs | 44.2870 KOps/s | 43.9668 KOps/s | |
test_items | 31.8200μs | 4.2630μs | 234.5766 KOps/s | 239.4746 KOps/s | |
test_items_nested | 0.7978ms | 0.4078ms | 2.4523 KOps/s | 2.3994 KOps/s | |
test_items_nested_locked | 0.5514ms | 0.4097ms | 2.4409 KOps/s | 2.3998 KOps/s | |
test_items_nested_leaf | 0.1429ms | 78.4301μs | 12.7502 KOps/s | 12.8816 KOps/s | |
test_items_stack_nested | 0.7587ms | 0.4112ms | 2.4320 KOps/s | 2.3812 KOps/s | |
test_items_stack_nested_leaf | 0.1544ms | 81.6204μs | 12.2518 KOps/s | 12.3959 KOps/s | |
test_items_stack_nested_locked | 0.5279ms | 0.4077ms | 2.4528 KOps/s | 2.3743 KOps/s | |
test_keys | 27.6210μs | 3.4569μs | 289.2729 KOps/s | 285.7785 KOps/s | |
test_keys_nested | 0.2548ms | 0.1647ms | 6.0708 KOps/s | 6.1111 KOps/s | |
test_keys_nested_locked | 1.8167ms | 0.1714ms | 5.8336 KOps/s | 5.8612 KOps/s | |
test_keys_nested_leaf | 0.2536ms | 0.1436ms | 6.9661 KOps/s | 7.0251 KOps/s | |
test_keys_stack_nested | 0.2722ms | 0.1635ms | 6.1180 KOps/s | 6.1937 KOps/s | |
test_keys_stack_nested_leaf | 0.2771ms | 0.1431ms | 6.9865 KOps/s | 7.3016 KOps/s | |
test_keys_stack_nested_locked | 0.2544ms | 0.1704ms | 5.8676 KOps/s | 5.9778 KOps/s | |
test_values | 10.3854μs | 1.0386μs | 962.8022 KOps/s | 966.4115 KOps/s | |
test_values_nested | 0.1068ms | 62.9791μs | 15.8783 KOps/s | 15.9531 KOps/s | |
test_values_nested_locked | 0.1415ms | 62.6013μs | 15.9741 KOps/s | 15.7403 KOps/s | |
test_values_nested_leaf | 0.1435ms | 72.4662μs | 13.7995 KOps/s | 14.2854 KOps/s | |
test_values_stack_nested | 0.1296ms | 63.8690μs | 15.6570 KOps/s | 15.0596 KOps/s | |
test_values_stack_nested_leaf | 0.1368ms | 73.1779μs | 13.6653 KOps/s | 14.4006 KOps/s | |
test_values_stack_nested_locked | 0.1190ms | 63.7664μs | 15.6822 KOps/s | 15.5213 KOps/s | |
test_membership | 16.9810μs | 0.8609μs | 1.1616 MOps/s | 1.1526 MOps/s | |
test_membership_nested | 27.8920μs | 2.9564μs | 338.2542 KOps/s | 338.7347 KOps/s | |
test_membership_nested_leaf | 36.7190μs | 2.9845μs | 335.0597 KOps/s | 325.2229 KOps/s | |
test_membership_stacked_nested | 43.2000μs | 2.9345μs | 340.7764 KOps/s | 341.0266 KOps/s | |
test_membership_stacked_nested_leaf | 48.8210μs | 2.9714μs | 336.5423 KOps/s | 342.4397 KOps/s | |
test_membership_nested_last | 33.0020μs | 4.4457μs | 224.9370 KOps/s | 229.5586 KOps/s | |
test_membership_nested_leaf_last | 76.8330μs | 4.4324μs | 225.6093 KOps/s | 227.7016 KOps/s | |
test_membership_stacked_nested_last | 42.0190μs | 4.3848μs | 228.0598 KOps/s | 72.2829 KOps/s | |
test_membership_stacked_nested_leaf_last | 24.9660μs | 4.4290μs | 225.7863 KOps/s | 73.0315 KOps/s | |
test_nested_getleaf | 62.7870μs | 10.7841μs | 92.7294 KOps/s | 92.3981 KOps/s | |
test_nested_get | 58.0070μs | 10.0557μs | 99.4464 KOps/s | 97.1912 KOps/s | |
test_stacked_getleaf | 69.8800μs | 10.7266μs | 93.2263 KOps/s | 93.9040 KOps/s | |
test_stacked_get | 42.5890μs | 10.1319μs | 98.6983 KOps/s | 98.0913 KOps/s | |
test_nested_getitemleaf | 60.8730μs | 11.2752μs | 88.6903 KOps/s | 87.5402 KOps/s | |
test_nested_getitem | 49.9630μs | 10.8365μs | 92.2810 KOps/s | 92.2289 KOps/s | |
test_stacked_getitemleaf | 72.6860μs | 11.3190μs | 88.3470 KOps/s | 88.6223 KOps/s | |
test_stacked_getitem | 43.6720μs | 10.7911μs | 92.6693 KOps/s | 92.5059 KOps/s | |
test_lock_nested | 0.7269ms | 0.4222ms | 2.3684 KOps/s | 2.4193 KOps/s | |
test_lock_stack_nested | 0.7129ms | 0.4310ms | 2.3201 KOps/s | 2.3929 KOps/s | |
test_unlock_nested | 0.8056ms | 0.3437ms | 2.9094 KOps/s | 2.9070 KOps/s | |
test_unlock_stack_nested | 0.7072ms | 0.3489ms | 2.8664 KOps/s | 2.9663 KOps/s | |
test_flatten_speed | 0.2002ms | 0.1039ms | 9.6232 KOps/s | 10.0275 KOps/s | |
test_unflatten_speed | 0.9199ms | 0.5332ms | 1.8753 KOps/s | 1.8916 KOps/s | |
test_common_ops | 1.0315ms | 0.8207ms | 1.2185 KOps/s | 1.2132 KOps/s | |
test_creation | 64.2900μs | 2.5457μs | 392.8132 KOps/s | 400.2539 KOps/s | |
test_creation_empty | 43.2710μs | 12.6347μs | 79.1468 KOps/s | 85.4863 KOps/s | |
test_creation_nested_1 | 74.7290μs | 16.0582μs | 62.2736 KOps/s | 64.6348 KOps/s | |
test_creation_nested_2 | 60.3530μs | 20.4380μs | 48.9284 KOps/s | 50.9887 KOps/s | |
test_clone | 63.2480μs | 13.5011μs | 74.0683 KOps/s | 71.2157 KOps/s | |
test_getitem[int] | 0.8374ms | 13.0131μs | 76.8459 KOps/s | 75.1348 KOps/s | |
test_getitem[slice_int] | 0.1493ms | 25.8461μs | 38.6906 KOps/s | 38.5115 KOps/s | |
test_getitem[range] | 0.1703ms | 51.2764μs | 19.5021 KOps/s | 19.2426 KOps/s | |
test_getitem[tuple] | 0.1343ms | 20.3660μs | 49.1014 KOps/s | 46.4289 KOps/s | |
test_getitem[list] | 0.2784ms | 46.9084μs | 21.3182 KOps/s | 21.3610 KOps/s | |
test_setitem_dim[int] | 74.1880μs | 26.8860μs | 37.1940 KOps/s | 38.6624 KOps/s | |
test_setitem_dim[slice_int] | 94.1550μs | 53.0454μs | 18.8518 KOps/s | 18.6889 KOps/s | |
test_setitem_dim[range] | 0.1431ms | 78.6381μs | 12.7165 KOps/s | 13.0449 KOps/s | |
test_setitem_dim[tuple] | 0.1078ms | 42.7052μs | 23.4163 KOps/s | 23.9867 KOps/s | |
test_setitem | 0.1176ms | 20.8852μs | 47.8809 KOps/s | 47.0690 KOps/s | |
test_set | 0.1304ms | 20.5303μs | 48.7086 KOps/s | 48.5491 KOps/s | |
test_set_shared | 4.4666ms | 0.1866ms | 5.3598 KOps/s | 5.3083 KOps/s | |
test_update | 0.1610ms | 23.3650μs | 42.7990 KOps/s | 42.0150 KOps/s | |
test_update_nested | 0.2344ms | 35.0615μs | 28.5213 KOps/s | 27.9604 KOps/s | |
test_update__nested | 0.4393ms | 33.2671μs | 30.0597 KOps/s | 28.3968 KOps/s | |
test_set_nested | 96.5300μs | 23.2759μs | 42.9629 KOps/s | 42.3727 KOps/s | |
test_set_nested_new | 0.1369ms | 28.3971μs | 35.2149 KOps/s | 34.5627 KOps/s | |
test_select | 0.1656ms | 45.6074μs | 21.9263 KOps/s | 21.9395 KOps/s | |
test_select_nested | 0.1449ms | 62.7544μs | 15.9351 KOps/s | 16.0393 KOps/s | |
test_exclude_nested | 0.1569ms | 81.3445μs | 12.2934 KOps/s | 12.3360 KOps/s | |
test_empty[True] | 0.7398ms | 0.4133ms | 2.4193 KOps/s | 2.4220 KOps/s | |
test_empty[False] | 8.9040μs | 1.3883μs | 720.2804 KOps/s | 725.4287 KOps/s | |
test_unbind_speed | 0.4711ms | 0.2727ms | 3.6674 KOps/s | 3.6534 KOps/s | |
test_unbind_speed_stack0 | 0.3981ms | 0.2685ms | 3.7242 KOps/s | 3.8042 KOps/s | |
test_unbind_speed_stack1 | 0.1178s | 0.7467ms | 1.3391 KOps/s | 1.3454 KOps/s | |
test_split | 0.1183s | 1.7701ms | 564.9312 Ops/s | 489.5105 Ops/s | |
test_chunk | 0.1253s | 1.7876ms | 559.3960 Ops/s | 616.4269 Ops/s | |
test_consolidate_njt[False-None] | 9.4343ms | 8.4521ms | 118.3135 Ops/s | 116.9115 Ops/s | |
test_creation[device0] | 0.3111ms | 93.0325μs | 10.7489 KOps/s | 10.4137 KOps/s | |
test_creation_from_tensor | 4.1990ms | 98.9834μs | 10.1027 KOps/s | 10.0819 KOps/s | |
test_add_one[memmap_tensor0] | 0.1060ms | 5.0214μs | 199.1462 KOps/s | 184.9328 KOps/s | |
test_contiguous[memmap_tensor0] | 9.9380μs | 0.5024μs | 1.9906 MOps/s | 1.9748 MOps/s | |
test_stack[memmap_tensor0] | 33.8130μs | 3.3618μs | 297.4560 KOps/s | 271.7475 KOps/s | |
test_memmaptd_index | 1.3842ms | 0.2279ms | 4.3875 KOps/s | 4.1492 KOps/s | |
test_memmaptd_index_astensor | 0.6518ms | 0.3161ms | 3.1640 KOps/s | 3.0426 KOps/s | |
test_memmaptd_index_op | 1.0753ms | 0.5963ms | 1.6769 KOps/s | 1.6290 KOps/s | |
test_serialize_model | 0.1311s | 0.1198s | 8.3440 Ops/s | 7.2198 Ops/s | |
test_serialize_model_pickle | 0.4980s | 0.4052s | 2.4677 Ops/s | 2.4949 Ops/s | |
test_serialize_weights | 0.1303s | 0.1184s | 8.4456 Ops/s | 8.2239 Ops/s | |
test_serialize_weights_returnearly | 0.1908s | 0.1641s | 6.0957 Ops/s | 6.1330 Ops/s | |
test_serialize_weights_pickle | 0.6119s | 0.4464s | 2.2402 Ops/s | 2.5591 Ops/s | |
test_serialize_weights_filesystem | 0.1538s | 0.1437s | 6.9589 Ops/s | 6.8784 Ops/s | |
test_serialize_model_filesystem | 0.2668s | 0.1702s | 5.8759 Ops/s | 5.5584 Ops/s | |
test_reshape_pytree | 75.9310μs | 26.1780μs | 38.1999 KOps/s | 36.5366 KOps/s | |
test_reshape_td | 0.1020ms | 33.6322μs | 29.7334 KOps/s | 29.5465 KOps/s | |
test_view_pytree | 0.1227ms | 27.1262μs | 36.8647 KOps/s | 37.8423 KOps/s | |
test_view_td | 88.6250μs | 41.3596μs | 24.1782 KOps/s | 24.3777 KOps/s | |
test_unbind_pytree | 66.9650μs | 29.7053μs | 33.6640 KOps/s | 33.3774 KOps/s | |
test_unbind_td | 0.3394ms | 40.0043μs | 24.9973 KOps/s | 24.2832 KOps/s | |
test_split_pytree | 71.5930μs | 28.9826μs | 34.5034 KOps/s | 33.4949 KOps/s | |
test_split_td | 0.5808ms | 45.4965μs | 21.9797 KOps/s | 21.3031 KOps/s | |
test_add_pytree | 83.1950μs | 35.8342μs | 27.9063 KOps/s | 27.0730 KOps/s | |
test_add_td | 0.1198ms | 58.0068μs | 17.2394 KOps/s | 16.4776 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1699ms | 68.0617μs | 14.6926 KOps/s | 14.8298 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3803ms | 0.1731ms | 5.7757 KOps/s | 5.7132 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1830ms | 46.4693μs | 21.5196 KOps/s | 21.7823 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2492ms | 0.1189ms | 8.4072 KOps/s | 8.2079 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 81.5520μs | 28.8713μs | 34.6364 KOps/s | 35.1849 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1150ms | 59.1438μs | 16.9080 KOps/s | 16.8559 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1570ms | 81.2127μs | 12.3133 KOps/s | 12.3810 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1296ms | 67.1601μs | 14.8898 KOps/s | 14.6959 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2004ms | 0.1079ms | 9.2711 KOps/s | 9.3711 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3961ms | 0.2168ms | 4.6117 KOps/s | 4.6281 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1683ms | 47.3958μs | 21.0989 KOps/s | 20.6634 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1557ms | 67.5114μs | 14.8123 KOps/s | 14.4481 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2208ms | 0.1010ms | 9.9030 KOps/s | 9.8635 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3418ms | 0.2020ms | 4.9517 KOps/s | 4.8131 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3983ms | 0.2340ms | 4.2741 KOps/s | 4.2209 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2806ms | 0.1093ms | 9.1473 KOps/s | 9.1866 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2784ms | 64.3523μs | 15.5394 KOps/s | 15.5621 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2770ms | 50.2747μs | 19.8907 KOps/s | 20.6403 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3484ms | 0.1584ms | 6.3147 KOps/s | 6.1279 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2218ms | 0.1011ms | 9.8936 KOps/s | 9.7939 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 68.7790μs | 23.5543μs | 42.4552 KOps/s | 45.3385 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1375ms | 67.0927μs | 14.9047 KOps/s | 14.6740 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1654ms | 83.4595μs | 11.9819 KOps/s | 12.0869 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1490ms | 68.9636μs | 14.5004 KOps/s | 14.6125 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4711ms | 0.2186ms | 4.5739 KOps/s | 4.5572 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.6710ms | 1.4056ms | 711.4640 Ops/s | 710.7093 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3680ms | 0.2122ms | 4.7117 KOps/s | 4.7065 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.4963ms | 0.8348ms | 1.1978 KOps/s | 1.1750 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.7221ms | 0.4668ms | 2.1422 KOps/s | 2.1520 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.1537ms | 2.7210ms | 367.5088 Ops/s | 355.9822 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1178ms | 38.5462μs | 25.9429 KOps/s | 25.2474 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.8013ms | 33.6950μs | 29.6780 KOps/s | 28.5986 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 93.4640μs | 31.8151μs | 31.4316 KOps/s | 31.6685 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1018ms | 24.1455μs | 41.4156 KOps/s | 41.5732 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 96.4710μs | 33.2629μs | 30.0635 KOps/s | 32.2324 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2258ms | 23.7637μs | 42.0811 KOps/s | 40.9733 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1495ms | 55.0439μs | 18.1673 KOps/s | 18.4546 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.3660ms | 20.3234μs | 49.2042 KOps/s | 48.7622 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1274ms | 46.7871μs | 21.3734 KOps/s | 21.2036 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 77.7780μs | 19.0401μs | 52.5206 KOps/s | 49.7512 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1159ms | 48.1521μs | 20.7675 KOps/s | 20.8660 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 62.1860μs | 18.9219μs | 52.8488 KOps/s | 51.7424 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1395ms | 56.4447μs | 17.7165 KOps/s | 17.6094 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9328ms | 19.8996μs | 50.2522 KOps/s | 46.9008 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1301ms | 47.4728μs | 21.0647 KOps/s | 21.1039 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 60.1320μs | 18.8794μs | 52.9678 KOps/s | 51.7849 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1500ms | 47.6968μs | 20.9658 KOps/s | 20.7599 KOps/s | |
test_compile_indexing[int-pytree-eager] | 84.2670μs | 18.7496μs | 53.3345 KOps/s | 52.3665 KOps/s | |
test_mod_add[eager] | 0.1143ms | 35.9790μs | 27.7940 KOps/s | 27.5221 KOps/s | |
test_mod_add[compile] | 0.1600ms | 66.2418μs | 15.0962 KOps/s | 14.6981 KOps/s | |
test_mod_add[compile-overhead] | 0.1879ms | 65.0782μs | 15.3661 KOps/s | 14.9643 KOps/s | |
test_mod_wrap[eager] | 0.4364ms | 0.2235ms | 4.4750 KOps/s | 4.2981 KOps/s | |
test_mod_wrap[compile] | 2.0480ms | 0.2339ms | 4.2746 KOps/s | 4.1157 KOps/s | |
test_mod_wrap[compile-overhead] | 0.5708ms | 0.2289ms | 4.3681 KOps/s | 4.2565 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.8066ms | 11.3993ms | 87.7245 Ops/s | 70.4749 Ops/s | |
test_mod_wrap_and_backward[compile] | 13.1286ms | 11.6169ms | 86.0818 Ops/s | 81.0688 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.6164ms | 11.4291ms | 87.4960 Ops/s | 82.2672 Ops/s | |
test_seq_add[eager] | 0.2188ms | 0.1175ms | 8.5104 KOps/s | 8.1959 KOps/s | |
test_seq_add[compile] | 0.2579ms | 79.4939μs | 12.5796 KOps/s | 12.7747 KOps/s | |
test_seq_add[compile-overhead] | 0.2076ms | 77.2605μs | 12.9432 KOps/s | 13.0652 KOps/s | |
test_seq_wrap[eager] | 0.7475ms | 0.4456ms | 2.2444 KOps/s | 2.1608 KOps/s | |
test_seq_wrap[compile] | 0.5000ms | 0.2534ms | 3.9462 KOps/s | 3.9348 KOps/s | |
test_seq_wrap[compile-overhead] | 0.5003ms | 0.2517ms | 3.9724 KOps/s | 3.9944 KOps/s | |
test_func_call_runtime[False-eager] | 0.9298ms | 0.5499ms | 1.8184 KOps/s | 1.8151 KOps/s | |
test_func_call_runtime[False-compile] | 0.6495ms | 0.4513ms | 2.2158 KOps/s | 2.1708 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.6652ms | 0.4519ms | 2.2126 KOps/s | 2.1723 KOps/s | |
test_func_call_runtime[True-eager] | 1.0784ms | 0.7690ms | 1.3004 KOps/s | 1.2988 KOps/s | |
test_func_call_runtime[True-compile] | 0.6337ms | 0.4756ms | 2.1025 KOps/s | 2.0804 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6664ms | 0.4739ms | 2.1103 KOps/s | 2.0663 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9694ms | 0.5441ms | 1.8379 KOps/s | 1.8145 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6043ms | 0.4528ms | 2.2084 KOps/s | 2.1960 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7988ms | 0.4513ms | 2.2157 KOps/s | 2.1934 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.9589ms | 0.9321ms | 1.0728 KOps/s | 1.0875 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.1938ms | 0.8155ms | 1.2263 KOps/s | 1.2179 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.1832ms | 0.8233ms | 1.2147 KOps/s | 1.2132 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 3.0304ms | 1.9994ms | 500.1518 Ops/s | 496.6516 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9535ms | 0.5516ms | 1.8128 KOps/s | 1.7699 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7300ms | 0.5474ms | 1.8267 KOps/s | 1.8061 KOps/s | |
test_distributed | 0.3797ms | 0.1285ms | 7.7813 KOps/s | 7.6012 KOps/s | |
test_tdmodule | 65.0910μs | 26.5371μs | 37.6831 KOps/s | 37.6261 KOps/s | |
test_tdmodule_dispatch | 0.1353ms | 49.7269μs | 20.1098 KOps/s | 20.6253 KOps/s | |
test_tdseq | 57.4070μs | 29.7673μs | 33.5939 KOps/s | 32.9453 KOps/s | |
test_tdseq_dispatch | 0.1067ms | 56.4228μs | 17.7233 KOps/s | 18.1770 KOps/s | |
test_instantiation_functorch | 1.8800ms | 1.5779ms | 633.7665 Ops/s | 631.4339 Ops/s | |
test_exec_functorch | 0.3340ms | 0.1850ms | 5.4063 KOps/s | 5.4359 KOps/s | |
test_exec_functional_call | 0.3431ms | 0.1742ms | 5.7391 KOps/s | 5.7151 KOps/s | |
test_exec_td_decorator | 0.5114ms | 0.2399ms | 4.1689 KOps/s | 4.1209 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9270ms | 0.6778ms | 1.4754 KOps/s | 1.4174 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0115ms | 0.6813ms | 1.4678 KOps/s | 1.4689 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 1.4166ms | 0.5531ms | 1.8081 KOps/s | 1.8166 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9789ms | 0.5500ms | 1.8183 KOps/s | 1.8197 KOps/s | |
test_to_module_speed[True] | 2.2132ms | 1.3849ms | 722.0675 Ops/s | 732.6107 Ops/s | |
test_to_module_speed[False] | 1.9657ms | 1.3356ms | 748.7108 Ops/s | 752.2878 Ops/s | |
test_tc_init | 89.5670μs | 49.2901μs | 20.2880 KOps/s | 21.4014 KOps/s | |
test_tc_init_nested | 0.1723ms | 98.8640μs | 10.1149 KOps/s | 10.9395 KOps/s | |
test_tc_first_layer_tensor | 24.4550μs | 1.6100μs | 621.1069 KOps/s | 614.0215 KOps/s | |
test_tc_first_layer_nontensor | 20.2880μs | 4.7497μs | 210.5402 KOps/s | 209.5067 KOps/s | |
test_tc_second_layer_tensor | 45.0340μs | 2.8974μs | 345.1384 KOps/s | 346.1637 KOps/s | |
test_tc_second_layer_nontensor | 43.4320μs | 6.1524μs | 162.5371 KOps/s | 163.7864 KOps/s | |
test_unbind | 0.2810s | 15.3510ms | 65.1422 Ops/s | 71.7735 Ops/s | |
test_full_like | 13.6577ms | 11.0542ms | 90.4632 Ops/s | 105.0875 Ops/s | |
test_zeros_like | 6.0950ms | 3.8084ms | 262.5801 Ops/s | 268.3161 Ops/s | |
test_ones_like | 6.1373ms | 4.1016ms | 243.8068 Ops/s | 227.7247 Ops/s | |
test_clone | 9.2285ms | 6.3450ms | 157.6049 Ops/s | 151.7601 Ops/s | |
test_squeeze | 91.9020μs | 12.8704μs | 77.6977 KOps/s | 79.9209 KOps/s | |
test_unsqueeze | 0.2938ms | 94.7268μs | 10.5567 KOps/s | 10.4274 KOps/s | |
test_split | 0.4217ms | 0.1993ms | 5.0168 KOps/s | 5.0318 KOps/s | |
test_permute | 0.4326ms | 0.2091ms | 4.7834 KOps/s | 4.8638 KOps/s | |
test_stack | 35.6618ms | 28.4037ms | 35.2066 Ops/s | 37.8296 Ops/s | |
test_cat | 39.5340ms | 28.8088ms | 34.7116 Ops/s | 36.8266 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 46.6910μs | 12.8197μs | 78.0049 KOps/s | 77.6266 KOps/s | |
test_plain_set_stack_nested | 32.7900μs | 13.0066μs | 76.8842 KOps/s | 77.9717 KOps/s | |
test_plain_set_nested_inplace | 41.7800μs | 14.0059μs | 71.3983 KOps/s | 71.1922 KOps/s | |
test_plain_set_stack_nested_inplace | 46.8010μs | 14.0055μs | 71.4005 KOps/s | 71.9786 KOps/s | |
test_items | 31.2110μs | 2.8687μs | 348.5843 KOps/s | 342.9265 KOps/s | |
test_items_nested | 0.3905ms | 0.3627ms | 2.7573 KOps/s | 2.7307 KOps/s | |
test_items_nested_locked | 0.4186ms | 0.3672ms | 2.7235 KOps/s | 2.7374 KOps/s | |
test_items_nested_leaf | 85.4610μs | 60.4981μs | 16.5295 KOps/s | 16.5614 KOps/s | |
test_items_stack_nested | 0.4110ms | 0.3631ms | 2.7542 KOps/s | 2.7422 KOps/s | |
test_items_stack_nested_leaf | 0.1028ms | 61.7171μs | 16.2030 KOps/s | 16.1809 KOps/s | |
test_items_stack_nested_locked | 0.4065ms | 0.3664ms | 2.7292 KOps/s | 2.7703 KOps/s | |
test_keys | 31.0500μs | 3.4401μs | 290.6855 KOps/s | 291.2843 KOps/s | |
test_keys_nested | 0.1239ms | 88.1443μs | 11.3450 KOps/s | 11.5001 KOps/s | |
test_keys_nested_locked | 0.7383ms | 93.5999μs | 10.6838 KOps/s | 10.7913 KOps/s | |
test_keys_nested_leaf | 0.1202ms | 79.1114μs | 12.6404 KOps/s | 12.7969 KOps/s | |
test_keys_stack_nested | 0.1116ms | 87.3500μs | 11.4482 KOps/s | 11.4506 KOps/s | |
test_keys_stack_nested_leaf | 0.1093ms | 78.8113μs | 12.6885 KOps/s | 12.6285 KOps/s | |
test_keys_stack_nested_locked | 0.1411ms | 91.9918μs | 10.8705 KOps/s | 10.7076 KOps/s | |
test_values | 5.5383μs | 0.8499μs | 1.1766 MOps/s | 1.1711 MOps/s | |
test_values_nested | 64.9510μs | 37.3570μs | 26.7688 KOps/s | 26.7874 KOps/s | |
test_values_nested_locked | 65.1210μs | 39.1621μs | 25.5349 KOps/s | 25.5285 KOps/s | |
test_values_nested_leaf | 68.1610μs | 42.4447μs | 23.5601 KOps/s | 23.4273 KOps/s | |
test_values_stack_nested | 72.7610μs | 37.9922μs | 26.3212 KOps/s | 26.4806 KOps/s | |
test_values_stack_nested_leaf | 82.2910μs | 42.7062μs | 23.4158 KOps/s | 23.4554 KOps/s | |
test_values_stack_nested_locked | 66.8610μs | 39.7887μs | 25.1328 KOps/s | 25.4041 KOps/s | |
test_membership | 1.8020μs | 0.5001μs | 1.9998 MOps/s | 1.9855 MOps/s | |
test_membership_nested | 16.9050μs | 2.0380μs | 490.6659 KOps/s | 477.5626 KOps/s | |
test_membership_nested_leaf | 15.5405μs | 2.0460μs | 488.7492 KOps/s | 469.8962 KOps/s | |
test_membership_stacked_nested | 28.6500μs | 2.1036μs | 475.3654 KOps/s | 478.6299 KOps/s | |
test_membership_stacked_nested_leaf | 37.3600μs | 2.1206μs | 471.5688 KOps/s | 473.0125 KOps/s | |
test_membership_nested_last | 29.0800μs | 3.1143μs | 321.0946 KOps/s | 320.2583 KOps/s | |
test_membership_nested_leaf_last | 39.7110μs | 3.1279μs | 319.7076 KOps/s | 320.1354 KOps/s | |
test_membership_stacked_nested_last | 33.1110μs | 8.2467μs | 121.2604 KOps/s | 324.2907 KOps/s | |
test_membership_stacked_nested_leaf_last | 42.3900μs | 8.2558μs | 121.1275 KOps/s | 322.9279 KOps/s | |
test_nested_getleaf | 31.2110μs | 6.1401μs | 162.8643 KOps/s | 162.3877 KOps/s | |
test_nested_get | 40.7110μs | 5.9638μs | 167.6771 KOps/s | 168.5078 KOps/s | |
test_stacked_getleaf | 37.0300μs | 6.1760μs | 161.9160 KOps/s | 161.3323 KOps/s | |
test_stacked_get | 47.1500μs | 5.8143μs | 171.9889 KOps/s | 171.1380 KOps/s | |
test_nested_getitemleaf | 28.8710μs | 6.4294μs | 155.5360 KOps/s | 154.8652 KOps/s | |
test_nested_getitem | 44.0400μs | 6.1075μs | 163.7341 KOps/s | 164.6926 KOps/s | |
test_stacked_getitemleaf | 39.9100μs | 6.4178μs | 155.8158 KOps/s | 156.3829 KOps/s | |
test_stacked_getitem | 33.2500μs | 6.0063μs | 166.4924 KOps/s | 165.6157 KOps/s | |
test_lock_nested | 9.0842ms | 0.3439ms | 2.9080 KOps/s | 3.0054 KOps/s | |
test_lock_stack_nested | 0.3751ms | 0.3360ms | 2.9764 KOps/s | 2.9620 KOps/s | |
test_unlock_nested | 0.4716ms | 0.2783ms | 3.5930 KOps/s | 3.6283 KOps/s | |
test_unlock_stack_nested | 0.3137ms | 0.2739ms | 3.6508 KOps/s | 3.6384 KOps/s | |
test_flatten_speed | 0.1260ms | 77.4830μs | 12.9061 KOps/s | 13.1444 KOps/s | |
test_unflatten_speed | 0.3611ms | 0.3183ms | 3.1415 KOps/s | 3.1649 KOps/s | |
test_common_ops | 0.9229ms | 0.6238ms | 1.6030 KOps/s | 1.6260 KOps/s | |
test_creation | 97.4510μs | 1.7722μs | 564.2782 KOps/s | 577.8826 KOps/s | |
test_creation_empty | 34.5000μs | 9.2647μs | 107.9371 KOps/s | 107.3996 KOps/s | |
test_creation_nested_1 | 40.7700μs | 10.9157μs | 91.6107 KOps/s | 90.7857 KOps/s | |
test_creation_nested_2 | 36.1400μs | 13.6680μs | 73.1634 KOps/s | 73.2221 KOps/s | |
test_clone | 42.5410μs | 10.3707μs | 96.4259 KOps/s | 99.5023 KOps/s | |
test_getitem[int] | 1.2541ms | 10.5132μs | 95.1185 KOps/s | 96.0944 KOps/s | |
test_getitem[slice_int] | 0.1071ms | 20.2499μs | 49.3830 KOps/s | 49.7169 KOps/s | |
test_getitem[range] | 0.1264ms | 37.5711μs | 26.6162 KOps/s | 27.0442 KOps/s | |
test_getitem[tuple] | 0.1064ms | 17.6363μs | 56.7013 KOps/s | 56.8018 KOps/s | |
test_getitem[list] | 0.1986ms | 32.6527μs | 30.6254 KOps/s | 30.3593 KOps/s | |
test_setitem_dim[int] | 40.1500μs | 18.4766μs | 54.1225 KOps/s | 53.2497 KOps/s | |
test_setitem_dim[slice_int] | 0.1300ms | 37.6816μs | 26.5381 KOps/s | 26.4219 KOps/s | |
test_setitem_dim[range] | 89.4700μs | 53.1458μs | 18.8162 KOps/s | 18.9863 KOps/s | |
test_setitem_dim[tuple] | 60.2900μs | 30.9145μs | 32.3473 KOps/s | 31.0964 KOps/s | |
test_setitem | 71.3110μs | 14.9975μs | 66.6778 KOps/s | 65.1013 KOps/s | |
test_set | 74.0210μs | 14.7248μs | 67.9124 KOps/s | 68.8269 KOps/s | |
test_set_shared | 0.5283ms | 0.1556ms | 6.4274 KOps/s | 6.3747 KOps/s | |
test_update | 0.4441ms | 18.3971μs | 54.3565 KOps/s | 55.7534 KOps/s | |
test_update_nested | 78.3310μs | 24.6617μs | 40.5488 KOps/s | 42.7758 KOps/s | |
test_update__nested | 0.5426ms | 24.6018μs | 40.6475 KOps/s | 41.9866 KOps/s | |
test_set_nested | 58.3510μs | 15.7360μs | 63.5486 KOps/s | 63.2127 KOps/s | |
test_set_nested_new | 72.9800μs | 18.0592μs | 55.3736 KOps/s | 54.6831 KOps/s | |
test_select | 71.9300μs | 29.1502μs | 34.3051 KOps/s | 32.8960 KOps/s | |
test_select_nested | 73.0510μs | 43.4908μs | 22.9934 KOps/s | 22.9850 KOps/s | |
test_exclude_nested | 0.1164ms | 61.2723μs | 16.3206 KOps/s | 15.8147 KOps/s | |
test_empty[True] | 0.3796ms | 0.2922ms | 3.4219 KOps/s | 3.3920 KOps/s | |
test_empty[False] | 2.8320μs | 0.8163μs | 1.2250 MOps/s | 1.2281 MOps/s | |
test_to | 98.5110μs | 56.4591μs | 17.7119 KOps/s | 18.0457 KOps/s | |
test_to_nonblocking | 94.2510μs | 45.8922μs | 21.7902 KOps/s | 21.7184 KOps/s | |
test_unbind_speed | 0.2832ms | 0.2376ms | 4.2091 KOps/s | 4.2416 KOps/s | |
test_unbind_speed_stack0 | 0.3077ms | 0.2315ms | 4.3203 KOps/s | 4.2868 KOps/s | |
test_unbind_speed_stack1 | 93.9816ms | 0.8160ms | 1.2255 KOps/s | 1.3884 KOps/s | |
test_split | 1.5602ms | 1.4319ms | 698.3579 Ops/s | 637.2861 Ops/s | |
test_chunk | 96.8348ms | 1.7120ms | 584.1280 Ops/s | 634.7362 Ops/s | |
test_consolidate[False-None] | 3.3610ms | 2.6599ms | 375.9478 Ops/s | 376.3507 Ops/s | |
test_consolidate[default-None] | 1.8301ms | 1.7007ms | 587.9896 Ops/s | 590.9999 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8243ms | 1.7226ms | 580.5137 Ops/s | 585.6753 Ops/s | |
test_consolidate_njt[False-None] | 6.7027ms | 6.4383ms | 155.3201 Ops/s | 158.6316 Ops/s | |
test_to[False-False-None] | 1.7179ms | 1.6461ms | 607.5092 Ops/s | 616.0839 Ops/s | |
test_to[True-False-None] | 1.5544ms | 1.3148ms | 760.5678 Ops/s | 758.0677 Ops/s | |
test_to[within-False-None] | 4.3359ms | 4.1322ms | 242.0034 Ops/s | 180.4394 Ops/s | |
test_to[True-default-None] | 5.3481ms | 5.0809ms | 196.8147 Ops/s | 191.8695 Ops/s | |
test_to_njt[False-False-None] | 7.0061ms | 6.8347ms | 146.3124 Ops/s | 140.0859 Ops/s | |
test_to_njt[True-False-None] | 5.4991ms | 5.3425ms | 187.1770 Ops/s | 179.0879 Ops/s | |
test_to_njt[within-False-None] | 11.9047ms | 11.8188ms | 84.6107 Ops/s | 75.7825 Ops/s | |
test_creation[device0] | 0.4613ms | 78.8775μs | 12.6779 KOps/s | 12.1032 KOps/s | |
test_creation_from_tensor | 0.4534ms | 83.3281μs | 12.0008 KOps/s | 11.5392 KOps/s | |
test_add_one[memmap_tensor0] | 0.5673ms | 6.4422μs | 155.2269 KOps/s | 155.1310 KOps/s | |
test_contiguous[memmap_tensor0] | 1.8220μs | 0.4260μs | 2.3475 MOps/s | 2.3845 MOps/s | |
test_stack[memmap_tensor0] | 38.6500μs | 4.2067μs | 237.7186 KOps/s | 237.8749 KOps/s | |
test_memmaptd_index | 1.5161ms | 0.2356ms | 4.2445 KOps/s | 4.1825 KOps/s | |
test_memmaptd_index_astensor | 0.4297ms | 0.2955ms | 3.3845 KOps/s | 3.3673 KOps/s | |
test_memmaptd_index_op | 0.7018ms | 0.5701ms | 1.7541 KOps/s | 1.7038 KOps/s | |
test_serialize_model | 0.4389s | 0.1741s | 5.7429 Ops/s | 7.6821 Ops/s | |
test_serialize_model_pickle | 1.3486s | 1.1865s | 0.8428 Ops/s | 0.8222 Ops/s | |
test_serialize_weights | 0.1307s | 0.1295s | 7.7248 Ops/s | 7.7217 Ops/s | |
test_serialize_weights_returnearly | 0.3176s | 54.8938ms | 18.2170 Ops/s | 11.3965 Ops/s | |
test_serialize_weights_pickle | 1.3608s | 1.2169s | 0.8218 Ops/s | 0.8223 Ops/s | |
test_reshape_pytree | 52.5100μs | 22.4259μs | 44.5913 KOps/s | 45.1960 KOps/s | |
test_reshape_td | 66.9910μs | 26.6512μs | 37.5218 KOps/s | 37.0855 KOps/s | |
test_view_pytree | 58.3810μs | 23.1660μs | 43.1667 KOps/s | 45.9772 KOps/s | |
test_view_td | 59.7110μs | 31.8349μs | 31.4121 KOps/s | 31.0110 KOps/s | |
test_unbind_pytree | 56.9300μs | 29.3854μs | 34.0305 KOps/s | 36.0780 KOps/s | |
test_unbind_td | 0.7977ms | 36.9769μs | 27.0439 KOps/s | 27.5110 KOps/s | |
test_split_pytree | 69.8310μs | 30.7637μs | 32.5058 KOps/s | 33.3172 KOps/s | |
test_split_td | 0.7814ms | 38.6584μs | 25.8676 KOps/s | 25.1133 KOps/s | |
test_add_pytree | 81.7510μs | 34.3711μs | 29.0942 KOps/s | 29.3666 KOps/s | |
test_add_td | 83.4010μs | 49.0194μs | 20.4001 KOps/s | 19.8770 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2590ms | 0.1240ms | 8.0652 KOps/s | 7.9965 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2618ms | 0.1357ms | 7.3690 KOps/s | 7.3616 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2330ms | 96.7780μs | 10.3329 KOps/s | 10.4852 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.3535ms | 0.1494ms | 6.6951 KOps/s | 6.6517 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 48.3610μs | 24.6590μs | 40.5531 KOps/s | 42.2364 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 62.4710μs | 29.1890μs | 34.2594 KOps/s | 34.4074 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4636ms | 63.8292μs | 15.6668 KOps/s | 15.7686 KOps/s | |
test_compile_copy_nested[pytree-eager] | 86.6310μs | 48.8225μs | 20.4824 KOps/s | 20.4516 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2820ms | 0.1449ms | 6.9022 KOps/s | 7.0254 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3287ms | 0.2186ms | 4.5744 KOps/s | 4.6617 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2181ms | 99.6630μs | 10.0338 KOps/s | 10.2020 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1226ms | 55.9198μs | 17.8827 KOps/s | 17.8563 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1785ms | 0.1366ms | 7.3230 KOps/s | 7.4825 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5657ms | 0.4817ms | 2.0761 KOps/s | 2.0971 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3909ms | 0.2620ms | 3.8165 KOps/s | 3.8486 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2076ms | 0.1481ms | 6.7537 KOps/s | 6.9986 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1646ms | 71.5820μs | 13.9700 KOps/s | 14.5332 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1398ms | 98.0856μs | 10.1952 KOps/s | 10.1248 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4715ms | 0.4058ms | 2.4642 KOps/s | 2.5089 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1868ms | 0.1392ms | 7.1859 KOps/s | 7.5236 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 50.0400μs | 19.3967μs | 51.5551 KOps/s | 57.8099 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 61.4500μs | 31.6369μs | 31.6086 KOps/s | 32.0859 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2083ms | 70.0115μs | 14.2834 KOps/s | 14.4031 KOps/s | |
test_compile_copy_flat[pytree-eager] | 81.3010μs | 52.1833μs | 19.1632 KOps/s | 19.2648 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6113ms | 0.3913ms | 2.5555 KOps/s | 2.2042 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.9061ms | 2.5296ms | 395.3241 Ops/s | 388.6302 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6176ms | 0.4376ms | 2.2851 KOps/s | 2.2817 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.6919ms | 2.5373ms | 394.1180 Ops/s | 392.4757 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1801ms | 0.1129ms | 8.8547 KOps/s | 8.4819 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5924ms | 82.2059μs | 12.1646 KOps/s | 12.7448 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2463ms | 0.1054ms | 9.4834 KOps/s | 9.4262 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2113ms | 66.6735μs | 14.9985 KOps/s | 14.6797 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2551ms | 0.1125ms | 8.8875 KOps/s | 9.3231 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2231ms | 70.9828μs | 14.0879 KOps/s | 14.9035 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1444ms | 0.1024ms | 9.7637 KOps/s | 10.1251 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1431ms | 17.3680μs | 57.5772 KOps/s | 57.8091 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2044ms | 0.1006ms | 9.9439 KOps/s | 10.4678 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 47.6910μs | 15.5909μs | 64.1398 KOps/s | 64.4175 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1564ms | 96.2287μs | 10.3919 KOps/s | 10.3663 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 56.6910μs | 15.6559μs | 63.8739 KOps/s | 64.6105 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1707ms | 0.1000ms | 9.9955 KOps/s | 10.0194 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6026ms | 17.2182μs | 58.0781 KOps/s | 59.2036 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2507ms | 0.1000ms | 9.9981 KOps/s | 10.3815 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 44.5110μs | 15.6289μs | 63.9840 KOps/s | 64.9556 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1841ms | 95.8028μs | 10.4381 KOps/s | 10.3212 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.1392ms | 16.5930μs | 60.2662 KOps/s | 65.2166 KOps/s | |
test_mod_add[eager] | 77.7310μs | 38.0688μs | 26.2682 KOps/s | 26.0857 KOps/s | |
test_mod_add[compile] | 0.4469ms | 80.9989μs | 12.3459 KOps/s | 12.2826 KOps/s | |
test_mod_add[compile-overhead] | 0.3261ms | 0.1673ms | 5.9777 KOps/s | 5.7093 KOps/s | |
test_mod_wrap[eager] | 0.3953ms | 0.2463ms | 4.0606 KOps/s | 4.0044 KOps/s | |
test_mod_wrap[compile] | 0.6903ms | 0.2857ms | 3.5000 KOps/s | 3.4801 KOps/s | |
test_mod_wrap[compile-overhead] | 7.3085ms | 3.9376ms | 253.9602 Ops/s | 268.0857 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.8313ms | 1.3320ms | 750.7510 Ops/s | 678.5650 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4458ms | 1.2482ms | 801.1468 Ops/s | 728.8561 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3595ms | 0.9182ms | 1.0891 KOps/s | 950.9055 Ops/s | |
test_seq_add[eager] | 0.2366ms | 0.1127ms | 8.8701 KOps/s | 8.4282 KOps/s | |
test_seq_add[compile] | 0.5105ms | 87.5527μs | 11.4217 KOps/s | 11.0398 KOps/s | |
test_seq_add[compile-overhead] | 0.5478ms | 0.1295ms | 7.7212 KOps/s | 7.4352 KOps/s | |
test_seq_wrap[eager] | 0.8383ms | 0.4223ms | 2.3678 KOps/s | 2.2601 KOps/s | |
test_seq_wrap[compile] | 0.7511ms | 0.3030ms | 3.3004 KOps/s | 3.3074 KOps/s | |
test_seq_wrap[compile-overhead] | 0.6509ms | 0.2250ms | 4.4451 KOps/s | 4.4331 KOps/s | |
test_func_call_runtime[False-eager] | 1.1486ms | 0.7192ms | 1.3904 KOps/s | 1.3644 KOps/s | |
test_func_call_runtime[False-compile] | 3.5699ms | 0.7611ms | 1.3139 KOps/s | 1.3362 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5189ms | 0.3677ms | 2.7198 KOps/s | 2.7470 KOps/s | |
test_func_call_runtime[True-eager] | 1.0882ms | 0.9153ms | 1.0926 KOps/s | 1.0821 KOps/s | |
test_func_call_runtime[True-compile] | 0.8978ms | 0.7770ms | 1.2871 KOps/s | 1.3153 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4563ms | 0.3953ms | 2.5296 KOps/s | 2.5762 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8708ms | 0.7524ms | 1.3290 KOps/s | 1.3079 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8794ms | 0.7363ms | 1.3581 KOps/s | 1.3515 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4804ms | 0.3650ms | 2.7395 KOps/s | 2.7229 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1344ms | 0.9873ms | 1.0129 KOps/s | 1.0005 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0890ms | 0.9796ms | 1.0208 KOps/s | 1.0149 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2040ms | 0.9837ms | 1.0165 KOps/s | 1.0079 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4430ms | 2.0652ms | 484.2205 Ops/s | 478.5293 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9130ms | 0.8077ms | 1.2380 KOps/s | 1.2234 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5274ms | 0.4206ms | 2.3773 KOps/s | 2.3114 KOps/s | |
test_distributed | 2.4701ms | 0.1691ms | 5.9145 KOps/s | 8.5396 KOps/s | |
test_tdmodule | 58.5810μs | 19.9284μs | 50.1795 KOps/s | 50.5840 KOps/s | |
test_tdmodule_dispatch | 80.6910μs | 35.1020μs | 28.4884 KOps/s | 27.6773 KOps/s | |
test_tdseq | 0.1211ms | 20.8710μs | 47.9133 KOps/s | 48.2753 KOps/s | |
test_tdseq_dispatch | 71.5010μs | 38.3947μs | 26.0453 KOps/s | 25.5496 KOps/s | |
test_instantiation_functorch | 1.6184ms | 1.5208ms | 657.5612 Ops/s | 653.9239 Ops/s | |
test_exec_functorch | 0.1855ms | 0.1377ms | 7.2598 KOps/s | 7.0061 KOps/s | |
test_exec_functional_call | 0.2213ms | 0.1303ms | 7.6775 KOps/s | 7.3426 KOps/s | |
test_exec_td_decorator | 0.3765ms | 0.1808ms | 5.5307 KOps/s | 5.3439 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7913ms | 0.6772ms | 1.4766 KOps/s | 1.4347 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7920ms | 0.6778ms | 1.4754 KOps/s | 1.4328 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7320ms | 0.5900ms | 1.6949 KOps/s | 1.6583 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7085ms | 0.5874ms | 1.7024 KOps/s | 1.6471 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.8486ms | 18.8582ms | 53.0274 Ops/s | 52.3684 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.6845ms | 18.9995ms | 52.6329 Ops/s | 52.9531 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 18.8016ms | 18.6714ms | 53.5580 Ops/s | 52.0808 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 18.7920ms | 18.6848ms | 53.5194 Ops/s | 52.8394 Ops/s | |
test_to_module_speed[True] | 1.0795ms | 0.9556ms | 1.0465 KOps/s | 1.0447 KOps/s | |
test_to_module_speed[False] | 1.0554ms | 0.9292ms | 1.0761 KOps/s | 1.0693 KOps/s | |
test_tc_init | 68.8310μs | 36.1841μs | 27.6365 KOps/s | 27.8556 KOps/s | |
test_tc_init_nested | 0.1621ms | 73.1469μs | 13.6711 KOps/s | 13.5332 KOps/s | |
test_tc_first_layer_tensor | 28.0300μs | 0.7960μs | 1.2563 MOps/s | 1.4267 MOps/s | |
test_tc_first_layer_nontensor | 43.5800μs | 2.2155μs | 451.3723 KOps/s | 447.9325 KOps/s | |
test_tc_second_layer_tensor | 27.1153μs | 1.3974μs | 715.5952 KOps/s | 707.6609 KOps/s | |
test_tc_second_layer_nontensor | 91.9310μs | 2.9448μs | 339.5852 KOps/s | 342.5835 KOps/s | |
test_unbind | 0.2221s | 11.5116ms | 86.8687 Ops/s | 142.8503 Ops/s | |
test_full_like | 9.7514ms | 9.3941ms | 106.4493 Ops/s | 105.2902 Ops/s | |
test_zeros_like | 5.0032ms | 4.3619ms | 229.2583 Ops/s | 235.1976 Ops/s | |
test_ones_like | 10.4475ms | 4.3759ms | 228.5250 Ops/s | 233.6699 Ops/s | |
test_clone | 7.2087ms | 6.6677ms | 149.9777 Ops/s | 106.0417 Ops/s | |
test_squeeze | 70.6000μs | 9.4852μs | 105.4269 KOps/s | 106.6199 KOps/s | |
test_unsqueeze | 0.1679ms | 70.5200μs | 14.1804 KOps/s | 14.1903 KOps/s | |
test_split | 0.2885ms | 0.1560ms | 6.4091 KOps/s | 6.3855 KOps/s | |
test_permute | 0.2887ms | 0.1783ms | 5.6072 KOps/s | 5.5623 KOps/s | |
test_stack | 51.8384ms | 51.0572ms | 19.5859 Ops/s | 19.6141 Ops/s | |
test_cat | 51.4483ms | 50.9571ms | 19.6244 Ops/s | 19.5819 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):