Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BugFix] Fix call to tree.plot in tests #2547

Merged
merged 1 commit into from
Nov 9, 2024
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 9, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
Copy link

pytorch-bot bot commented Nov 9, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2547

Note: Links to docs will display an error until the docs builds have been completed.

❌ 18 New Failures, 4 Unrelated Failures

As of commit 0dcbbd9 with merge base 218d5bf (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

vmoens added a commit that referenced this pull request Nov 9, 2024
ghstack-source-id: 4a5babbf46294ab6ed4a791e26cfacaf3a41a2e0
Pull Request resolved: #2547
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 9, 2024
@vmoens vmoens merged commit 0dcbbd9 into gh/vmoens/35/base Nov 9, 2024
18 of 33 checks passed
vmoens added a commit that referenced this pull request Nov 9, 2024
ghstack-source-id: 4a5babbf46294ab6ed4a791e26cfacaf3a41a2e0
Pull Request resolved: #2547
@vmoens vmoens deleted the gh/vmoens/35/head branch November 9, 2024 21:42
Copy link

github-actions bot commented Nov 9, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}3$. Worsened: $\large\color{#d91a1a}6$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4247s 0.4226s 2.3661 Ops/s 2.2762 Ops/s $\color{#35bf28}+3.95\%$
test_transformed 0.6918s 0.6181s 1.6178 Ops/s 1.6724 Ops/s $\color{#d91a1a}-3.27\%$
test_serial 1.3381s 1.3348s 0.7492 Ops/s 0.7387 Ops/s $\color{#35bf28}+1.42\%$
test_parallel 1.2636s 1.2546s 0.7971 Ops/s 0.7699 Ops/s $\color{#35bf28}+3.53\%$
test_step_mdp_speed[True-True-True-True-True] 0.2328ms 27.1521μs 36.8295 KOps/s 37.5991 KOps/s $\color{#d91a1a}-2.05\%$
test_step_mdp_speed[True-True-True-True-False] 60.9140μs 15.9760μs 62.5939 KOps/s 64.1795 KOps/s $\color{#d91a1a}-2.47\%$
test_step_mdp_speed[True-True-True-False-True] 57.5780μs 15.5333μs 64.3777 KOps/s 65.9805 KOps/s $\color{#d91a1a}-2.43\%$
test_step_mdp_speed[True-True-True-False-False] 33.8530μs 9.0699μs 110.2545 KOps/s 113.5623 KOps/s $\color{#d91a1a}-2.91\%$
test_step_mdp_speed[True-True-False-True-True] 73.2570μs 29.1874μs 34.2614 KOps/s 35.1890 KOps/s $\color{#d91a1a}-2.64\%$
test_step_mdp_speed[True-True-False-True-False] 64.5310μs 17.9853μs 55.6011 KOps/s 57.9956 KOps/s $\color{#d91a1a}-4.13\%$
test_step_mdp_speed[True-True-False-False-True] 57.5780μs 17.3769μs 57.5477 KOps/s 59.7402 KOps/s $\color{#d91a1a}-3.67\%$
test_step_mdp_speed[True-True-False-False-False] 35.2660μs 11.0504μs 90.4944 KOps/s 96.6324 KOps/s $\textbf{\color{#d91a1a}-6.35\%}$
test_step_mdp_speed[True-False-True-True-True] 75.1310μs 30.9277μs 32.3334 KOps/s 33.3720 KOps/s $\color{#d91a1a}-3.11\%$
test_step_mdp_speed[True-False-True-True-False] 57.3870μs 19.7944μs 50.5193 KOps/s 53.1474 KOps/s $\color{#d91a1a}-4.94\%$
test_step_mdp_speed[True-False-True-False-True] 50.2850μs 17.4146μs 57.4230 KOps/s 59.2327 KOps/s $\color{#d91a1a}-3.06\%$
test_step_mdp_speed[True-False-True-False-False] 38.5120μs 10.7912μs 92.6683 KOps/s 95.5676 KOps/s $\color{#d91a1a}-3.03\%$
test_step_mdp_speed[True-False-False-True-True] 91.7990μs 32.3774μs 30.8858 KOps/s 31.7856 KOps/s $\color{#d91a1a}-2.83\%$
test_step_mdp_speed[True-False-False-True-False] 55.7540μs 21.4339μs 46.6550 KOps/s 48.7976 KOps/s $\color{#d91a1a}-4.39\%$
test_step_mdp_speed[True-False-False-False-True] 59.6420μs 18.9646μs 52.7299 KOps/s 54.4662 KOps/s $\color{#d91a1a}-3.19\%$
test_step_mdp_speed[True-False-False-False-False] 37.4610μs 12.5615μs 79.6083 KOps/s 83.5732 KOps/s $\color{#d91a1a}-4.74\%$
test_step_mdp_speed[False-True-True-True-True] 73.2570μs 31.1484μs 32.1044 KOps/s 33.2301 KOps/s $\color{#d91a1a}-3.39\%$
test_step_mdp_speed[False-True-True-True-False] 48.9710μs 19.7905μs 50.5294 KOps/s 52.7586 KOps/s $\color{#d91a1a}-4.23\%$
test_step_mdp_speed[False-True-True-False-True] 58.5500μs 19.9368μs 50.1586 KOps/s 52.1332 KOps/s $\color{#d91a1a}-3.79\%$
test_step_mdp_speed[False-True-True-False-False] 40.1450μs 12.2905μs 81.3638 KOps/s 84.3101 KOps/s $\color{#d91a1a}-3.49\%$
test_step_mdp_speed[False-True-False-True-True] 70.5820μs 32.4456μs 30.8208 KOps/s 31.7547 KOps/s $\color{#d91a1a}-2.94\%$
test_step_mdp_speed[False-True-False-True-False] 0.1710ms 22.1678μs 45.1106 KOps/s 49.2487 KOps/s $\textbf{\color{#d91a1a}-8.40\%}$
test_step_mdp_speed[False-True-False-False-True] 3.2222ms 21.4704μs 46.5758 KOps/s 48.1759 KOps/s $\color{#d91a1a}-3.32\%$
test_step_mdp_speed[False-True-False-False-False] 37.7610μs 14.0537μs 71.1555 KOps/s 74.6231 KOps/s $\color{#d91a1a}-4.65\%$
test_step_mdp_speed[False-False-True-True-True] 83.6680μs 33.9456μs 29.4589 KOps/s 30.5018 KOps/s $\color{#d91a1a}-3.42\%$
test_step_mdp_speed[False-False-True-True-False] 57.3280μs 23.2168μs 43.0723 KOps/s 45.8439 KOps/s $\textbf{\color{#d91a1a}-6.05\%}$
test_step_mdp_speed[False-False-True-False-True] 62.4770μs 21.5367μs 46.4324 KOps/s 46.7435 KOps/s $\color{#d91a1a}-0.67\%$
test_step_mdp_speed[False-False-True-False-False] 40.8360μs 14.0775μs 71.0355 KOps/s 74.9492 KOps/s $\textbf{\color{#d91a1a}-5.22\%}$
test_step_mdp_speed[False-False-False-True-True] 85.5690μs 35.5635μs 28.1188 KOps/s 29.1431 KOps/s $\color{#d91a1a}-3.52\%$
test_step_mdp_speed[False-False-False-True-False] 81.1130μs 24.4206μs 40.9491 KOps/s 43.0846 KOps/s $\color{#d91a1a}-4.96\%$
test_step_mdp_speed[False-False-False-False-True] 48.5500μs 22.9535μs 43.5663 KOps/s 45.3839 KOps/s $\color{#d91a1a}-4.00\%$
test_step_mdp_speed[False-False-False-False-False] 78.3260μs 15.2778μs 65.4543 KOps/s 67.4365 KOps/s $\color{#d91a1a}-2.94\%$
test_values[generalized_advantage_estimate-True-True] 11.3574ms 9.7048ms 103.0419 Ops/s 103.8670 Ops/s $\color{#d91a1a}-0.79\%$
test_values[vec_generalized_advantage_estimate-True-True] 40.3549ms 35.5730ms 28.1112 Ops/s 28.0487 Ops/s $\color{#35bf28}+0.22\%$
test_values[td0_return_estimate-False-False] 0.2275ms 0.1664ms 6.0103 KOps/s 5.9895 KOps/s $\color{#35bf28}+0.35\%$
test_values[td1_return_estimate-False-False] 29.6872ms 24.1338ms 41.4357 Ops/s 41.4395 Ops/s $-0.01\%$
test_values[vec_td1_return_estimate-False-False] 37.8096ms 35.7258ms 27.9910 Ops/s 28.0672 Ops/s $\color{#d91a1a}-0.27\%$
test_values[td_lambda_return_estimate-True-False] 38.3121ms 34.5513ms 28.9425 Ops/s 28.9634 Ops/s $\color{#d91a1a}-0.07\%$
test_values[vec_td_lambda_return_estimate-True-False] 37.9492ms 35.7321ms 27.9860 Ops/s 27.8704 Ops/s $\color{#35bf28}+0.41\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 13.5482ms 8.5894ms 116.4221 Ops/s 118.7161 Ops/s $\color{#d91a1a}-1.93\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.3733ms 2.0190ms 495.3068 Ops/s 500.3915 Ops/s $\color{#d91a1a}-1.02\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.6352ms 0.3591ms 2.7844 KOps/s 2.8127 KOps/s $\color{#d91a1a}-1.00\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 49.6021ms 48.7884ms 20.4967 Ops/s 20.6040 Ops/s $\color{#d91a1a}-0.52\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.9753ms 3.0347ms 329.5174 Ops/s 329.5111 Ops/s $+0.00\%$
test_dqn_speed[False-None] 5.7321ms 1.3315ms 751.0150 Ops/s 732.6514 Ops/s $\color{#35bf28}+2.51\%$
test_dqn_speed[False-backward] 1.8911ms 1.8071ms 553.3717 Ops/s 545.9158 Ops/s $\color{#35bf28}+1.37\%$
test_dqn_speed[True-None] 0.7260ms 0.4660ms 2.1458 KOps/s 2.1568 KOps/s $\color{#d91a1a}-0.51\%$
test_dqn_speed[True-backward] 0.9266ms 0.8886ms 1.1254 KOps/s 1.1341 KOps/s $\color{#d91a1a}-0.77\%$
test_dqn_speed[reduce-overhead-None] 0.5863ms 0.4685ms 2.1342 KOps/s 2.1515 KOps/s $\color{#d91a1a}-0.80\%$
test_dqn_speed[reduce-overhead-backward] 0.9350ms 0.8899ms 1.1237 KOps/s 1.1283 KOps/s $\color{#d91a1a}-0.41\%$
test_ddpg_speed[False-None] 3.4360ms 2.7736ms 360.5475 Ops/s 357.7384 Ops/s $\color{#35bf28}+0.79\%$
test_ddpg_speed[False-backward] 4.6857ms 3.9281ms 254.5777 Ops/s 257.6554 Ops/s $\color{#d91a1a}-1.19\%$
test_ddpg_speed[True-None] 2.8096ms 1.0231ms 977.4332 Ops/s 997.7160 Ops/s $\color{#d91a1a}-2.03\%$
test_ddpg_speed[True-backward] 1.9798ms 1.8976ms 526.9831 Ops/s 527.3097 Ops/s $\color{#d91a1a}-0.06\%$
test_ddpg_speed[reduce-overhead-None] 1.3057ms 1.0031ms 996.9573 Ops/s 997.4925 Ops/s $\color{#d91a1a}-0.05\%$
test_ddpg_speed[reduce-overhead-backward] 1.9857ms 1.9052ms 524.8915 Ops/s 521.9980 Ops/s $\color{#35bf28}+0.55\%$
test_sac_speed[False-None] 9.8417ms 7.9150ms 126.3418 Ops/s 127.2448 Ops/s $\color{#d91a1a}-0.71\%$
test_sac_speed[False-backward] 11.7121ms 10.6080ms 94.2681 Ops/s 94.7263 Ops/s $\color{#d91a1a}-0.48\%$
test_sac_speed[True-None] 2.2159ms 1.8368ms 544.4201 Ops/s 546.8154 Ops/s $\color{#d91a1a}-0.44\%$
test_sac_speed[True-backward] 3.7790ms 3.5428ms 282.2599 Ops/s 285.2241 Ops/s $\color{#d91a1a}-1.04\%$
test_sac_speed[reduce-overhead-None] 2.0847ms 1.8349ms 544.9931 Ops/s 540.0416 Ops/s $\color{#35bf28}+0.92\%$
test_sac_speed[reduce-overhead-backward] 3.5943ms 3.5280ms 283.4480 Ops/s 286.9246 Ops/s $\color{#d91a1a}-1.21\%$
test_redq_speed[False-None] 21.2821ms 14.6081ms 68.4550 Ops/s 65.1595 Ops/s $\textbf{\color{#35bf28}+5.06\%}$
test_redq_speed[False-backward] 24.5664ms 22.4993ms 44.4458 Ops/s 45.1124 Ops/s $\color{#d91a1a}-1.48\%$
test_redq_speed[True-None] 5.3956ms 4.6002ms 217.3828 Ops/s 219.1510 Ops/s $\color{#d91a1a}-0.81\%$
test_redq_speed[True-backward] 13.2173ms 11.8730ms 84.2250 Ops/s 84.1707 Ops/s $\color{#35bf28}+0.06\%$
test_redq_speed[reduce-overhead-None] 5.5284ms 4.6220ms 216.3568 Ops/s 221.1885 Ops/s $\color{#d91a1a}-2.18\%$
test_redq_speed[reduce-overhead-backward] 13.0022ms 12.1025ms 82.6279 Ops/s 83.8201 Ops/s $\color{#d91a1a}-1.42\%$
test_redq_deprec_speed[False-None] 14.8729ms 12.6597ms 78.9908 Ops/s 80.1445 Ops/s $\color{#d91a1a}-1.44\%$
test_redq_deprec_speed[False-backward] 21.1447ms 18.1344ms 55.1438 Ops/s 55.8904 Ops/s $\color{#d91a1a}-1.34\%$
test_redq_deprec_speed[True-None] 4.2712ms 3.5985ms 277.8962 Ops/s 278.7729 Ops/s $\color{#d91a1a}-0.31\%$
test_redq_deprec_speed[True-backward] 8.8501ms 8.1531ms 122.6520 Ops/s 122.6227 Ops/s $\color{#35bf28}+0.02\%$
test_redq_deprec_speed[reduce-overhead-None] 4.0256ms 3.5725ms 279.9181 Ops/s 279.4409 Ops/s $\color{#35bf28}+0.17\%$
test_redq_deprec_speed[reduce-overhead-backward] 9.5368ms 8.1443ms 122.7859 Ops/s 125.8180 Ops/s $\color{#d91a1a}-2.41\%$
test_td3_speed[False-None] 8.4301ms 7.7812ms 128.5141 Ops/s 128.5096 Ops/s $+0.00\%$
test_td3_speed[False-backward] 11.8058ms 10.2211ms 97.8365 Ops/s 97.6195 Ops/s $\color{#35bf28}+0.22\%$
test_td3_speed[True-None] 2.0401ms 1.7507ms 571.2100 Ops/s 576.5753 Ops/s $\color{#d91a1a}-0.93\%$
test_td3_speed[True-backward] 3.4200ms 3.3558ms 297.9937 Ops/s 297.2464 Ops/s $\color{#35bf28}+0.25\%$
test_td3_speed[reduce-overhead-None] 2.0419ms 1.7540ms 570.1144 Ops/s 576.1458 Ops/s $\color{#d91a1a}-1.05\%$
test_td3_speed[reduce-overhead-backward] 3.4344ms 3.3575ms 297.8451 Ops/s 298.1163 Ops/s $\color{#d91a1a}-0.09\%$
test_cql_speed[False-None] 37.9054ms 35.8514ms 27.8929 Ops/s 27.6287 Ops/s $\color{#35bf28}+0.96\%$
test_cql_speed[False-backward] 49.9758ms 46.0547ms 21.7133 Ops/s 21.8172 Ops/s $\color{#d91a1a}-0.48\%$
test_cql_speed[True-None] 17.2816ms 15.4307ms 64.8059 Ops/s 64.0114 Ops/s $\color{#35bf28}+1.24\%$
test_cql_speed[True-backward] 23.8052ms 22.3754ms 44.6919 Ops/s 44.7740 Ops/s $\color{#d91a1a}-0.18\%$
test_cql_speed[reduce-overhead-None] 16.9277ms 15.5141ms 64.4575 Ops/s 64.2938 Ops/s $\color{#35bf28}+0.25\%$
test_cql_speed[reduce-overhead-backward] 23.7832ms 22.2361ms 44.9718 Ops/s 44.7536 Ops/s $\color{#35bf28}+0.49\%$
test_a2c_speed[False-None] 7.8713ms 7.0266ms 142.3168 Ops/s 140.9902 Ops/s $\color{#35bf28}+0.94\%$
test_a2c_speed[False-backward] 15.1686ms 14.1165ms 70.8392 Ops/s 71.5474 Ops/s $\color{#d91a1a}-0.99\%$
test_a2c_speed[True-None] 3.7708ms 3.2946ms 303.5311 Ops/s 298.8626 Ops/s $\color{#35bf28}+1.56\%$
test_a2c_speed[True-backward] 10.0124ms 9.6761ms 103.3478 Ops/s 102.9207 Ops/s $\color{#35bf28}+0.41\%$
test_a2c_speed[reduce-overhead-None] 4.2760ms 3.4031ms 293.8533 Ops/s 295.4482 Ops/s $\color{#d91a1a}-0.54\%$
test_a2c_speed[reduce-overhead-backward] 9.9458ms 9.6610ms 103.5090 Ops/s 102.5735 Ops/s $\color{#35bf28}+0.91\%$
test_ppo_speed[False-None] 8.6018ms 7.3340ms 136.3518 Ops/s 133.5787 Ops/s $\color{#35bf28}+2.08\%$
test_ppo_speed[False-backward] 14.7699ms 14.3621ms 69.6276 Ops/s 67.5273 Ops/s $\color{#35bf28}+3.11\%$
test_ppo_speed[True-None] 4.0142ms 3.6964ms 270.5320 Ops/s 268.3225 Ops/s $\color{#35bf28}+0.82\%$
test_ppo_speed[True-backward] 10.2488ms 9.5322ms 104.9077 Ops/s 103.6753 Ops/s $\color{#35bf28}+1.19\%$
test_ppo_speed[reduce-overhead-None] 4.0778ms 3.6935ms 270.7493 Ops/s 267.7747 Ops/s $\color{#35bf28}+1.11\%$
test_ppo_speed[reduce-overhead-backward] 10.5911ms 9.6057ms 104.1049 Ops/s 103.2907 Ops/s $\color{#35bf28}+0.79\%$
test_reinforce_speed[False-None] 8.6300ms 6.4666ms 154.6396 Ops/s 153.3848 Ops/s $\color{#35bf28}+0.82\%$
test_reinforce_speed[False-backward] 9.8490ms 9.6313ms 103.8287 Ops/s 102.2960 Ops/s $\color{#35bf28}+1.50\%$
test_reinforce_speed[True-None] 3.0096ms 2.6623ms 375.6148 Ops/s 375.7714 Ops/s $\color{#d91a1a}-0.04\%$
test_reinforce_speed[True-backward] 9.6776ms 8.5323ms 117.2020 Ops/s 116.9655 Ops/s $\color{#35bf28}+0.20\%$
test_reinforce_speed[reduce-overhead-None] 3.2309ms 2.6615ms 375.7258 Ops/s 375.4958 Ops/s $\color{#35bf28}+0.06\%$
test_reinforce_speed[reduce-overhead-backward] 9.6997ms 8.5654ms 116.7487 Ops/s 116.0274 Ops/s $\color{#35bf28}+0.62\%$
test_iql_speed[False-None] 33.6635ms 31.7121ms 31.5337 Ops/s 31.2494 Ops/s $\color{#35bf28}+0.91\%$
test_iql_speed[False-backward] 46.8060ms 44.5205ms 22.4616 Ops/s 22.4269 Ops/s $\color{#35bf28}+0.15\%$
test_iql_speed[True-None] 13.1077ms 10.5330ms 94.9397 Ops/s 93.5007 Ops/s $\color{#35bf28}+1.54\%$
test_iql_speed[True-backward] 22.9519ms 21.3474ms 46.8441 Ops/s 46.6188 Ops/s $\color{#35bf28}+0.48\%$
test_iql_speed[reduce-overhead-None] 11.4606ms 10.4838ms 95.3853 Ops/s 93.8453 Ops/s $\color{#35bf28}+1.64\%$
test_iql_speed[reduce-overhead-backward] 23.7992ms 21.4677ms 46.5816 Ops/s 45.4269 Ops/s $\color{#35bf28}+2.54\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.6278ms 4.8471ms 206.3077 Ops/s 204.4769 Ops/s $\color{#35bf28}+0.90\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.9544ms 0.5054ms 1.9787 KOps/s 1.9548 KOps/s $\color{#35bf28}+1.22\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8335ms 0.4775ms 2.0942 KOps/s 2.0972 KOps/s $\color{#d91a1a}-0.14\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.2908ms 4.5670ms 218.9643 Ops/s 216.4973 Ops/s $\color{#35bf28}+1.14\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.1402ms 0.4904ms 2.0394 KOps/s 2.0384 KOps/s $\color{#35bf28}+0.05\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7891ms 0.4609ms 2.1697 KOps/s 2.1389 KOps/s $\color{#35bf28}+1.44\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.4754ms 1.6331ms 612.3428 Ops/s 616.8819 Ops/s $\color{#d91a1a}-0.74\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.3578ms 1.5838ms 631.4106 Ops/s 633.2007 Ops/s $\color{#d91a1a}-0.28\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.4684ms 4.6975ms 212.8805 Ops/s 209.2511 Ops/s $\color{#35bf28}+1.73\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.7719ms 0.6423ms 1.5569 KOps/s 1.5536 KOps/s $\color{#35bf28}+0.21\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8467ms 0.6077ms 1.6457 KOps/s 1.6182 KOps/s $\color{#35bf28}+1.70\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.2203ms 4.5663ms 218.9949 Ops/s 216.3828 Ops/s $\color{#35bf28}+1.21\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 3.5420ms 0.5071ms 1.9721 KOps/s 1.9780 KOps/s $\color{#d91a1a}-0.30\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8595ms 0.4825ms 2.0724 KOps/s 2.0235 KOps/s $\color{#35bf28}+2.42\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.0621ms 4.5546ms 219.5596 Ops/s 214.0885 Ops/s $\color{#35bf28}+2.56\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.8474ms 0.4938ms 2.0250 KOps/s 2.0457 KOps/s $\color{#d91a1a}-1.01\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 8.3686ms 0.4735ms 2.1121 KOps/s 2.0633 KOps/s $\color{#35bf28}+2.37\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.0677ms 4.6724ms 214.0230 Ops/s 206.3438 Ops/s $\color{#35bf28}+3.72\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 3.3284ms 0.6485ms 1.5420 KOps/s 1.5402 KOps/s $\color{#35bf28}+0.12\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8321ms 0.6080ms 1.6447 KOps/s 1.6234 KOps/s $\color{#35bf28}+1.31\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 0.4200s 12.5703ms 79.5528 Ops/s 233.9207 Ops/s $\textbf{\color{#d91a1a}-65.99\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 6.6626ms 2.2968ms 435.3862 Ops/s 432.3684 Ops/s $\color{#35bf28}+0.70\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 1.7376ms 1.2051ms 829.7771 Ops/s 714.9811 Ops/s $\textbf{\color{#35bf28}+16.06\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 5.5375ms 4.2511ms 235.2308 Ops/s 234.1553 Ops/s $\color{#35bf28}+0.46\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 6.9100ms 2.2783ms 438.9280 Ops/s 435.0618 Ops/s $\color{#35bf28}+0.89\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 4.8062ms 1.3061ms 765.6262 Ops/s 746.7184 Ops/s $\color{#35bf28}+2.53\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 0.3697s 11.7961ms 84.7736 Ops/s 233.8663 Ops/s $\textbf{\color{#d91a1a}-63.75\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 9.9155ms 2.4058ms 415.6652 Ops/s 381.8111 Ops/s $\textbf{\color{#35bf28}+8.87\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 4.3830ms 1.4221ms 703.1834 Ops/s 689.3436 Ops/s $\color{#35bf28}+2.01\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.3618ms 11.1252ms 89.8856 Ops/s 86.1496 Ops/s $\color{#35bf28}+4.34\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 14.4315ms 14.1918ms 70.4632 Ops/s 70.2191 Ops/s $\color{#35bf28}+0.35\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 22.1609ms 20.0040ms 49.9899 Ops/s 49.4753 Ops/s $\color{#35bf28}+1.04\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 14.5372ms 14.3132ms 69.8655 Ops/s 69.5134 Ops/s $\color{#35bf28}+0.51\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 21.9588ms 20.0028ms 49.9930 Ops/s 48.4514 Ops/s $\color{#35bf28}+3.18\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 16.9180ms 15.5322ms 64.3824 Ops/s 63.6240 Ops/s $\color{#35bf28}+1.19\%$

Copy link

github-actions bot commented Nov 9, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7562s 0.7560s 1.3228 Ops/s 1.3177 Ops/s $\color{#35bf28}+0.39\%$
test_transformed 1.1039s 1.0263s 0.9744 Ops/s 0.9992 Ops/s $\color{#d91a1a}-2.48\%$
test_serial 2.2624s 2.1893s 0.4568 Ops/s 0.4637 Ops/s $\color{#d91a1a}-1.49\%$
test_parallel 2.0644s 2.0318s 0.4922 Ops/s 0.5140 Ops/s $\color{#d91a1a}-4.24\%$
test_step_mdp_speed[True-True-True-True-True] 0.2847ms 36.1343μs 27.6745 KOps/s 27.3528 KOps/s $\color{#35bf28}+1.18\%$
test_step_mdp_speed[True-True-True-True-False] 51.1410μs 20.5280μs 48.7139 KOps/s 48.1245 KOps/s $\color{#35bf28}+1.22\%$
test_step_mdp_speed[True-True-True-False-True] 52.0520μs 20.2495μs 49.3838 KOps/s 48.9454 KOps/s $\color{#35bf28}+0.90\%$
test_step_mdp_speed[True-True-True-False-False] 38.4110μs 11.6468μs 85.8602 KOps/s 83.9790 KOps/s $\color{#35bf28}+2.24\%$
test_step_mdp_speed[True-True-False-True-True] 80.8710μs 38.7899μs 25.7799 KOps/s 25.2712 KOps/s $\color{#35bf28}+2.01\%$
test_step_mdp_speed[True-True-False-True-False] 50.2720μs 22.4536μs 44.5363 KOps/s 43.0788 KOps/s $\color{#35bf28}+3.38\%$
test_step_mdp_speed[True-True-False-False-True] 54.6220μs 22.7330μs 43.9890 KOps/s 43.5395 KOps/s $\color{#35bf28}+1.03\%$
test_step_mdp_speed[True-True-False-False-False] 41.5610μs 13.7299μs 72.8339 KOps/s 72.0670 KOps/s $\color{#35bf28}+1.06\%$
test_step_mdp_speed[True-False-True-True-True] 73.9620μs 40.7430μs 24.5441 KOps/s 24.1869 KOps/s $\color{#35bf28}+1.48\%$
test_step_mdp_speed[True-False-True-True-False] 52.9510μs 24.5523μs 40.7293 KOps/s 39.8351 KOps/s $\color{#35bf28}+2.24\%$
test_step_mdp_speed[True-False-True-False-True] 67.0510μs 22.4061μs 44.6308 KOps/s 44.7358 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[True-False-True-False-False] 42.6010μs 13.7274μs 72.8472 KOps/s 72.2091 KOps/s $\color{#35bf28}+0.88\%$
test_step_mdp_speed[True-False-False-True-True] 72.8710μs 42.9842μs 23.2643 KOps/s 23.8368 KOps/s $\color{#d91a1a}-2.40\%$
test_step_mdp_speed[True-False-False-True-False] 54.0410μs 26.7278μs 37.4142 KOps/s 37.0680 KOps/s $\color{#35bf28}+0.93\%$
test_step_mdp_speed[True-False-False-False-True] 57.4410μs 24.1777μs 41.3605 KOps/s 40.9327 KOps/s $\color{#35bf28}+1.05\%$
test_step_mdp_speed[True-False-False-False-False] 45.3310μs 15.6474μs 63.9085 KOps/s 63.1176 KOps/s $\color{#35bf28}+1.25\%$
test_step_mdp_speed[False-True-True-True-True] 83.8120μs 41.1122μs 24.3237 KOps/s 24.4999 KOps/s $\color{#d91a1a}-0.72\%$
test_step_mdp_speed[False-True-True-True-False] 55.2310μs 24.7960μs 40.3290 KOps/s 40.0594 KOps/s $\color{#35bf28}+0.67\%$
test_step_mdp_speed[False-True-True-False-True] 58.1620μs 26.2900μs 38.0373 KOps/s 37.8219 KOps/s $\color{#35bf28}+0.57\%$
test_step_mdp_speed[False-True-True-False-False] 43.6110μs 15.3185μs 65.2806 KOps/s 64.8841 KOps/s $\color{#35bf28}+0.61\%$
test_step_mdp_speed[False-True-False-True-True] 74.4720μs 42.9695μs 23.2723 KOps/s 23.8093 KOps/s $\color{#d91a1a}-2.26\%$
test_step_mdp_speed[False-True-False-True-False] 97.4120μs 26.4701μs 37.7785 KOps/s 38.5612 KOps/s $\color{#d91a1a}-2.03\%$
test_step_mdp_speed[False-True-False-False-True] 3.5388ms 29.3752μs 34.0424 KOps/s 35.7131 KOps/s $\color{#d91a1a}-4.68\%$
test_step_mdp_speed[False-True-False-False-False] 50.2910μs 18.5432μs 53.9281 KOps/s 57.5790 KOps/s $\textbf{\color{#d91a1a}-6.34\%}$
test_step_mdp_speed[False-False-True-True-True] 80.9420μs 44.7340μs 22.3544 KOps/s 22.4484 KOps/s $\color{#d91a1a}-0.42\%$
test_step_mdp_speed[False-False-True-True-False] 59.6320μs 29.2062μs 34.2392 KOps/s 34.5089 KOps/s $\color{#d91a1a}-0.78\%$
test_step_mdp_speed[False-False-True-False-True] 60.1410μs 28.5187μs 35.0648 KOps/s 35.9142 KOps/s $\color{#d91a1a}-2.37\%$
test_step_mdp_speed[False-False-True-False-False] 45.5010μs 18.2844μs 54.6916 KOps/s 56.6925 KOps/s $\color{#d91a1a}-3.53\%$
test_step_mdp_speed[False-False-False-True-True] 94.4020μs 46.3835μs 21.5594 KOps/s 21.3970 KOps/s $\color{#35bf28}+0.76\%$
test_step_mdp_speed[False-False-False-True-False] 67.8710μs 31.6560μs 31.5896 KOps/s 32.1745 KOps/s $\color{#d91a1a}-1.82\%$
test_step_mdp_speed[False-False-False-False-True] 85.2110μs 30.0717μs 33.2539 KOps/s 33.7163 KOps/s $\color{#d91a1a}-1.37\%$
test_step_mdp_speed[False-False-False-False-False] 49.1110μs 19.0021μs 52.6257 KOps/s 51.5939 KOps/s $\color{#35bf28}+2.00\%$
test_values[generalized_advantage_estimate-True-True] 27.1742ms 26.5257ms 37.6993 Ops/s 39.0675 Ops/s $\color{#d91a1a}-3.50\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1071s 3.0411ms 328.8240 Ops/s 333.5473 Ops/s $\color{#d91a1a}-1.42\%$
test_values[td0_return_estimate-False-False] 89.1320μs 67.6521μs 14.7815 KOps/s 14.5037 KOps/s $\color{#35bf28}+1.92\%$
test_values[td1_return_estimate-False-False] 60.3843ms 58.8736ms 16.9855 Ops/s 17.5753 Ops/s $\color{#d91a1a}-3.36\%$
test_values[vec_td1_return_estimate-False-False] 1.2992ms 1.0908ms 916.7873 Ops/s 913.8330 Ops/s $\color{#35bf28}+0.32\%$
test_values[td_lambda_return_estimate-True-False] 96.2262ms 94.9872ms 10.5277 Ops/s 11.0587 Ops/s $\color{#d91a1a}-4.80\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3179ms 1.0888ms 918.4586 Ops/s 908.0346 Ops/s $\color{#35bf28}+1.15\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 26.6820ms 26.3016ms 38.0205 Ops/s 39.9503 Ops/s $\color{#d91a1a}-4.83\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0581ms 0.7638ms 1.3092 KOps/s 1.3200 KOps/s $\color{#d91a1a}-0.82\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7954ms 0.6805ms 1.4695 KOps/s 1.4747 KOps/s $\color{#d91a1a}-0.35\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5605ms 1.4917ms 670.3591 Ops/s 670.2136 Ops/s $\color{#35bf28}+0.02\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7599ms 0.6937ms 1.4415 KOps/s 1.4456 KOps/s $\color{#d91a1a}-0.28\%$
test_dqn_speed[False-None] 6.8666ms 1.3740ms 727.8041 Ops/s 751.8275 Ops/s $\color{#d91a1a}-3.20\%$
test_dqn_speed[False-backward] 1.9837ms 1.9244ms 519.6494 Ops/s 531.6602 Ops/s $\color{#d91a1a}-2.26\%$
test_dqn_speed[True-None] 1.0646ms 0.5778ms 1.7308 KOps/s 1.7034 KOps/s $\color{#35bf28}+1.61\%$
test_dqn_speed[True-backward] 1.0854ms 1.0375ms 963.8427 Ops/s 948.4574 Ops/s $\color{#35bf28}+1.62\%$
test_dqn_speed[reduce-overhead-None] 0.7812ms 0.5730ms 1.7452 KOps/s 1.7129 KOps/s $\color{#35bf28}+1.89\%$
test_dqn_speed[reduce-overhead-backward] 1.0748ms 1.0293ms 971.5603 Ops/s 968.4217 Ops/s $\color{#35bf28}+0.32\%$
test_ddpg_speed[False-None] 3.0871ms 2.7660ms 361.5393 Ops/s 371.6521 Ops/s $\color{#d91a1a}-2.72\%$
test_ddpg_speed[False-backward] 4.1954ms 4.0543ms 246.6489 Ops/s 247.7061 Ops/s $\color{#d91a1a}-0.43\%$
test_ddpg_speed[True-None] 1.6320ms 1.2807ms 780.8050 Ops/s 777.0857 Ops/s $\color{#35bf28}+0.48\%$
test_ddpg_speed[True-backward] 2.3796ms 2.2904ms 436.6122 Ops/s 438.8579 Ops/s $\color{#d91a1a}-0.51\%$
test_ddpg_speed[reduce-overhead-None] 1.6195ms 1.2759ms 783.7857 Ops/s 771.5437 Ops/s $\color{#35bf28}+1.59\%$
test_ddpg_speed[reduce-overhead-backward] 2.3319ms 2.2833ms 437.9664 Ops/s 441.7559 Ops/s $\color{#d91a1a}-0.86\%$
test_sac_speed[False-None] 8.0766ms 7.6418ms 130.8585 Ops/s 130.2183 Ops/s $\color{#35bf28}+0.49\%$
test_sac_speed[False-backward] 11.4764ms 11.0050ms 90.8677 Ops/s 91.9477 Ops/s $\color{#d91a1a}-1.17\%$
test_sac_speed[True-None] 2.4414ms 2.0650ms 484.2573 Ops/s 483.8842 Ops/s $\color{#35bf28}+0.08\%$
test_sac_speed[True-backward] 4.2100ms 4.1148ms 243.0239 Ops/s 211.3322 Ops/s $\textbf{\color{#35bf28}+15.00\%}$
test_sac_speed[reduce-overhead-None] 2.2398ms 2.0794ms 480.9054 Ops/s 470.8588 Ops/s $\color{#35bf28}+2.13\%$
test_sac_speed[reduce-overhead-backward] 4.3520ms 4.1411ms 241.4798 Ops/s 243.2208 Ops/s $\color{#d91a1a}-0.72\%$
test_redq_speed[False-None] 16.3515ms 11.7008ms 85.4641 Ops/s 92.2190 Ops/s $\textbf{\color{#d91a1a}-7.32\%}$
test_redq_speed[False-backward] 19.4394ms 18.3543ms 54.4832 Ops/s 54.2002 Ops/s $\color{#35bf28}+0.52\%$
test_redq_speed[True-None] 4.0124ms 3.6944ms 270.6830 Ops/s 273.9035 Ops/s $\color{#d91a1a}-1.18\%$
test_redq_speed[True-backward] 9.4973ms 9.0390ms 110.6311 Ops/s 113.4164 Ops/s $\color{#d91a1a}-2.46\%$
test_redq_speed[reduce-overhead-None] 4.0740ms 3.6594ms 273.2714 Ops/s 269.9248 Ops/s $\color{#35bf28}+1.24\%$
test_redq_speed[reduce-overhead-backward] 9.3161ms 8.9776ms 111.3882 Ops/s 112.6163 Ops/s $\color{#d91a1a}-1.09\%$
test_redq_deprec_speed[False-None] 12.9499ms 10.9346ms 91.4531 Ops/s 92.8317 Ops/s $\color{#d91a1a}-1.49\%$
test_redq_deprec_speed[False-backward] 16.6717ms 16.0669ms 62.2396 Ops/s 63.4784 Ops/s $\color{#d91a1a}-1.95\%$
test_redq_deprec_speed[True-None] 3.5551ms 3.3026ms 302.7884 Ops/s 310.2810 Ops/s $\color{#d91a1a}-2.41\%$
test_redq_deprec_speed[True-backward] 7.7232ms 7.3820ms 135.4647 Ops/s 139.4325 Ops/s $\color{#d91a1a}-2.85\%$
test_redq_deprec_speed[reduce-overhead-None] 3.5230ms 3.3143ms 301.7256 Ops/s 311.9122 Ops/s $\color{#d91a1a}-3.27\%$
test_redq_deprec_speed[reduce-overhead-backward] 8.0863ms 7.3968ms 135.1943 Ops/s 139.7134 Ops/s $\color{#d91a1a}-3.23\%$
test_td3_speed[False-None] 7.8448ms 7.6592ms 130.5623 Ops/s 131.7482 Ops/s $\color{#d91a1a}-0.90\%$
test_td3_speed[False-backward] 11.0607ms 10.6615ms 93.7955 Ops/s 94.7731 Ops/s $\color{#d91a1a}-1.03\%$
test_td3_speed[True-None] 2.0660ms 2.0064ms 498.3960 Ops/s 511.0693 Ops/s $\color{#d91a1a}-2.48\%$
test_td3_speed[True-backward] 4.1764ms 3.9605ms 252.4941 Ops/s 256.4639 Ops/s $\color{#d91a1a}-1.55\%$
test_td3_speed[reduce-overhead-None] 2.1024ms 2.0001ms 499.9759 Ops/s 511.2465 Ops/s $\color{#d91a1a}-2.20\%$
test_td3_speed[reduce-overhead-backward] 4.0065ms 3.8526ms 259.5627 Ops/s 261.6184 Ops/s $\color{#d91a1a}-0.79\%$
test_cql_speed[False-None] 0.2889s 33.7895ms 29.5950 Ops/s 24.1892 Ops/s $\textbf{\color{#35bf28}+22.35\%}$
test_cql_speed[False-backward] 40.4852ms 37.2264ms 26.8627 Ops/s 27.8985 Ops/s $\color{#d91a1a}-3.71\%$
test_cql_speed[True-None] 12.1075ms 11.5627ms 86.4852 Ops/s 91.6550 Ops/s $\textbf{\color{#d91a1a}-5.64\%}$
test_cql_speed[True-backward] 18.2709ms 17.7868ms 56.2214 Ops/s 59.2746 Ops/s $\textbf{\color{#d91a1a}-5.15\%}$
test_cql_speed[reduce-overhead-None] 11.9694ms 11.4702ms 87.1824 Ops/s 89.8872 Ops/s $\color{#d91a1a}-3.01\%$
test_cql_speed[reduce-overhead-backward] 18.0115ms 17.3997ms 57.4724 Ops/s 59.5954 Ops/s $\color{#d91a1a}-3.56\%$
test_a2c_speed[False-None] 5.8811ms 5.5758ms 179.3473 Ops/s 181.8404 Ops/s $\color{#d91a1a}-1.37\%$
test_a2c_speed[False-backward] 12.9391ms 12.5067ms 79.9574 Ops/s 83.1823 Ops/s $\color{#d91a1a}-3.88\%$
test_a2c_speed[True-None] 3.3403ms 3.1538ms 317.0789 Ops/s 321.6842 Ops/s $\color{#d91a1a}-1.43\%$
test_a2c_speed[True-backward] 9.1309ms 8.6981ms 114.9681 Ops/s 116.1847 Ops/s $\color{#d91a1a}-1.05\%$
test_a2c_speed[reduce-overhead-None] 3.4194ms 3.1537ms 317.0885 Ops/s 323.8217 Ops/s $\color{#d91a1a}-2.08\%$
test_a2c_speed[reduce-overhead-backward] 9.2048ms 8.6709ms 115.3285 Ops/s 119.1337 Ops/s $\color{#d91a1a}-3.19\%$
test_ppo_speed[False-None] 7.8299ms 5.9099ms 169.2075 Ops/s 172.6229 Ops/s $\color{#d91a1a}-1.98\%$
test_ppo_speed[False-backward] 13.3191ms 12.8949ms 77.5502 Ops/s 80.1402 Ops/s $\color{#d91a1a}-3.23\%$
test_ppo_speed[True-None] 3.6771ms 3.5310ms 283.2076 Ops/s 285.9161 Ops/s $\color{#d91a1a}-0.95\%$
test_ppo_speed[True-backward] 8.8907ms 8.4855ms 117.8487 Ops/s 119.6775 Ops/s $\color{#d91a1a}-1.53\%$
test_ppo_speed[reduce-overhead-None] 3.7934ms 3.5750ms 279.7216 Ops/s 292.3084 Ops/s $\color{#d91a1a}-4.31\%$
test_ppo_speed[reduce-overhead-backward] 8.8047ms 8.3703ms 119.4704 Ops/s 120.9987 Ops/s $\color{#d91a1a}-1.26\%$
test_reinforce_speed[False-None] 5.0002ms 4.6183ms 216.5287 Ops/s 221.7242 Ops/s $\color{#d91a1a}-2.34\%$
test_reinforce_speed[False-backward] 7.9329ms 7.5769ms 131.9806 Ops/s 134.6687 Ops/s $\color{#d91a1a}-2.00\%$
test_reinforce_speed[True-None] 2.5179ms 2.2942ms 435.8789 Ops/s 445.7476 Ops/s $\color{#d91a1a}-2.21\%$
test_reinforce_speed[True-backward] 7.9606ms 7.2768ms 137.4222 Ops/s 141.5842 Ops/s $\color{#d91a1a}-2.94\%$
test_reinforce_speed[reduce-overhead-None] 2.4629ms 2.2931ms 436.0929 Ops/s 454.2409 Ops/s $\color{#d91a1a}-4.00\%$
test_reinforce_speed[reduce-overhead-backward] 7.4089ms 7.1892ms 139.0975 Ops/s 140.7623 Ops/s $\color{#d91a1a}-1.18\%$
test_iql_speed[False-None] 20.6577ms 19.8609ms 50.3503 Ops/s 51.1214 Ops/s $\color{#d91a1a}-1.51\%$
test_iql_speed[False-backward] 32.6663ms 31.0564ms 32.1995 Ops/s 33.0675 Ops/s $\color{#d91a1a}-2.63\%$
test_iql_speed[True-None] 7.6186ms 7.0778ms 141.2877 Ops/s 156.3781 Ops/s $\textbf{\color{#d91a1a}-9.65\%}$
test_iql_speed[True-backward] 16.8501ms 16.4603ms 60.7522 Ops/s 62.8357 Ops/s $\color{#d91a1a}-3.32\%$
test_iql_speed[reduce-overhead-None] 7.6742ms 7.2306ms 138.3014 Ops/s 145.2530 Ops/s $\color{#d91a1a}-4.79\%$
test_iql_speed[reduce-overhead-backward] 16.4143ms 15.8597ms 63.0529 Ops/s 64.1434 Ops/s $\color{#d91a1a}-1.70\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.6306ms 6.4585ms 154.8356 Ops/s 157.9634 Ops/s $\color{#d91a1a}-1.98\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.1158ms 0.3205ms 3.1197 KOps/s 3.6050 KOps/s $\textbf{\color{#d91a1a}-13.46\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6158ms 0.2964ms 3.3733 KOps/s 3.9169 KOps/s $\textbf{\color{#d91a1a}-13.88\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.5940ms 6.2118ms 160.9836 Ops/s 164.7229 Ops/s $\color{#d91a1a}-2.27\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.1470ms 0.3007ms 3.3254 KOps/s 3.3926 KOps/s $\color{#d91a1a}-1.98\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6589ms 0.3418ms 2.9259 KOps/s 4.1423 KOps/s $\textbf{\color{#d91a1a}-29.37\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.7761ms 1.3472ms 742.2986 Ops/s 771.6798 Ops/s $\color{#d91a1a}-3.81\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.4701ms 1.2820ms 780.0329 Ops/s 805.3331 Ops/s $\color{#d91a1a}-3.14\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5704ms 6.3514ms 157.4445 Ops/s 158.6047 Ops/s $\color{#d91a1a}-0.73\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.9018ms 0.4869ms 2.0536 KOps/s 2.3386 KOps/s $\textbf{\color{#d91a1a}-12.19\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.7969ms 0.4804ms 2.0816 KOps/s 2.3032 KOps/s $\textbf{\color{#d91a1a}-9.63\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.4086ms 6.2255ms 160.6309 Ops/s 162.1473 Ops/s $\color{#d91a1a}-0.94\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.0976ms 0.3792ms 2.6375 KOps/s 3.1984 KOps/s $\textbf{\color{#d91a1a}-17.54\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6968ms 0.3735ms 2.6772 KOps/s 3.1480 KOps/s $\textbf{\color{#d91a1a}-14.96\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.4959ms 6.1437ms 162.7686 Ops/s 164.1627 Ops/s $\color{#d91a1a}-0.85\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.2621ms 0.3471ms 2.8812 KOps/s 2.9750 KOps/s $\color{#d91a1a}-3.15\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6267ms 0.3088ms 3.2378 KOps/s 3.1730 KOps/s $\color{#35bf28}+2.04\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.5825ms 6.3591ms 157.2547 Ops/s 161.7051 Ops/s $\color{#d91a1a}-2.75\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 3.0697ms 0.4992ms 2.0031 KOps/s 2.3827 KOps/s $\textbf{\color{#d91a1a}-15.93\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8088ms 0.4739ms 2.1100 KOps/s 2.2734 KOps/s $\textbf{\color{#d91a1a}-7.19\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.1050ms 5.3850ms 185.7013 Ops/s 190.5215 Ops/s $\color{#d91a1a}-2.53\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.1500ms 2.3211ms 430.8296 Ops/s 449.3746 Ops/s $\color{#d91a1a}-4.13\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.5051ms 1.3344ms 749.3868 Ops/s 848.7630 Ops/s $\textbf{\color{#d91a1a}-11.71\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4780s 14.8665ms 67.2651 Ops/s 33.6508 Ops/s $\textbf{\color{#35bf28}+99.89\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 10.4097ms 2.0682ms 483.5181 Ops/s 479.7428 Ops/s $\color{#35bf28}+0.79\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.0550ms 1.2720ms 786.1713 Ops/s 775.4812 Ops/s $\color{#35bf28}+1.38\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.3443ms 5.6049ms 178.4155 Ops/s 178.5857 Ops/s $\color{#d91a1a}-0.10\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.6339ms 2.3027ms 434.2788 Ops/s 442.7890 Ops/s $\color{#d91a1a}-1.92\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 3.7604ms 1.3894ms 719.7583 Ops/s 649.7960 Ops/s $\textbf{\color{#35bf28}+10.77\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 14.0644ms 13.0536ms 76.6071 Ops/s 75.3770 Ops/s $\color{#35bf28}+1.63\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.0754ms 17.2821ms 57.8634 Ops/s 57.9448 Ops/s $\color{#d91a1a}-0.14\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.7758ms 17.9974ms 55.5637 Ops/s 55.7928 Ops/s $\color{#d91a1a}-0.41\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.0603ms 17.4569ms 57.2838 Ops/s 57.3747 Ops/s $\color{#d91a1a}-0.16\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 18.2638ms 17.8051ms 56.1637 Ops/s 56.2165 Ops/s $\color{#d91a1a}-0.09\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 19.9651ms 18.6875ms 53.5118 Ops/s 53.4381 Ops/s $\color{#35bf28}+0.14\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants