[BugFix,Doc] Revert dynamic shape in export tutorial #2563

Merged: vmoens merged 1 commit into gh/vmoens/40/base on Nov 14, 2024

@vmoens (Contributor) commented Nov 14, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 14, 2024
ghstack-source-id: fc856218e840469a5bb0143241d100e9cc612538
Pull Request resolved: #2563

pytorch-bot bot commented Nov 14, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2563

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please review it.

❌ 3 New Failures, 1 Cancelled Job, 2 Pending, 9 Unrelated Failures

As of commit 48e1a87 with merge base 304e707 (image):

NEW FAILURES - The following jobs have failed:

CANCELLED JOB - The following job was cancelled. Please retry:

FLAKY - The following job failed, likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed, but the failures were also present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) on Nov 14, 2024
@vmoens merged commit 48e1a87 into gh/vmoens/40/base on Nov 14, 2024
43 of 57 checks passed
@vmoens deleted the gh/vmoens/40/head branch on November 14, 2024 14:24
@vmoens added the bug (Something isn't working) and documentation (Improvements or additions to documentation) labels on Nov 14, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}17$. Worsened: $\large\color{#d91a1a}2$.

Name | Max | Mean | Ops | Ops on Repo HEAD | Change
test_simple 0.4199s 0.4193s 2.3852 Ops/s 2.2549 Ops/s $\textbf{\color{#35bf28}+5.78\%}$
test_transformed 0.6871s 0.6054s 1.6517 Ops/s 1.6217 Ops/s $\color{#35bf28}+1.85\%$
test_serial 1.3187s 1.3169s 0.7594 Ops/s 0.7419 Ops/s $\color{#35bf28}+2.36\%$
test_parallel 1.2660s 1.2550s 0.7968 Ops/s 0.7702 Ops/s $\color{#35bf28}+3.46\%$
test_step_mdp_speed[True-True-True-True-True] 0.2464ms 27.1116μs 36.8846 KOps/s 35.8887 KOps/s $\color{#35bf28}+2.77\%$
test_step_mdp_speed[True-True-True-True-False] 51.7030μs 15.5830μs 64.1725 KOps/s 61.2029 KOps/s $\color{#35bf28}+4.85\%$
test_step_mdp_speed[True-True-True-False-True] 59.4000μs 15.4794μs 64.6021 KOps/s 64.0755 KOps/s $\color{#35bf28}+0.82\%$
test_step_mdp_speed[True-True-True-False-False] 54.2510μs 8.9378μs 111.8842 KOps/s 109.1176 KOps/s $\color{#35bf28}+2.54\%$
test_step_mdp_speed[True-True-False-True-True] 79.8680μs 29.0752μs 34.3936 KOps/s 33.5153 KOps/s $\color{#35bf28}+2.62\%$
test_step_mdp_speed[True-True-False-True-False] 44.2020μs 17.5444μs 56.9982 KOps/s 55.8286 KOps/s $\color{#35bf28}+2.10\%$
test_step_mdp_speed[True-True-False-False-True] 46.3060μs 17.1938μs 58.1606 KOps/s 57.3023 KOps/s $\color{#35bf28}+1.50\%$
test_step_mdp_speed[True-True-False-False-False] 37.8700μs 10.6923μs 93.5251 KOps/s 92.6134 KOps/s $\color{#35bf28}+0.98\%$
test_step_mdp_speed[True-False-True-True-True] 75.0390μs 30.6957μs 32.5778 KOps/s 31.6156 KOps/s $\color{#35bf28}+3.04\%$
test_step_mdp_speed[True-False-True-True-False] 56.5440μs 19.4464μs 51.4235 KOps/s 50.0169 KOps/s $\color{#35bf28}+2.81\%$
test_step_mdp_speed[True-False-True-False-True] 55.4630μs 16.9094μs 59.1386 KOps/s 57.9196 KOps/s $\color{#35bf28}+2.10\%$
test_step_mdp_speed[True-False-True-False-False] 55.7230μs 10.6318μs 94.0570 KOps/s 92.1296 KOps/s $\color{#35bf28}+2.09\%$
test_step_mdp_speed[True-False-False-True-True] 65.7220μs 32.7207μs 30.5617 KOps/s 30.3649 KOps/s $\color{#35bf28}+0.65\%$
test_step_mdp_speed[True-False-False-True-False] 68.7780μs 21.0883μs 47.4197 KOps/s 46.5821 KOps/s $\color{#35bf28}+1.80\%$
test_step_mdp_speed[True-False-False-False-True] 58.9700μs 18.9769μs 52.6955 KOps/s 52.0446 KOps/s $\color{#35bf28}+1.25\%$
test_step_mdp_speed[True-False-False-False-False] 67.0440μs 12.1954μs 81.9982 KOps/s 79.6238 KOps/s $\color{#35bf28}+2.98\%$
test_step_mdp_speed[False-True-True-True-True] 0.6405ms 30.8427μs 32.4226 KOps/s 31.3077 KOps/s $\color{#35bf28}+3.56\%$
test_step_mdp_speed[False-True-True-True-False] 51.3150μs 19.5984μs 51.0247 KOps/s 50.0288 KOps/s $\color{#35bf28}+1.99\%$
test_step_mdp_speed[False-True-True-False-True] 2.4422ms 19.3840μs 51.5889 KOps/s 48.6967 KOps/s $\textbf{\color{#35bf28}+5.94\%}$
test_step_mdp_speed[False-True-True-False-False] 40.9760μs 11.9724μs 83.5252 KOps/s 80.9676 KOps/s $\color{#35bf28}+3.16\%$
test_step_mdp_speed[False-True-False-True-True] 91.5700μs 32.3118μs 30.9484 KOps/s 30.0581 KOps/s $\color{#35bf28}+2.96\%$
test_step_mdp_speed[False-True-False-True-False] 66.0430μs 21.0579μs 47.4880 KOps/s 46.6816 KOps/s $\color{#35bf28}+1.73\%$
test_step_mdp_speed[False-True-False-False-True] 49.9230μs 21.1289μs 47.3286 KOps/s 45.1149 KOps/s $\color{#35bf28}+4.91\%$
test_step_mdp_speed[False-True-False-False-False] 67.8650μs 13.5478μs 73.8125 KOps/s 72.1421 KOps/s $\color{#35bf28}+2.32\%$
test_step_mdp_speed[False-False-True-True-True] 84.8170μs 33.8703μs 29.5244 KOps/s 28.5150 KOps/s $\color{#35bf28}+3.54\%$
test_step_mdp_speed[False-False-True-True-False] 71.1320μs 22.7735μs 43.9107 KOps/s 43.3440 KOps/s $\color{#35bf28}+1.31\%$
test_step_mdp_speed[False-False-True-False-True] 61.1740μs 21.1260μs 47.3351 KOps/s 45.5740 KOps/s $\color{#35bf28}+3.86\%$
test_step_mdp_speed[False-False-True-False-False] 86.8010μs 13.2926μs 75.2299 KOps/s 70.5211 KOps/s $\textbf{\color{#35bf28}+6.68\%}$
test_step_mdp_speed[False-False-False-True-True] 0.1135ms 34.8047μs 28.7318 KOps/s 27.5542 KOps/s $\color{#35bf28}+4.27\%$
test_step_mdp_speed[False-False-False-True-False] 57.4760μs 23.7175μs 42.1630 KOps/s 40.7890 KOps/s $\color{#35bf28}+3.37\%$
test_step_mdp_speed[False-False-False-False-True] 51.3150μs 22.2926μs 44.8579 KOps/s 42.1645 KOps/s $\textbf{\color{#35bf28}+6.39\%}$
test_step_mdp_speed[False-False-False-False-False] 53.2780μs 14.7733μs 67.6897 KOps/s 64.3139 KOps/s $\textbf{\color{#35bf28}+5.25\%}$
test_values[generalized_advantage_estimate-True-True] 11.3565ms 9.7270ms 102.8069 Ops/s 101.3930 Ops/s $\color{#35bf28}+1.39\%$
test_values[vec_generalized_advantage_estimate-True-True] 36.6005ms 33.2905ms 30.0386 Ops/s 29.4574 Ops/s $\color{#35bf28}+1.97\%$
test_values[td0_return_estimate-False-False] 0.2416ms 0.1691ms 5.9132 KOps/s 5.3287 KOps/s $\textbf{\color{#35bf28}+10.97\%}$
test_values[td1_return_estimate-False-False] 27.0642ms 24.3087ms 41.1375 Ops/s 39.9956 Ops/s $\color{#35bf28}+2.86\%$
test_values[vec_td1_return_estimate-False-False] 36.0961ms 33.8055ms 29.5810 Ops/s 29.9641 Ops/s $\color{#d91a1a}-1.28\%$
test_values[td_lambda_return_estimate-True-False] 35.3768ms 34.5554ms 28.9390 Ops/s 28.5228 Ops/s $\color{#35bf28}+1.46\%$
test_values[vec_td_lambda_return_estimate-True-False] 37.1008ms 34.0082ms 29.4047 Ops/s 29.9906 Ops/s $\color{#d91a1a}-1.95\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 9.6608ms 8.4905ms 117.7788 Ops/s 114.9085 Ops/s $\color{#35bf28}+2.50\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 3.0829ms 1.8433ms 542.4945 Ops/s 543.0226 Ops/s $\color{#d91a1a}-0.10\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.5548ms 0.3569ms 2.8022 KOps/s 2.7912 KOps/s $\color{#35bf28}+0.39\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 44.5862ms 43.7304ms 22.8674 Ops/s 23.2446 Ops/s $\color{#d91a1a}-1.62\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.7113ms 3.0257ms 330.4971 Ops/s 330.4317 Ops/s $\color{#35bf28}+0.02\%$
test_dqn_speed[False-None] 6.0883ms 1.2888ms 775.9405 Ops/s 746.2836 Ops/s $\color{#35bf28}+3.97\%$
test_dqn_speed[False-backward] 1.8292ms 1.7556ms 569.5907 Ops/s 548.3504 Ops/s $\color{#35bf28}+3.87\%$
test_dqn_speed[True-None] 0.5940ms 0.4590ms 2.1787 KOps/s 2.1290 KOps/s $\color{#35bf28}+2.33\%$
test_dqn_speed[True-backward] 1.2786ms 0.9229ms 1.0835 KOps/s 830.8655 Ops/s $\textbf{\color{#35bf28}+30.41\%}$
test_dqn_speed[reduce-overhead-None] 0.6144ms 0.4652ms 2.1496 KOps/s 2.0994 KOps/s $\color{#35bf28}+2.39\%$
test_dqn_speed[reduce-overhead-backward] 0.9354ms 0.8782ms 1.1387 KOps/s 1.1167 KOps/s $\color{#35bf28}+1.97\%$
test_ddpg_speed[False-None] 3.5274ms 2.7156ms 368.2458 Ops/s 358.1646 Ops/s $\color{#35bf28}+2.81\%$
test_ddpg_speed[False-backward] 4.1976ms 3.8473ms 259.9237 Ops/s 244.9974 Ops/s $\textbf{\color{#35bf28}+6.09\%}$
test_ddpg_speed[True-None] 1.5517ms 1.0315ms 969.4796 Ops/s 988.7010 Ops/s $\color{#d91a1a}-1.94\%$
test_ddpg_speed[True-backward] 2.9765ms 2.0235ms 494.1928 Ops/s 462.3605 Ops/s $\textbf{\color{#35bf28}+6.88\%}$
test_ddpg_speed[reduce-overhead-None] 1.3618ms 1.0016ms 998.3896 Ops/s 985.3178 Ops/s $\color{#35bf28}+1.33\%$
test_ddpg_speed[reduce-overhead-backward] 1.9629ms 1.8914ms 528.7015 Ops/s 523.3624 Ops/s $\color{#35bf28}+1.02\%$
test_sac_speed[False-None] 8.8125ms 7.6290ms 131.0786 Ops/s 126.8855 Ops/s $\color{#35bf28}+3.30\%$
test_sac_speed[False-backward] 10.8573ms 10.3733ms 96.4014 Ops/s 94.7902 Ops/s $\color{#35bf28}+1.70\%$
test_sac_speed[True-None] 2.1201ms 1.8191ms 549.7314 Ops/s 546.1050 Ops/s $\color{#35bf28}+0.66\%$
test_sac_speed[True-backward] 3.6934ms 3.5070ms 285.1476 Ops/s 280.3147 Ops/s $\color{#35bf28}+1.72\%$
test_sac_speed[reduce-overhead-None] 2.5898ms 1.8304ms 546.3260 Ops/s 543.5981 Ops/s $\color{#35bf28}+0.50\%$
test_sac_speed[reduce-overhead-backward] 3.6676ms 3.5160ms 284.4132 Ops/s 283.1128 Ops/s $\color{#35bf28}+0.46\%$
test_redq_speed[False-None] 14.6884ms 12.6778ms 78.8779 Ops/s 78.4173 Ops/s $\color{#35bf28}+0.59\%$
test_redq_speed[False-backward] 22.4032ms 21.8088ms 45.8531 Ops/s 44.7292 Ops/s $\color{#35bf28}+2.51\%$
test_redq_speed[True-None] 6.1405ms 4.7551ms 210.2985 Ops/s 218.7460 Ops/s $\color{#d91a1a}-3.86\%$
test_redq_speed[True-backward] 13.4614ms 12.1395ms 82.3759 Ops/s 82.7680 Ops/s $\color{#d91a1a}-0.47\%$
test_redq_speed[reduce-overhead-None] 7.3620ms 4.6986ms 212.8316 Ops/s 213.1378 Ops/s $\color{#d91a1a}-0.14\%$
test_redq_speed[reduce-overhead-backward] 12.9506ms 12.1465ms 82.3281 Ops/s 83.2934 Ops/s $\color{#d91a1a}-1.16\%$
test_redq_deprec_speed[False-None] 14.5836ms 12.5296ms 79.8113 Ops/s 77.4627 Ops/s $\color{#35bf28}+3.03\%$
test_redq_deprec_speed[False-backward] 18.8858ms 18.1620ms 55.0600 Ops/s 54.2543 Ops/s $\color{#35bf28}+1.49\%$
test_redq_deprec_speed[True-None] 4.4977ms 3.5587ms 281.0001 Ops/s 279.1745 Ops/s $\color{#35bf28}+0.65\%$
test_redq_deprec_speed[True-backward] 8.4376ms 7.9898ms 125.1593 Ops/s 124.5583 Ops/s $\color{#35bf28}+0.48\%$
test_redq_deprec_speed[reduce-overhead-None] 3.9550ms 3.5329ms 283.0526 Ops/s 279.6370 Ops/s $\color{#35bf28}+1.22\%$
test_redq_deprec_speed[reduce-overhead-backward] 9.0827ms 8.3136ms 120.2854 Ops/s 116.3354 Ops/s $\color{#35bf28}+3.40\%$
test_td3_speed[False-None] 7.7478ms 7.5170ms 133.0314 Ops/s 126.6316 Ops/s $\textbf{\color{#35bf28}+5.05\%}$
test_td3_speed[False-backward] 10.2310ms 9.9155ms 100.8525 Ops/s 98.7687 Ops/s $\color{#35bf28}+2.11\%$
test_td3_speed[True-None] 1.8914ms 1.7067ms 585.9409 Ops/s 572.9610 Ops/s $\color{#35bf28}+2.27\%$
test_td3_speed[True-backward] 3.7040ms 3.4456ms 290.2211 Ops/s 274.5038 Ops/s $\textbf{\color{#35bf28}+5.73\%}$
test_td3_speed[reduce-overhead-None] 1.7955ms 1.6956ms 589.7453 Ops/s 578.1888 Ops/s $\color{#35bf28}+2.00\%$
test_td3_speed[reduce-overhead-backward] 3.4938ms 3.3203ms 301.1808 Ops/s 295.4188 Ops/s $\color{#35bf28}+1.95\%$
test_cql_speed[False-None] 37.1569ms 35.3959ms 28.2518 Ops/s 28.0054 Ops/s $\color{#35bf28}+0.88\%$
test_cql_speed[False-backward] 46.4509ms 44.8285ms 22.3073 Ops/s 21.9068 Ops/s $\color{#35bf28}+1.83\%$
test_cql_speed[True-None] 17.0760ms 15.6904ms 63.7330 Ops/s 63.6791 Ops/s $\color{#35bf28}+0.08\%$
test_cql_speed[True-backward] 25.0690ms 22.3018ms 44.8394 Ops/s 45.0213 Ops/s $\color{#d91a1a}-0.40\%$
test_cql_speed[reduce-overhead-None] 16.8935ms 15.6217ms 64.0133 Ops/s 63.3921 Ops/s $\color{#35bf28}+0.98\%$
test_cql_speed[reduce-overhead-backward] 23.3828ms 22.3274ms 44.7880 Ops/s 43.5950 Ops/s $\color{#35bf28}+2.74\%$
test_a2c_speed[False-None] 7.5114ms 6.9861ms 143.1417 Ops/s 140.7155 Ops/s $\color{#35bf28}+1.72\%$
test_a2c_speed[False-backward] 15.5191ms 14.1820ms 70.5122 Ops/s 70.2664 Ops/s $\color{#35bf28}+0.35\%$
test_a2c_speed[True-None] 3.8116ms 3.3042ms 302.6495 Ops/s 300.1007 Ops/s $\color{#35bf28}+0.85\%$
test_a2c_speed[True-backward] 10.8937ms 9.7639ms 102.4177 Ops/s 102.2552 Ops/s $\color{#35bf28}+0.16\%$
test_a2c_speed[reduce-overhead-None] 4.0135ms 3.2984ms 303.1761 Ops/s 298.6312 Ops/s $\color{#35bf28}+1.52\%$
test_a2c_speed[reduce-overhead-backward] 10.4032ms 9.6874ms 103.2269 Ops/s 102.1495 Ops/s $\color{#35bf28}+1.05\%$
test_ppo_speed[False-None] 8.1908ms 7.3067ms 136.8613 Ops/s 134.1128 Ops/s $\color{#35bf28}+2.05\%$
test_ppo_speed[False-backward] 16.1737ms 14.7950ms 67.5904 Ops/s 67.7271 Ops/s $\color{#d91a1a}-0.20\%$
test_ppo_speed[True-None] 4.1087ms 3.6977ms 270.4380 Ops/s 268.5685 Ops/s $\color{#35bf28}+0.70\%$
test_ppo_speed[True-backward] 10.2148ms 9.6054ms 104.1084 Ops/s 104.4322 Ops/s $\color{#d91a1a}-0.31\%$
test_ppo_speed[reduce-overhead-None] 4.2844ms 3.7453ms 267.0012 Ops/s 269.0911 Ops/s $\color{#d91a1a}-0.78\%$
test_ppo_speed[reduce-overhead-backward] 10.8195ms 9.7959ms 102.0836 Ops/s 102.1218 Ops/s $\color{#d91a1a}-0.04\%$
test_reinforce_speed[False-None] 8.1355ms 6.4395ms 155.2925 Ops/s 153.2332 Ops/s $\color{#35bf28}+1.34\%$
test_reinforce_speed[False-backward] 10.1685ms 9.7828ms 102.2206 Ops/s 101.4100 Ops/s $\color{#35bf28}+0.80\%$
test_reinforce_speed[True-None] 3.2592ms 2.6763ms 373.6518 Ops/s 374.6555 Ops/s $\color{#d91a1a}-0.27\%$
test_reinforce_speed[True-backward] 9.5082ms 8.6249ms 115.9428 Ops/s 115.8151 Ops/s $\color{#35bf28}+0.11\%$
test_reinforce_speed[reduce-overhead-None] 3.2643ms 2.6762ms 373.6647 Ops/s 375.4717 Ops/s $\color{#d91a1a}-0.48\%$
test_reinforce_speed[reduce-overhead-backward] 10.1776ms 8.7401ms 114.4155 Ops/s 115.5577 Ops/s $\color{#d91a1a}-0.99\%$
test_iql_speed[False-None] 33.9853ms 31.8906ms 31.3572 Ops/s 30.9520 Ops/s $\color{#35bf28}+1.31\%$
test_iql_speed[False-backward] 45.9354ms 44.5875ms 22.4278 Ops/s 21.8945 Ops/s $\color{#35bf28}+2.44\%$
test_iql_speed[True-None] 11.6546ms 10.8022ms 92.5741 Ops/s 92.9304 Ops/s $\color{#d91a1a}-0.38\%$
test_iql_speed[True-backward] 22.5561ms 21.8778ms 45.7084 Ops/s 44.6744 Ops/s $\color{#35bf28}+2.31\%$
test_iql_speed[reduce-overhead-None] 11.5843ms 10.6448ms 93.9428 Ops/s 92.9840 Ops/s $\color{#35bf28}+1.03\%$
test_iql_speed[reduce-overhead-backward] 22.7277ms 21.7030ms 46.0765 Ops/s 45.4546 Ops/s $\color{#35bf28}+1.37\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.0631ms 4.8355ms 206.8054 Ops/s 201.0441 Ops/s $\color{#35bf28}+2.87\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.3684ms 0.5378ms 1.8593 KOps/s 1.9082 KOps/s $\color{#d91a1a}-2.56\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8555ms 0.4827ms 2.0715 KOps/s 2.0271 KOps/s $\color{#35bf28}+2.19\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.5156ms 4.5660ms 219.0093 Ops/s 207.5961 Ops/s $\textbf{\color{#35bf28}+5.50\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.2668ms 0.4921ms 2.0323 KOps/s 1.9249 KOps/s $\textbf{\color{#35bf28}+5.58\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7054ms 0.4736ms 2.1113 KOps/s 2.0361 KOps/s $\color{#35bf28}+3.70\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.3457ms 1.6174ms 618.2665 Ops/s 606.7750 Ops/s $\color{#35bf28}+1.89\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.2718ms 1.5609ms 640.6740 Ops/s 620.6274 Ops/s $\color{#35bf28}+3.23\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.1118ms 4.6882ms 213.3031 Ops/s 204.8480 Ops/s $\color{#35bf28}+4.13\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.0345ms 0.6326ms 1.5807 KOps/s 1.5346 KOps/s $\color{#35bf28}+3.00\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.0384ms 0.6075ms 1.6462 KOps/s 1.6117 KOps/s $\color{#35bf28}+2.14\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.3006ms 4.6503ms 215.0413 Ops/s 214.6778 Ops/s $\color{#35bf28}+0.17\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.9563ms 0.5056ms 1.9778 KOps/s 1.9579 KOps/s $\color{#35bf28}+1.02\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8622ms 0.4887ms 2.0461 KOps/s 2.0157 KOps/s $\color{#35bf28}+1.51\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.2656ms 4.5935ms 217.7000 Ops/s 213.0482 Ops/s $\color{#35bf28}+2.18\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.8542ms 0.4970ms 2.0122 KOps/s 2.0018 KOps/s $\color{#35bf28}+0.52\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6770ms 0.4687ms 2.1335 KOps/s 2.1042 KOps/s $\color{#35bf28}+1.39\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.9506ms 4.8199ms 207.4716 Ops/s 208.4200 Ops/s $\color{#d91a1a}-0.46\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.7544ms 0.6369ms 1.5702 KOps/s 1.5344 KOps/s $\color{#35bf28}+2.33\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8317ms 0.6081ms 1.6445 KOps/s 1.5909 KOps/s $\color{#35bf28}+3.37\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.2842ms 4.0713ms 245.6216 Ops/s 249.7574 Ops/s $\color{#d91a1a}-1.66\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.5717ms 2.2911ms 436.4661 Ops/s 417.3668 Ops/s $\color{#35bf28}+4.58\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 3.9639ms 1.2664ms 789.6562 Ops/s 782.1177 Ops/s $\color{#35bf28}+0.96\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.3955s 11.8562ms 84.3441 Ops/s 38.2712 Ops/s $\textbf{\color{#35bf28}+120.39\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 4.8058ms 2.2126ms 451.9569 Ops/s 429.6504 Ops/s $\textbf{\color{#35bf28}+5.19\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.6510ms 1.3691ms 730.4294 Ops/s 776.5039 Ops/s $\textbf{\color{#d91a1a}-5.93\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 5.7868ms 4.2983ms 232.6512 Ops/s 220.2640 Ops/s $\textbf{\color{#35bf28}+5.62\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 7.1151ms 2.4283ms 411.8054 Ops/s 368.0910 Ops/s $\textbf{\color{#35bf28}+11.88\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 0.3495s 8.4544ms 118.2809 Ops/s 708.9134 Ops/s $\textbf{\color{#d91a1a}-83.32\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 11.2408ms 10.6826ms 93.6099 Ops/s 91.9076 Ops/s $\color{#35bf28}+1.85\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 14.8233ms 14.1626ms 70.6084 Ops/s 69.9708 Ops/s $\color{#35bf28}+0.91\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 21.0793ms 19.6085ms 50.9982 Ops/s 50.2314 Ops/s $\color{#35bf28}+1.53\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 16.2199ms 14.3673ms 69.6024 Ops/s 69.0303 Ops/s $\color{#35bf28}+0.83\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 20.2478ms 19.5442ms 51.1660 Ops/s 50.9824 Ops/s $\color{#35bf28}+0.36\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 16.6518ms 15.7697ms 63.4127 Ops/s 63.8043 Ops/s $\color{#d91a1a}-0.61\%$
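
For reading the table above: the Change column appears to be the relative difference between the Ops column and the Ops on Repo HEAD column, and the bolded entries line up with changes of at least 5% in magnitude (the 17 bolded gains and 2 bolded losses match the Improved and Worsened counts). The sketch below recomputes a few rows under that assumption. The helper names `to_ops`, `relative_change`, and `classify`, and the 5% threshold, are illustrative and inferred from the reported numbers, not taken from the benchmark tooling.

```python
# Illustrative sketch: helper names and the 5% threshold are assumptions
# inferred from the reported numbers, not the benchmark bot's actual code.

UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3}


def to_ops(value: float, unit: str) -> float:
    """Normalize a throughput reading to plain Ops/s."""
    return value * UNIT_SCALE[unit]


def relative_change(ops: float, ops_head: float) -> float:
    """Percent change of the PR throughput relative to repo HEAD."""
    return (ops - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a row; the threshold is an assumed cut-off for bolded rows."""
    if change_pct >= threshold:
        return "improved"
    if change_pct <= -threshold:
        return "worsened"
    return "unchanged"


# (name, PR throughput, repo HEAD throughput), copied from the CPU table.
rows = [
    ("test_simple", (2.3852, "Ops/s"), (2.2549, "Ops/s")),
    ("test_serial", (0.7594, "Ops/s"), (0.7419, "Ops/s")),
    ("test_dqn_speed[True-backward]", (1.0835, "KOps/s"), (830.8655, "Ops/s")),
    (
        "test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400]",
        (118.2809, "Ops/s"),
        (708.9134, "Ops/s"),
    ),
]

for name, pr, head in rows:
    change = relative_change(to_ops(*pr), to_ops(*head))
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Note that rows can mix units (for example KOps/s against Ops/s), so both readings need to be normalized before they are compared.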


$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}14$. Worsened: $\large\color{#d91a1a}12$.

Name | Max | Mean | Ops | Ops on Repo HEAD | Change
test_simple 0.7529s 0.7480s 1.3370 Ops/s 1.3108 Ops/s $\color{#35bf28}+2.00\%$
test_transformed 1.0087s 1.0030s 0.9970 Ops/s 1.0109 Ops/s $\color{#d91a1a}-1.38\%$
test_serial 2.1878s 2.1573s 0.4635 Ops/s 0.4653 Ops/s $\color{#d91a1a}-0.39\%$
test_parallel 2.0492s 1.9895s 0.5026 Ops/s 0.5161 Ops/s $\color{#d91a1a}-2.61\%$
test_step_mdp_speed[True-True-True-True-True] 0.2037ms 35.4788μs 28.1859 KOps/s 28.1101 KOps/s $\color{#35bf28}+0.27\%$
test_step_mdp_speed[True-True-True-True-False] 0.1574ms 20.6289μs 48.4756 KOps/s 48.0211 KOps/s $\color{#35bf28}+0.95\%$
test_step_mdp_speed[True-True-True-False-True] 47.3510μs 19.5373μs 51.1841 KOps/s 50.2160 KOps/s $\color{#35bf28}+1.93\%$
test_step_mdp_speed[True-True-True-False-False] 40.7010μs 11.4647μs 87.2242 KOps/s 86.2542 KOps/s $\color{#35bf28}+1.12\%$
test_step_mdp_speed[True-True-False-True-True] 0.1607ms 37.8954μs 26.3884 KOps/s 26.0862 KOps/s $\color{#35bf28}+1.16\%$
test_step_mdp_speed[True-True-False-True-False] 58.9810μs 22.1192μs 45.2096 KOps/s 44.4187 KOps/s $\color{#35bf28}+1.78\%$
test_step_mdp_speed[True-True-False-False-True] 69.3610μs 21.9300μs 45.5996 KOps/s 47.3922 KOps/s $\color{#d91a1a}-3.78\%$
test_step_mdp_speed[True-True-False-False-False] 40.8910μs 13.5728μs 73.6767 KOps/s 73.7846 KOps/s $\color{#d91a1a}-0.15\%$
test_step_mdp_speed[True-False-True-True-True] 79.1520μs 39.6957μs 25.1917 KOps/s 25.2490 KOps/s $\color{#d91a1a}-0.23\%$
test_step_mdp_speed[True-False-True-True-False] 50.8710μs 24.5488μs 40.7351 KOps/s 41.1416 KOps/s $\color{#d91a1a}-0.99\%$
test_step_mdp_speed[True-False-True-False-True] 54.8910μs 22.0625μs 45.3259 KOps/s 47.9638 KOps/s $\textbf{\color{#d91a1a}-5.50\%}$
test_step_mdp_speed[True-False-True-False-False] 42.2210μs 13.6067μs 73.4931 KOps/s 73.2365 KOps/s $\color{#35bf28}+0.35\%$
test_step_mdp_speed[True-False-False-True-True] 77.6510μs 41.6287μs 24.0219 KOps/s 23.9812 KOps/s $\color{#35bf28}+0.17\%$
test_step_mdp_speed[True-False-False-True-False] 57.7710μs 26.5139μs 37.7160 KOps/s 37.7398 KOps/s $\color{#d91a1a}-0.06\%$
test_step_mdp_speed[True-False-False-False-True] 54.0210μs 24.0270μs 41.6199 KOps/s 42.2362 KOps/s $\color{#d91a1a}-1.46\%$
test_step_mdp_speed[True-False-False-False-False] 0.1279ms 15.5509μs 64.3051 KOps/s 64.8746 KOps/s $\color{#d91a1a}-0.88\%$
test_step_mdp_speed[False-True-True-True-True] 0.1049ms 39.6109μs 25.2456 KOps/s 25.4710 KOps/s $\color{#d91a1a}-0.88\%$
test_step_mdp_speed[False-True-True-True-False] 61.9310μs 24.4242μs 40.9430 KOps/s 40.9561 KOps/s $\color{#d91a1a}-0.03\%$
test_step_mdp_speed[False-True-True-False-True] 53.9610μs 25.1329μs 39.7886 KOps/s 39.4291 KOps/s $\color{#35bf28}+0.91\%$
test_step_mdp_speed[False-True-True-False-False] 40.8400μs 15.2909μs 65.3985 KOps/s 65.6202 KOps/s $\color{#d91a1a}-0.34\%$
test_step_mdp_speed[False-True-False-True-True] 78.7410μs 41.7011μs 23.9802 KOps/s 23.9403 KOps/s $\color{#35bf28}+0.17\%$
test_step_mdp_speed[False-True-False-True-False] 67.2510μs 26.3873μs 37.8971 KOps/s 37.7327 KOps/s $\color{#35bf28}+0.44\%$
test_step_mdp_speed[False-True-False-False-True] 3.7917ms 27.9442μs 35.7856 KOps/s 36.1919 KOps/s $\color{#d91a1a}-1.12\%$
test_step_mdp_speed[False-True-False-False-False] 48.2710μs 17.3329μs 57.6937 KOps/s 58.3763 KOps/s $\color{#d91a1a}-1.17\%$
test_step_mdp_speed[False-False-True-True-True] 77.1620μs 43.6463μs 22.9114 KOps/s 22.8240 KOps/s $\color{#35bf28}+0.38\%$
test_step_mdp_speed[False-False-True-True-False] 0.1042ms 28.5020μs 35.0852 KOps/s 35.0372 KOps/s $\color{#35bf28}+0.14\%$
test_step_mdp_speed[False-False-True-False-True] 61.0410μs 27.2835μs 36.6522 KOps/s 36.3134 KOps/s $\color{#35bf28}+0.93\%$
test_step_mdp_speed[False-False-True-False-False] 45.8410μs 17.5472μs 56.9891 KOps/s 58.6045 KOps/s $\color{#d91a1a}-2.76\%$
test_step_mdp_speed[False-False-False-True-True] 74.5710μs 45.2976μs 22.0762 KOps/s 21.9911 KOps/s $\color{#35bf28}+0.39\%$
test_step_mdp_speed[False-False-False-True-False] 65.6910μs 30.2516μs 33.0561 KOps/s 32.8940 KOps/s $\color{#35bf28}+0.49\%$
test_step_mdp_speed[False-False-False-False-True] 63.4610μs 29.1919μs 34.2561 KOps/s 34.6980 KOps/s $\color{#d91a1a}-1.27\%$
test_step_mdp_speed[False-False-False-False-False] 0.1601ms 19.1251μs 52.2872 KOps/s 53.1380 KOps/s $\color{#d91a1a}-1.60\%$
test_values[generalized_advantage_estimate-True-True] 25.5107ms 25.2582ms 39.5911 Ops/s 39.8530 Ops/s $\color{#d91a1a}-0.66\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1148s 3.1937ms 313.1140 Ops/s 333.9887 Ops/s $\textbf{\color{#d91a1a}-6.25\%}$
test_values[td0_return_estimate-False-False] 90.9010μs 67.6263μs 14.7871 KOps/s 15.0990 KOps/s $\color{#d91a1a}-2.07\%$
test_values[td1_return_estimate-False-False] 57.0050ms 56.3506ms 17.7460 Ops/s 17.9218 Ops/s $\color{#d91a1a}-0.98\%$
test_values[vec_td1_return_estimate-False-False] 1.3659ms 1.0863ms 920.5815 Ops/s 927.9232 Ops/s $\color{#d91a1a}-0.79\%$
test_values[td_lambda_return_estimate-True-False] 90.6164ms 89.4186ms 11.1834 Ops/s 11.3154 Ops/s $\color{#d91a1a}-1.17\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.4260ms 1.0894ms 917.9281 Ops/s 927.0452 Ops/s $\color{#d91a1a}-0.98\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 25.0415ms 24.5521ms 40.7298 Ops/s 40.6742 Ops/s $\color{#35bf28}+0.14\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0340ms 0.7549ms 1.3247 KOps/s 1.3531 KOps/s $\color{#d91a1a}-2.10\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.8064ms 0.6664ms 1.5006 KOps/s 1.5153 KOps/s $\color{#d91a1a}-0.97\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.6208ms 1.4780ms 676.6126 Ops/s 679.1427 Ops/s $\color{#d91a1a}-0.37\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.8655ms 0.6979ms 1.4328 KOps/s 1.4761 KOps/s $\color{#d91a1a}-2.94\%$
test_dqn_speed[False-None] 0.1056s 1.4964ms 668.2796 Ops/s 760.1201 Ops/s $\textbf{\color{#d91a1a}-12.08\%}$
test_dqn_speed[False-backward] 1.9220ms 1.8685ms 535.1964 Ops/s 533.6075 Ops/s $\color{#35bf28}+0.30\%$
test_dqn_speed[True-None] 0.8124ms 0.5593ms 1.7879 KOps/s 1.7800 KOps/s $\color{#35bf28}+0.45\%$
test_dqn_speed[True-backward] 1.0654ms 1.0228ms 977.6840 Ops/s 980.7910 Ops/s $\color{#d91a1a}-0.32\%$
test_dqn_speed[reduce-overhead-None] 0.7524ms 0.5686ms 1.7587 KOps/s 1.7475 KOps/s $\color{#35bf28}+0.64\%$
test_dqn_speed[reduce-overhead-backward] 1.0680ms 1.0265ms 974.1701 Ops/s 985.4637 Ops/s $\color{#d91a1a}-1.15\%$
test_ddpg_speed[False-None] 3.2524ms 2.7123ms 368.6946 Ops/s 375.5195 Ops/s $\color{#d91a1a}-1.82\%$
test_ddpg_speed[False-backward] 4.1000ms 3.9724ms 251.7353 Ops/s 254.1004 Ops/s $\color{#d91a1a}-0.93\%$
test_ddpg_speed[True-None] 1.6061ms 1.2510ms 799.3543 Ops/s 803.9482 Ops/s $\color{#d91a1a}-0.57\%$
test_ddpg_speed[True-backward] 2.3886ms 2.2722ms 440.1031 Ops/s 450.2726 Ops/s $\color{#d91a1a}-2.26\%$
test_ddpg_speed[reduce-overhead-None] 1.6301ms 1.2592ms 794.1326 Ops/s 802.8361 Ops/s $\color{#d91a1a}-1.08\%$
test_ddpg_speed[reduce-overhead-backward] 2.4100ms 2.2586ms 442.7428 Ops/s 452.8773 Ops/s $\color{#d91a1a}-2.24\%$
test_sac_speed[False-None] 8.3802ms 7.6206ms 131.2239 Ops/s 131.4206 Ops/s $\color{#d91a1a}-0.15\%$
test_sac_speed[False-backward] 11.6961ms 11.0530ms 90.4734 Ops/s 87.2270 Ops/s $\color{#35bf28}+3.72\%$
test_sac_speed[True-None] 2.3795ms 2.0159ms 496.0465 Ops/s 500.5112 Ops/s $\color{#d91a1a}-0.89\%$
test_sac_speed[True-backward] 4.0658ms 3.9712ms 251.8142 Ops/s 254.9959 Ops/s $\color{#d91a1a}-1.25\%$
test_sac_speed[reduce-overhead-None] 2.6946ms 2.0230ms 494.3066 Ops/s 501.2085 Ops/s $\color{#d91a1a}-1.38\%$
test_sac_speed[reduce-overhead-backward] 5.3308ms 4.0513ms 246.8333 Ops/s 253.5680 Ops/s $\color{#d91a1a}-2.66\%$
test_redq_speed[False-None] 15.5471ms 10.2944ms 97.1401 Ops/s 90.8715 Ops/s $\textbf{\color{#35bf28}+6.90\%}$
test_redq_speed[False-backward] 18.4735ms 17.4773ms 57.2171 Ops/s 57.0461 Ops/s $\color{#35bf28}+0.30\%$
test_redq_speed[True-None] 3.8419ms 3.5805ms 279.2915 Ops/s 280.4866 Ops/s $\color{#d91a1a}-0.43\%$
test_redq_speed[True-backward] 9.0907ms 8.7921ms 113.7390 Ops/s 109.2679 Ops/s $\color{#35bf28}+4.09\%$
test_redq_speed[reduce-overhead-None] 3.9880ms 3.5722ms 279.9358 Ops/s 288.6468 Ops/s $\color{#d91a1a}-3.02\%$
test_redq_speed[reduce-overhead-backward] 9.8678ms 9.0006ms 111.1037 Ops/s 116.3622 Ops/s $\color{#d91a1a}-4.52\%$
test_redq_deprec_speed[False-None] 15.7967ms 10.7073ms 93.3939 Ops/s 93.5167 Ops/s $\color{#d91a1a}-0.13\%$
test_redq_deprec_speed[False-backward] 16.0853ms 15.5292ms 64.3950 Ops/s 64.3039 Ops/s $\color{#35bf28}+0.14\%$
test_redq_deprec_speed[True-None] 3.6362ms 3.2554ms 307.1791 Ops/s 314.6379 Ops/s $\color{#d91a1a}-2.37\%$
test_redq_deprec_speed[True-backward] 7.3842ms 7.1546ms 139.7711 Ops/s 144.3505 Ops/s $\color{#d91a1a}-3.17\%$
test_redq_deprec_speed[reduce-overhead-None] 3.6891ms 3.2858ms 304.3371 Ops/s 316.2490 Ops/s $\color{#d91a1a}-3.77\%$
test_redq_deprec_speed[reduce-overhead-backward] 7.7771ms 7.3360ms 136.3140 Ops/s 144.3350 Ops/s $\textbf{\color{#d91a1a}-5.56\%}$
test_td3_speed[False-None] 7.7744ms 7.5959ms 131.6498 Ops/s 131.5159 Ops/s $\color{#35bf28}+0.10\%$
test_td3_speed[False-backward] 10.9637ms 10.5700ms 94.6070 Ops/s 94.9970 Ops/s $\color{#d91a1a}-0.41\%$
test_td3_speed[True-None] 1.9635ms 1.9237ms 519.8339 Ops/s 515.0354 Ops/s $\color{#35bf28}+0.93\%$
test_td3_speed[True-backward] 3.8335ms 3.7307ms 268.0454 Ops/s 254.3977 Ops/s $\textbf{\color{#35bf28}+5.36\%}$
test_td3_speed[reduce-overhead-None] 1.9856ms 1.9177ms 521.4700 Ops/s 531.7949 Ops/s $\color{#d91a1a}-1.94\%$
test_td3_speed[reduce-overhead-backward] 3.8831ms 3.7582ms 266.0828 Ops/s 274.8202 Ops/s $\color{#d91a1a}-3.18\%$
test_cql_speed[False-None] 28.2429ms 25.4441ms 39.3018 Ops/s 39.6731 Ops/s $\color{#d91a1a}-0.94\%$
test_cql_speed[False-backward] 38.8495ms 35.2934ms 28.3339 Ops/s 28.6019 Ops/s $\color{#d91a1a}-0.94\%$
test_cql_speed[True-None] 11.3222ms 10.8771ms 91.9361 Ops/s 93.4056 Ops/s $\color{#d91a1a}-1.57\%$
test_cql_speed[True-backward] 17.7169ms 16.9131ms 59.1257 Ops/s 61.0739 Ops/s $\color{#d91a1a}-3.19\%$
test_cql_speed[reduce-overhead-None] 11.2770ms 10.9645ms 91.2037 Ops/s 93.9462 Ops/s $\color{#d91a1a}-2.92\%$
test_cql_speed[reduce-overhead-backward] 17.4261ms 16.9084ms 59.1421 Ops/s 62.0041 Ops/s $\color{#d91a1a}-4.62\%$
test_a2c_speed[False-None] 5.6352ms 5.3920ms 185.4592 Ops/s 186.0449 Ops/s $\color{#d91a1a}-0.31\%$
test_a2c_speed[False-backward] 12.5105ms 11.9832ms 83.4505 Ops/s 84.4254 Ops/s $\color{#d91a1a}-1.15\%$
test_a2c_speed[True-None] 3.4433ms 3.0510ms 327.7636 Ops/s 325.7138 Ops/s $\color{#35bf28}+0.63\%$
test_a2c_speed[True-backward] 8.9087ms 8.4819ms 117.8984 Ops/s 120.1443 Ops/s $\color{#d91a1a}-1.87\%$
test_a2c_speed[reduce-overhead-None] 3.4663ms 3.0572ms 327.0944 Ops/s 326.2772 Ops/s $\color{#35bf28}+0.25\%$
test_a2c_speed[reduce-overhead-backward] 8.7293ms 8.4384ms 118.5065 Ops/s 119.1266 Ops/s $\color{#d91a1a}-0.52\%$
test_ppo_speed[False-None] 7.1722ms 5.7795ms 173.0266 Ops/s 173.9717 Ops/s $\color{#d91a1a}-0.54\%$
test_ppo_speed[False-backward] 12.9993ms 12.5737ms 79.5312 Ops/s 79.9371 Ops/s $\color{#d91a1a}-0.51\%$
test_ppo_speed[True-None] 3.5900ms 3.3987ms 294.2336 Ops/s 295.7701 Ops/s $\color{#d91a1a}-0.52\%$
test_ppo_speed[True-backward] 8.5859ms 8.2719ms 120.8906 Ops/s 121.9885 Ops/s $\color{#d91a1a}-0.90\%$
test_ppo_speed[reduce-overhead-None] 3.6097ms 3.3982ms 294.2752 Ops/s 294.2818 Ops/s $-0.00\%$
test_ppo_speed[reduce-overhead-backward] 8.5882ms 8.2087ms 121.8212 Ops/s 123.8142 Ops/s $\color{#d91a1a}-1.61\%$
test_reinforce_speed[False-None] 4.9315ms 4.4796ms 223.2348 Ops/s 224.4766 Ops/s $\color{#d91a1a}-0.55\%$
test_reinforce_speed[False-backward] 7.7372ms 7.3414ms 136.2133 Ops/s 136.8909 Ops/s $\color{#d91a1a}-0.49\%$
test_reinforce_speed[True-None] 2.5834ms 2.2228ms 449.8815 Ops/s 445.6275 Ops/s $\color{#35bf28}+0.95\%$
test_reinforce_speed[True-backward] 7.4450ms 7.1408ms 140.0401 Ops/s 141.3528 Ops/s $\color{#d91a1a}-0.93\%$
test_reinforce_speed[reduce-overhead-None] 2.9287ms 2.2664ms 441.2340 Ops/s 453.0445 Ops/s $\color{#d91a1a}-2.61\%$
test_reinforce_speed[reduce-overhead-backward] 7.3277ms 7.0682ms 141.4795 Ops/s 142.7493 Ops/s $\color{#d91a1a}-0.89\%$
test_iql_speed[False-None] 19.8982ms 19.2741ms 51.8831 Ops/s 50.1746 Ops/s $\color{#35bf28}+3.41\%$
test_iql_speed[False-backward] 31.6250ms 30.1642ms 33.1519 Ops/s 33.0267 Ops/s $\color{#35bf28}+0.38\%$
test_iql_speed[True-None] 7.1391ms 6.7411ms 148.3439 Ops/s 148.0461 Ops/s $\color{#35bf28}+0.20\%$
test_iql_speed[True-backward] 15.9071ms 15.5451ms 64.3291 Ops/s 63.7615 Ops/s $\color{#35bf28}+0.89\%$
test_iql_speed[reduce-overhead-None] 7.1253ms 6.7839ms 147.4070 Ops/s 147.0548 Ops/s $\color{#35bf28}+0.24\%$
test_iql_speed[reduce-overhead-backward] 16.0620ms 15.5600ms 64.2674 Ops/s 66.2895 Ops/s $\color{#d91a1a}-3.05\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.9798ms 6.2828ms 159.1644 Ops/s 157.2114 Ops/s $\color{#35bf28}+1.24\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.6169ms 0.3975ms 2.5160 KOps/s 2.7458 KOps/s $\textbf{\color{#d91a1a}-8.37\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5925ms 0.2880ms 3.4728 KOps/s 3.0114 KOps/s $\textbf{\color{#35bf28}+15.32\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3875ms 6.0745ms 164.6231 Ops/s 162.9016 Ops/s $\color{#35bf28}+1.06\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.7997ms 0.3033ms 3.2975 KOps/s 2.9613 KOps/s $\textbf{\color{#35bf28}+11.35\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.6650ms 0.3129ms 3.1958 KOps/s 4.1379 KOps/s $\textbf{\color{#d91a1a}-22.77\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.6755ms 1.4538ms 687.8686 Ops/s 745.1159 Ops/s $\textbf{\color{#d91a1a}-7.68\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6575ms 1.4047ms 711.9110 Ops/s 829.8666 Ops/s $\textbf{\color{#d91a1a}-14.21\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.2877ms 6.1062ms 163.7670 Ops/s 160.1906 Ops/s $\color{#35bf28}+2.23\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.9090ms 0.4541ms 2.2023 KOps/s 1.9865 KOps/s $\textbf{\color{#35bf28}+10.86\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6820ms 0.4566ms 2.1901 KOps/s 2.5135 KOps/s $\textbf{\color{#d91a1a}-12.87\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.1859ms 5.9927ms 166.8695 Ops/s 164.9707 Ops/s $\color{#35bf28}+1.15\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.2833ms 0.2946ms 3.3939 KOps/s 2.9276 KOps/s $\textbf{\color{#35bf28}+15.93\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.4459ms 0.2945ms 3.3959 KOps/s 3.5598 KOps/s $\color{#d91a1a}-4.60\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.2706ms 5.9928ms 166.8669 Ops/s 166.4062 Ops/s $\color{#35bf28}+0.28\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.9307ms 0.2675ms 3.7383 KOps/s 2.9634 KOps/s $\textbf{\color{#35bf28}+26.15\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.4368ms 0.2397ms 4.1711 KOps/s 2.8977 KOps/s $\textbf{\color{#35bf28}+43.94\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3540ms 6.1986ms 161.3271 Ops/s 160.9160 Ops/s $\color{#35bf28}+0.26\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.9310ms 0.4230ms 2.3642 KOps/s 2.0328 KOps/s $\textbf{\color{#35bf28}+16.31\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6141ms 0.3981ms 2.5119 KOps/s 2.2477 KOps/s $\textbf{\color{#35bf28}+11.75\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.8529ms 5.2706ms 189.7301 Ops/s 191.8163 Ops/s $\color{#d91a1a}-1.09\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.8136ms 2.2513ms 444.1902 Ops/s 440.0063 Ops/s $\color{#35bf28}+0.95\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 3.3026ms 1.1068ms 903.4955 Ops/s 860.4386 Ops/s $\textbf{\color{#35bf28}+5.00\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.3976s 13.2356ms 75.5536 Ops/s 192.6818 Ops/s $\textbf{\color{#d91a1a}-60.79\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 3.8468ms 1.6371ms 610.8309 Ops/s 436.3746 Ops/s $\textbf{\color{#35bf28}+39.98\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 2.1290ms 1.0679ms 936.4611 Ops/s 870.6378 Ops/s $\textbf{\color{#35bf28}+7.56\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 8.5122ms 5.5431ms 180.4052 Ops/s 35.1884 Ops/s $\textbf{\color{#35bf28}+412.68\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 8.5785ms 2.2853ms 437.5729 Ops/s 462.3824 Ops/s $\textbf{\color{#d91a1a}-5.37\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 8.2118ms 1.4918ms 670.3249 Ops/s 731.3024 Ops/s $\textbf{\color{#d91a1a}-8.34\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.5047ms 12.8684ms 77.7095 Ops/s 74.6305 Ops/s $\color{#35bf28}+4.13\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.8574ms 17.2272ms 58.0479 Ops/s 60.0375 Ops/s $\color{#d91a1a}-3.31\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.1326ms 17.6491ms 56.6601 Ops/s 55.8075 Ops/s $\color{#35bf28}+1.53\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 17.7194ms 17.0130ms 58.7786 Ops/s 59.5539 Ops/s $\color{#d91a1a}-1.30\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 19.8925ms 17.7895ms 56.2130 Ops/s 55.9101 Ops/s $\color{#35bf28}+0.54\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 19.0951ms 18.3795ms 54.4084 Ops/s 54.9157 Ops/s $\color{#d91a1a}-0.92\%$

Labels
bug: Something isn't working
CLA Signed: This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
documentation: Improvements or additions to documentation
2 participants