Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Fix libs workflows #2800

Open
wants to merge 24 commits into
base: main
Choose a base branch
from
Open

[CI] Fix libs workflows #2800

wants to merge 24 commits into from

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Feb 20, 2025

No description provided.

Copy link

pytorch-bot bot commented Feb 20, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2800

Note: Links to docs will display an error until the docs builds have been completed.

❌ 6 New Failures, 4 Unrelated Failures

As of commit ac64cae with merge base 3acf491 (image):

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 20, 2025
@vmoens vmoens added CI Has to do with CI setup (e.g. wheels & builds, tests...) Environments Adds or modifies an environment wrapper Data Data-related PR, will launch data-related jobs and removed CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. labels Feb 20, 2025
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 20, 2025
Copy link

github-actions bot commented Feb 20, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}9$. Worsened: $\large\color{#d91a1a}10$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.6097s 0.5172s 1.9335 Ops/s 2.0168 Ops/s $\color{#d91a1a}-4.13\%$
test_transformed 1.0917s 1.0097s 0.9904 Ops/s 1.0049 Ops/s $\color{#d91a1a}-1.44\%$
test_serial 1.6209s 1.5084s 0.6630 Ops/s 0.6730 Ops/s $\color{#d91a1a}-1.49\%$
test_parallel 1.3871s 1.2884s 0.7762 Ops/s 0.7660 Ops/s $\color{#35bf28}+1.33\%$
test_step_mdp_speed[True-True-True-True-True] 0.2421ms 30.3512μs 32.9477 KOps/s 33.4469 KOps/s $\color{#d91a1a}-1.49\%$
test_step_mdp_speed[True-True-True-True-False] 45.9060μs 18.1896μs 54.9765 KOps/s 57.0326 KOps/s $\color{#d91a1a}-3.61\%$
test_step_mdp_speed[True-True-True-False-True] 0.7368ms 17.3104μs 57.7687 KOps/s 58.9196 KOps/s $\color{#d91a1a}-1.95\%$
test_step_mdp_speed[True-True-True-False-False] 35.4760μs 10.0778μs 99.2283 KOps/s 102.0450 KOps/s $\color{#d91a1a}-2.76\%$
test_step_mdp_speed[True-True-False-True-True] 73.1370μs 32.4167μs 30.8483 KOps/s 31.2624 KOps/s $\color{#d91a1a}-1.32\%$
test_step_mdp_speed[True-True-False-True-False] 50.5850μs 19.8735μs 50.3183 KOps/s 50.9943 KOps/s $\color{#d91a1a}-1.33\%$
test_step_mdp_speed[True-True-False-False-True] 48.2600μs 19.2873μs 51.8477 KOps/s 53.6309 KOps/s $\color{#d91a1a}-3.33\%$
test_step_mdp_speed[True-True-False-False-False] 42.1490μs 12.0603μs 82.9164 KOps/s 83.5986 KOps/s $\color{#d91a1a}-0.82\%$
test_step_mdp_speed[True-False-True-True-True] 76.4830μs 35.0975μs 28.4920 KOps/s 29.6927 KOps/s $\color{#d91a1a}-4.04\%$
test_step_mdp_speed[True-False-True-True-False] 73.5200μs 21.7208μs 46.0389 KOps/s 46.8733 KOps/s $\color{#d91a1a}-1.78\%$
test_step_mdp_speed[True-False-True-False-True] 53.7800μs 19.1321μs 52.2681 KOps/s 53.6634 KOps/s $\color{#d91a1a}-2.60\%$
test_step_mdp_speed[True-False-True-False-False] 39.8250μs 12.0936μs 82.6887 KOps/s 85.9455 KOps/s $\color{#d91a1a}-3.79\%$
test_step_mdp_speed[True-False-False-True-True] 73.0460μs 36.0156μs 27.7657 KOps/s 28.1790 KOps/s $\color{#d91a1a}-1.47\%$
test_step_mdp_speed[True-False-False-True-False] 52.8890μs 23.2618μs 42.9890 KOps/s 43.1334 KOps/s $\color{#d91a1a}-0.33\%$
test_step_mdp_speed[True-False-False-False-True] 50.7950μs 20.9840μs 47.6553 KOps/s 48.7103 KOps/s $\color{#d91a1a}-2.17\%$
test_step_mdp_speed[True-False-False-False-False] 46.7370μs 13.8321μs 72.2958 KOps/s 74.8308 KOps/s $\color{#d91a1a}-3.39\%$
test_step_mdp_speed[False-True-True-True-True] 68.2380μs 34.4848μs 28.9983 KOps/s 29.5526 KOps/s $\color{#d91a1a}-1.88\%$
test_step_mdp_speed[False-True-True-True-False] 51.5660μs 21.6996μs 46.0838 KOps/s 46.7487 KOps/s $\color{#d91a1a}-1.42\%$
test_step_mdp_speed[False-True-True-False-True] 52.2880μs 21.7507μs 45.9754 KOps/s 46.9446 KOps/s $\color{#d91a1a}-2.06\%$
test_step_mdp_speed[False-True-True-False-False] 0.6108ms 13.3589μs 74.8566 KOps/s 76.6405 KOps/s $\color{#d91a1a}-2.33\%$
test_step_mdp_speed[False-True-False-True-True] 77.9050μs 36.2087μs 27.6177 KOps/s 28.1780 KOps/s $\color{#d91a1a}-1.99\%$
test_step_mdp_speed[False-True-False-True-False] 56.0340μs 23.5220μs 42.5135 KOps/s 43.1327 KOps/s $\color{#d91a1a}-1.44\%$
test_step_mdp_speed[False-True-False-False-True] 2.4995ms 23.5245μs 42.5089 KOps/s 42.9755 KOps/s $\color{#d91a1a}-1.09\%$
test_step_mdp_speed[False-True-False-False-False] 38.1810μs 14.9979μs 66.6758 KOps/s 67.8215 KOps/s $\color{#d91a1a}-1.69\%$
test_step_mdp_speed[False-False-True-True-True] 74.1380μs 37.9496μs 26.3507 KOps/s 27.0173 KOps/s $\color{#d91a1a}-2.47\%$
test_step_mdp_speed[False-False-True-True-False] 53.7900μs 25.2604μs 39.5877 KOps/s 40.1478 KOps/s $\color{#d91a1a}-1.39\%$
test_step_mdp_speed[False-False-True-False-True] 61.3150μs 23.5191μs 42.5186 KOps/s 44.2191 KOps/s $\color{#d91a1a}-3.85\%$
test_step_mdp_speed[False-False-True-False-False] 39.3540μs 15.0460μs 66.4627 KOps/s 68.0632 KOps/s $\color{#d91a1a}-2.35\%$
test_step_mdp_speed[False-False-False-True-True] 0.1030ms 39.4289μs 25.3621 KOps/s 26.0135 KOps/s $\color{#d91a1a}-2.50\%$
test_step_mdp_speed[False-False-False-True-False] 58.5100μs 26.6856μs 37.4734 KOps/s 37.7416 KOps/s $\color{#d91a1a}-0.71\%$
test_step_mdp_speed[False-False-False-False-True] 67.5570μs 24.7903μs 40.3383 KOps/s 40.8458 KOps/s $\color{#d91a1a}-1.24\%$
test_step_mdp_speed[False-False-False-False-False] 40.7070μs 16.5439μs 60.4451 KOps/s 61.2463 KOps/s $\color{#d91a1a}-1.31\%$
test_values[generalized_advantage_estimate-True-True] 13.0822ms 9.9031ms 100.9787 Ops/s 99.2132 Ops/s $\color{#35bf28}+1.78\%$
test_values[vec_generalized_advantage_estimate-True-True] 28.3415ms 26.6597ms 37.5097 Ops/s 41.0914 Ops/s $\textbf{\color{#d91a1a}-8.72\%}$
test_values[td0_return_estimate-False-False] 0.2490ms 0.1762ms 5.6766 KOps/s 5.5095 KOps/s $\color{#35bf28}+3.03\%$
test_values[td1_return_estimate-False-False] 27.7189ms 24.4476ms 40.9038 Ops/s 39.6720 Ops/s $\color{#35bf28}+3.10\%$
test_values[vec_td1_return_estimate-False-False] 29.1227ms 26.9488ms 37.1074 Ops/s 41.4664 Ops/s $\textbf{\color{#d91a1a}-10.51\%}$
test_values[td_lambda_return_estimate-True-False] 39.2649ms 35.1606ms 28.4409 Ops/s 28.7105 Ops/s $\color{#d91a1a}-0.94\%$
test_values[vec_td_lambda_return_estimate-True-False] 29.6132ms 26.8811ms 37.2009 Ops/s 41.4760 Ops/s $\textbf{\color{#d91a1a}-10.31\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 8.6977ms 8.5382ms 117.1214 Ops/s 117.1753 Ops/s $\color{#d91a1a}-0.05\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.6605ms 1.9688ms 507.9185 Ops/s 503.4531 Ops/s $\color{#35bf28}+0.89\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.6478ms 0.3738ms 2.6755 KOps/s 2.7109 KOps/s $\color{#d91a1a}-1.30\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 46.9483ms 45.0792ms 22.1832 Ops/s 25.7700 Ops/s $\textbf{\color{#d91a1a}-13.92\%}$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 4.2675ms 3.4382ms 290.8512 Ops/s 291.5212 Ops/s $\color{#d91a1a}-0.23\%$
test_dqn_speed[False-None] 5.6352ms 1.3847ms 722.1625 Ops/s 718.0433 Ops/s $\color{#35bf28}+0.57\%$
test_dqn_speed[False-backward] 1.9466ms 1.8628ms 536.8265 Ops/s 523.0430 Ops/s $\color{#35bf28}+2.64\%$
test_dqn_speed[True-None] 0.7520ms 0.4717ms 2.1200 KOps/s 2.0647 KOps/s $\color{#35bf28}+2.68\%$
test_dqn_speed[True-backward] 0.9696ms 0.8872ms 1.1272 KOps/s 857.1828 Ops/s $\textbf{\color{#35bf28}+31.50\%}$
test_dqn_speed[reduce-overhead-None] 0.7012ms 0.4718ms 2.1196 KOps/s 2.0767 KOps/s $\color{#35bf28}+2.06\%$
test_dqn_speed[reduce-overhead-backward] 1.0157ms 0.8884ms 1.1257 KOps/s 984.5784 Ops/s $\textbf{\color{#35bf28}+14.33\%}$
test_ddpg_speed[False-None] 3.5936ms 2.8604ms 349.6017 Ops/s 339.5467 Ops/s $\color{#35bf28}+2.96\%$
test_ddpg_speed[False-backward] 4.1029ms 3.9830ms 251.0651 Ops/s 243.1994 Ops/s $\color{#35bf28}+3.23\%$
test_ddpg_speed[True-None] 1.9772ms 1.2077ms 827.9922 Ops/s 811.4089 Ops/s $\color{#35bf28}+2.04\%$
test_ddpg_speed[True-backward] 2.1582ms 2.0969ms 476.8921 Ops/s 457.3346 Ops/s $\color{#35bf28}+4.28\%$
test_ddpg_speed[reduce-overhead-None] 1.6996ms 1.2120ms 825.1100 Ops/s 805.9398 Ops/s $\color{#35bf28}+2.38\%$
test_ddpg_speed[reduce-overhead-backward] 2.1422ms 2.0914ms 478.1597 Ops/s 469.2818 Ops/s $\color{#35bf28}+1.89\%$
test_sac_speed[False-None] 9.8244ms 8.0054ms 124.9154 Ops/s 125.2869 Ops/s $\color{#d91a1a}-0.30\%$
test_sac_speed[False-backward] 11.0260ms 10.6591ms 93.8166 Ops/s 92.6045 Ops/s $\color{#35bf28}+1.31\%$
test_sac_speed[True-None] 2.3684ms 2.0700ms 483.0811 Ops/s 473.6177 Ops/s $\color{#35bf28}+2.00\%$
test_sac_speed[True-backward] 4.2883ms 3.8284ms 261.2088 Ops/s 263.9317 Ops/s $\color{#d91a1a}-1.03\%$
test_sac_speed[reduce-overhead-None] 2.7077ms 2.0865ms 479.2716 Ops/s 476.9217 Ops/s $\color{#35bf28}+0.49\%$
test_sac_speed[reduce-overhead-backward] 3.8352ms 3.7461ms 266.9446 Ops/s 244.4418 Ops/s $\textbf{\color{#35bf28}+9.21\%}$
test_redq_speed[False-None] 23.1442ms 13.7625ms 72.6613 Ops/s 74.3738 Ops/s $\color{#d91a1a}-2.30\%$
test_redq_speed[False-backward] 25.6520ms 22.8464ms 43.7706 Ops/s 43.2177 Ops/s $\color{#35bf28}+1.28\%$
test_redq_speed[True-None] 5.5179ms 4.8765ms 205.0636 Ops/s 191.9517 Ops/s $\textbf{\color{#35bf28}+6.83\%}$
test_redq_speed[True-backward] 13.4514ms 12.7564ms 78.3918 Ops/s 78.7464 Ops/s $\color{#d91a1a}-0.45\%$
test_redq_speed[reduce-overhead-None] 6.2124ms 4.9787ms 200.8560 Ops/s 196.2015 Ops/s $\color{#35bf28}+2.37\%$
test_redq_speed[reduce-overhead-backward] 14.1057ms 12.7357ms 78.5195 Ops/s 79.9079 Ops/s $\color{#d91a1a}-1.74\%$
test_redq_deprec_speed[False-None] 14.0086ms 12.8277ms 77.9560 Ops/s 77.8781 Ops/s $\color{#35bf28}+0.10\%$
test_redq_deprec_speed[False-backward] 20.9084ms 18.7211ms 53.4157 Ops/s 52.8703 Ops/s $\color{#35bf28}+1.03\%$
test_redq_deprec_speed[True-None] 5.5302ms 3.9506ms 253.1231 Ops/s 258.0188 Ops/s $\color{#d91a1a}-1.90\%$
test_redq_deprec_speed[True-backward] 8.3614ms 8.2209ms 121.6412 Ops/s 114.2280 Ops/s $\textbf{\color{#35bf28}+6.49\%}$
test_redq_deprec_speed[reduce-overhead-None] 5.1644ms 4.1072ms 243.4730 Ops/s 230.7677 Ops/s $\textbf{\color{#35bf28}+5.51\%}$
test_redq_deprec_speed[reduce-overhead-backward] 9.6748ms 8.9631ms 111.5684 Ops/s 116.0619 Ops/s $\color{#d91a1a}-3.87\%$
test_td3_speed[False-None] 8.2384ms 7.9610ms 125.6129 Ops/s 123.4293 Ops/s $\color{#35bf28}+1.77\%$
test_td3_speed[False-backward] 11.5870ms 10.5403ms 94.8738 Ops/s 93.0575 Ops/s $\color{#35bf28}+1.95\%$
test_td3_speed[True-None] 1.9694ms 1.8041ms 554.2970 Ops/s 558.8431 Ops/s $\color{#d91a1a}-0.81\%$
test_td3_speed[True-backward] 3.6183ms 3.3802ms 295.8414 Ops/s 290.0719 Ops/s $\color{#35bf28}+1.99\%$
test_td3_speed[reduce-overhead-None] 1.9706ms 1.7762ms 563.0060 Ops/s 560.3058 Ops/s $\color{#35bf28}+0.48\%$
test_td3_speed[reduce-overhead-backward] 3.5081ms 3.4120ms 293.0813 Ops/s 292.8645 Ops/s $\color{#35bf28}+0.07\%$
test_cql_speed[False-None] 39.5482ms 36.7064ms 27.2432 Ops/s 27.1799 Ops/s $\color{#35bf28}+0.23\%$
test_cql_speed[False-backward] 49.7049ms 47.0835ms 21.2389 Ops/s 21.2348 Ops/s $\color{#35bf28}+0.02\%$
test_cql_speed[True-None] 16.6433ms 15.9611ms 62.6521 Ops/s 60.6043 Ops/s $\color{#35bf28}+3.38\%$
test_cql_speed[True-backward] 23.7075ms 22.4997ms 44.4451 Ops/s 41.9880 Ops/s $\textbf{\color{#35bf28}+5.85\%}$
test_cql_speed[reduce-overhead-None] 17.0812ms 15.9707ms 62.6146 Ops/s 61.1479 Ops/s $\color{#35bf28}+2.40\%$
test_cql_speed[reduce-overhead-backward] 23.7241ms 22.9628ms 43.5487 Ops/s 43.2763 Ops/s $\color{#35bf28}+0.63\%$
test_a2c_speed[False-None] 8.1624ms 7.2796ms 137.3696 Ops/s 137.7138 Ops/s $\color{#d91a1a}-0.25\%$
test_a2c_speed[False-backward] 16.2782ms 14.5296ms 68.8252 Ops/s 65.6348 Ops/s $\color{#35bf28}+4.86\%$
test_a2c_speed[True-None] 4.1470ms 3.7311ms 268.0147 Ops/s 266.3511 Ops/s $\color{#35bf28}+0.62\%$
test_a2c_speed[True-backward] 11.2784ms 10.1600ms 98.4249 Ops/s 98.1350 Ops/s $\color{#35bf28}+0.30\%$
test_a2c_speed[reduce-overhead-None] 4.4078ms 3.7315ms 267.9881 Ops/s 267.0978 Ops/s $\color{#35bf28}+0.33\%$
test_a2c_speed[reduce-overhead-backward] 10.5026ms 10.0783ms 99.2229 Ops/s 95.8385 Ops/s $\color{#35bf28}+3.53\%$
test_ppo_speed[False-None] 8.7874ms 7.4664ms 133.9330 Ops/s 131.2590 Ops/s $\color{#35bf28}+2.04\%$
test_ppo_speed[False-backward] 15.6346ms 14.6773ms 68.1325 Ops/s 65.3664 Ops/s $\color{#35bf28}+4.23\%$
test_ppo_speed[True-None] 4.4639ms 4.1124ms 243.1645 Ops/s 238.8746 Ops/s $\color{#35bf28}+1.80\%$
test_ppo_speed[True-backward] 10.4490ms 9.9567ms 100.4351 Ops/s 100.1450 Ops/s $\color{#35bf28}+0.29\%$
test_ppo_speed[reduce-overhead-None] 4.7701ms 4.0888ms 244.5681 Ops/s 238.4832 Ops/s $\color{#35bf28}+2.55\%$
test_ppo_speed[reduce-overhead-backward] 11.0714ms 9.9464ms 100.5390 Ops/s 100.3351 Ops/s $\color{#35bf28}+0.20\%$
test_reinforce_speed[False-None] 8.0436ms 6.5689ms 152.2336 Ops/s 153.5053 Ops/s $\color{#d91a1a}-0.83\%$
test_reinforce_speed[False-backward] 13.5037ms 10.0558ms 99.4451 Ops/s 101.9557 Ops/s $\color{#d91a1a}-2.46\%$
test_reinforce_speed[True-None] 3.5806ms 3.0502ms 327.8496 Ops/s 313.5228 Ops/s $\color{#35bf28}+4.57\%$
test_reinforce_speed[True-backward] 10.0080ms 8.9751ms 111.4188 Ops/s 110.8978 Ops/s $\color{#35bf28}+0.47\%$
test_reinforce_speed[reduce-overhead-None] 3.7527ms 3.0486ms 328.0191 Ops/s 316.2689 Ops/s $\color{#35bf28}+3.72\%$
test_reinforce_speed[reduce-overhead-backward] 9.3768ms 8.8928ms 112.4500 Ops/s 111.9203 Ops/s $\color{#35bf28}+0.47\%$
test_iql_speed[False-None] 37.6269ms 33.0425ms 30.2641 Ops/s 30.7605 Ops/s $\color{#d91a1a}-1.61\%$
test_iql_speed[False-backward] 47.2712ms 45.7411ms 21.8622 Ops/s 21.8492 Ops/s $\color{#35bf28}+0.06\%$
test_iql_speed[True-None] 12.0829ms 11.2682ms 88.7452 Ops/s 88.7063 Ops/s $\color{#35bf28}+0.04\%$
test_iql_speed[True-backward] 24.4268ms 22.9357ms 43.6002 Ops/s 43.6895 Ops/s $\color{#d91a1a}-0.20\%$
test_iql_speed[reduce-overhead-None] 13.4827ms 11.8701ms 84.2452 Ops/s 87.2811 Ops/s $\color{#d91a1a}-3.48\%$
test_iql_speed[reduce-overhead-backward] 24.2423ms 23.4748ms 42.5988 Ops/s 44.5699 Ops/s $\color{#d91a1a}-4.42\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.2921ms 5.1142ms 195.5323 Ops/s 201.5352 Ops/s $\color{#d91a1a}-2.98\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.2880ms 0.5330ms 1.8760 KOps/s 1.9538 KOps/s $\color{#d91a1a}-3.98\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.9192ms 0.5082ms 1.9678 KOps/s 2.0224 KOps/s $\color{#d91a1a}-2.70\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.3825ms 5.0013ms 199.9469 Ops/s 211.7049 Ops/s $\textbf{\color{#d91a1a}-5.55\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 2.9047ms 0.5236ms 1.9097 KOps/s 1.9275 KOps/s $\color{#d91a1a}-0.92\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.8527ms 0.5067ms 1.9737 KOps/s 2.0253 KOps/s $\color{#d91a1a}-2.55\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.3517ms 1.6972ms 589.2073 Ops/s 595.3702 Ops/s $\color{#d91a1a}-1.04\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.4630ms 1.6065ms 622.4805 Ops/s 630.2327 Ops/s $\color{#d91a1a}-1.23\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.7185ms 5.0640ms 197.4717 Ops/s 203.1949 Ops/s $\color{#d91a1a}-2.82\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.9784ms 0.6720ms 1.4881 KOps/s 1.5254 KOps/s $\color{#d91a1a}-2.44\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.0066ms 0.6492ms 1.5403 KOps/s 1.5441 KOps/s $\color{#d91a1a}-0.24\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.6118ms 4.9218ms 203.1780 Ops/s 211.6380 Ops/s $\color{#d91a1a}-4.00\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.3030ms 0.5572ms 1.7946 KOps/s 1.8745 KOps/s $\color{#d91a1a}-4.26\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7247ms 0.5169ms 1.9346 KOps/s 2.0601 KOps/s $\textbf{\color{#d91a1a}-6.09\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 5.5910ms 4.9168ms 203.3843 Ops/s 213.1354 Ops/s $\color{#d91a1a}-4.58\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.3295ms 0.5240ms 1.9084 KOps/s 1.9819 KOps/s $\color{#d91a1a}-3.71\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.9580ms 0.5080ms 1.9684 KOps/s 2.0605 KOps/s $\color{#d91a1a}-4.47\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.7917ms 5.1396ms 194.5667 Ops/s 203.2294 Ops/s $\color{#d91a1a}-4.26\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 3.2234ms 0.6725ms 1.4870 KOps/s 1.5474 KOps/s $\color{#d91a1a}-3.91\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 1.2018ms 0.6484ms 1.5424 KOps/s 1.5816 KOps/s $\color{#d91a1a}-2.48\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 5.8251ms 4.2053ms 237.7938 Ops/s 237.1289 Ops/s $\color{#35bf28}+0.28\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 7.7667ms 2.2752ms 439.5240 Ops/s 432.1051 Ops/s $\color{#35bf28}+1.72\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 5.4996ms 1.3839ms 722.5939 Ops/s 780.4138 Ops/s $\textbf{\color{#d91a1a}-7.41\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.5119s 14.5390ms 68.7805 Ops/s 244.7136 Ops/s $\textbf{\color{#d91a1a}-71.89\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 7.7476ms 2.3241ms 430.2833 Ops/s 395.6654 Ops/s $\textbf{\color{#35bf28}+8.75\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 5.2899ms 1.3715ms 729.1523 Ops/s 819.8035 Ops/s $\textbf{\color{#d91a1a}-11.06\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 5.9350ms 4.4304ms 225.7112 Ops/s 33.1906 Ops/s $\textbf{\color{#35bf28}+580.04\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 9.7095ms 2.5877ms 386.4459 Ops/s 407.2670 Ops/s $\textbf{\color{#d91a1a}-5.11\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 2.9803ms 1.4729ms 678.9126 Ops/s 680.1393 Ops/s $\color{#d91a1a}-0.18\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 12.6608ms 12.2788ms 81.4409 Ops/s 80.3843 Ops/s $\color{#35bf28}+1.31\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 16.7923ms 14.6667ms 68.1816 Ops/s 69.3257 Ops/s $\color{#d91a1a}-1.65\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 22.2194ms 21.2467ms 47.0662 Ops/s 46.8349 Ops/s $\color{#35bf28}+0.49\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 16.4824ms 14.8332ms 67.4164 Ops/s 68.5436 Ops/s $\color{#d91a1a}-1.64\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 23.1100ms 21.4816ms 46.5516 Ops/s 47.2883 Ops/s $\color{#d91a1a}-1.56\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 17.0264ms 16.3425ms 61.1903 Ops/s 61.6961 Ops/s $\color{#d91a1a}-0.82\%$

Copy link

github-actions bot commented Feb 20, 2025

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}24$. Worsened: $\large\color{#d91a1a}13$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.8882s 0.8050s 1.2423 Ops/s 1.2236 Ops/s $\color{#35bf28}+1.52\%$
test_transformed 1.4791s 1.3947s 0.7170 Ops/s 0.7063 Ops/s $\color{#35bf28}+1.51\%$
test_serial 2.3880s 2.2975s 0.4353 Ops/s 0.4351 Ops/s $\color{#35bf28}+0.05\%$
test_parallel 1.9545s 1.8566s 0.5386 Ops/s 0.5235 Ops/s $\color{#35bf28}+2.89\%$
test_step_mdp_speed[True-True-True-True-True] 0.1195ms 38.1872μs 26.1868 KOps/s 25.6753 KOps/s $\color{#35bf28}+1.99\%$
test_step_mdp_speed[True-True-True-True-False] 55.4110μs 22.5237μs 44.3977 KOps/s 43.5694 KOps/s $\color{#35bf28}+1.90\%$
test_step_mdp_speed[True-True-True-False-True] 66.7810μs 21.7248μs 46.0304 KOps/s 45.8469 KOps/s $\color{#35bf28}+0.40\%$
test_step_mdp_speed[True-True-True-False-False] 45.8210μs 12.4683μs 80.2037 KOps/s 78.6542 KOps/s $\color{#35bf28}+1.97\%$
test_step_mdp_speed[True-True-False-True-True] 74.2510μs 40.7155μs 24.5607 KOps/s 24.0858 KOps/s $\color{#35bf28}+1.97\%$
test_step_mdp_speed[True-True-False-True-False] 93.4920μs 24.1393μs 41.4262 KOps/s 39.9194 KOps/s $\color{#35bf28}+3.77\%$
test_step_mdp_speed[True-True-False-False-True] 55.2610μs 23.1707μs 43.1579 KOps/s 41.9121 KOps/s $\color{#35bf28}+2.97\%$
test_step_mdp_speed[True-True-False-False-False] 45.2600μs 14.6587μs 68.2190 KOps/s 66.1708 KOps/s $\color{#35bf28}+3.10\%$
test_step_mdp_speed[True-False-True-True-True] 76.9320μs 42.5709μs 23.4902 KOps/s 22.5789 KOps/s $\color{#35bf28}+4.04\%$
test_step_mdp_speed[True-False-True-True-False] 56.7310μs 26.6351μs 37.5445 KOps/s 37.0048 KOps/s $\color{#35bf28}+1.46\%$
test_step_mdp_speed[True-False-True-False-True] 85.2710μs 23.2823μs 42.9510 KOps/s 42.6503 KOps/s $\color{#35bf28}+0.71\%$
test_step_mdp_speed[True-False-True-False-False] 46.8110μs 14.5305μs 68.8210 KOps/s 67.6734 KOps/s $\color{#35bf28}+1.70\%$
test_step_mdp_speed[True-False-False-True-True] 74.8120μs 44.8997μs 22.2719 KOps/s 22.0189 KOps/s $\color{#35bf28}+1.15\%$
test_step_mdp_speed[True-False-False-True-False] 58.5510μs 28.8475μs 34.6650 KOps/s 33.6171 KOps/s $\color{#35bf28}+3.12\%$
test_step_mdp_speed[True-False-False-False-True] 57.3910μs 25.2296μs 39.6360 KOps/s 37.7641 KOps/s $\color{#35bf28}+4.96\%$
test_step_mdp_speed[True-False-False-False-False] 46.6210μs 16.6585μs 60.0296 KOps/s 58.2804 KOps/s $\color{#35bf28}+3.00\%$
test_step_mdp_speed[False-True-True-True-True] 81.3320μs 42.4100μs 23.5794 KOps/s 22.6615 KOps/s $\color{#35bf28}+4.05\%$
test_step_mdp_speed[False-True-True-True-False] 57.7610μs 26.9329μs 37.1293 KOps/s 36.5559 KOps/s $\color{#35bf28}+1.57\%$
test_step_mdp_speed[False-True-True-False-True] 53.0410μs 26.9704μs 37.0777 KOps/s 36.1649 KOps/s $\color{#35bf28}+2.52\%$
test_step_mdp_speed[False-True-True-False-False] 42.6300μs 16.2917μs 61.3811 KOps/s 60.6078 KOps/s $\color{#35bf28}+1.28\%$
test_step_mdp_speed[False-True-False-True-True] 75.5010μs 44.7393μs 22.3517 KOps/s 21.8080 KOps/s $\color{#35bf28}+2.49\%$
test_step_mdp_speed[False-True-False-True-False] 73.4010μs 28.9767μs 34.5105 KOps/s 33.5643 KOps/s $\color{#35bf28}+2.82\%$
test_step_mdp_speed[False-True-False-False-True] 3.2621ms 30.1864μs 33.1275 KOps/s 33.6173 KOps/s $\color{#d91a1a}-1.46\%$
test_step_mdp_speed[False-True-False-False-False] 52.5510μs 18.6702μs 53.5612 KOps/s 53.4725 KOps/s $\color{#35bf28}+0.17\%$
test_step_mdp_speed[False-False-True-True-True] 91.5320μs 47.5655μs 21.0236 KOps/s 20.7630 KOps/s $\color{#35bf28}+1.26\%$
test_step_mdp_speed[False-False-True-True-False] 59.0610μs 31.4810μs 31.7652 KOps/s 31.2238 KOps/s $\color{#35bf28}+1.73\%$
test_step_mdp_speed[False-False-True-False-True] 86.0710μs 29.5725μs 33.8152 KOps/s 33.6186 KOps/s $\color{#35bf28}+0.58\%$
test_step_mdp_speed[False-False-True-False-False] 72.6610μs 18.2416μs 54.8199 KOps/s 53.0101 KOps/s $\color{#35bf28}+3.41\%$
test_step_mdp_speed[False-False-False-True-True] 94.2020μs 48.7160μs 20.5271 KOps/s 19.8742 KOps/s $\color{#35bf28}+3.29\%$
test_step_mdp_speed[False-False-False-True-False] 70.4310μs 33.2967μs 30.0330 KOps/s 29.1610 KOps/s $\color{#35bf28}+2.99\%$
test_step_mdp_speed[False-False-False-False-True] 74.5810μs 31.2910μs 31.9581 KOps/s 30.8463 KOps/s $\color{#35bf28}+3.60\%$
test_step_mdp_speed[False-False-False-False-False] 47.1910μs 20.6486μs 48.4294 KOps/s 47.4841 KOps/s $\color{#35bf28}+1.99\%$
test_values[generalized_advantage_estimate-True-True] 25.1286ms 24.7253ms 40.4444 Ops/s 40.0956 Ops/s $\color{#35bf28}+0.87\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1051s 2.9992ms 333.4192 Ops/s 338.3323 Ops/s $\color{#d91a1a}-1.45\%$
test_values[td0_return_estimate-False-False] 0.1063ms 79.7682μs 12.5363 KOps/s 12.6473 KOps/s $\color{#d91a1a}-0.88\%$
test_values[td1_return_estimate-False-False] 55.0856ms 54.7013ms 18.2811 Ops/s 17.9060 Ops/s $\color{#35bf28}+2.09\%$
test_values[vec_td1_return_estimate-False-False] 1.3574ms 1.0803ms 925.6344 Ops/s 923.0480 Ops/s $\color{#35bf28}+0.28\%$
test_values[td_lambda_return_estimate-True-False] 87.0734ms 86.7116ms 11.5325 Ops/s 11.4950 Ops/s $\color{#35bf28}+0.33\%$
test_values[vec_td_lambda_return_estimate-True-False] 1.3066ms 1.0771ms 928.4152 Ops/s 926.2303 Ops/s $\color{#35bf28}+0.24\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 24.7524ms 24.4916ms 40.8304 Ops/s 40.6521 Ops/s $\color{#35bf28}+0.44\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0232ms 0.7477ms 1.3374 KOps/s 1.3263 KOps/s $\color{#35bf28}+0.84\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.9086ms 0.6670ms 1.4992 KOps/s 1.4961 KOps/s $\color{#35bf28}+0.21\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5295ms 1.4820ms 674.7445 Ops/s 672.7030 Ops/s $\color{#35bf28}+0.30\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7443ms 0.6807ms 1.4690 KOps/s 1.4671 KOps/s $\color{#35bf28}+0.13\%$
test_dqn_speed[False-None] 7.1030ms 1.5008ms 666.2977 Ops/s 658.4892 Ops/s $\color{#35bf28}+1.19\%$
test_dqn_speed[False-backward] 2.2614ms 2.1032ms 475.4601 Ops/s 469.6977 Ops/s $\color{#35bf28}+1.23\%$
test_dqn_speed[True-None] 1.0231ms 0.5733ms 1.7443 KOps/s 1.6836 KOps/s $\color{#35bf28}+3.61\%$
test_dqn_speed[True-backward] 1.1870ms 1.1185ms 894.0449 Ops/s 874.1429 Ops/s $\color{#35bf28}+2.28\%$
test_dqn_speed[reduce-overhead-None] 0.6696ms 0.5806ms 1.7223 KOps/s 1.7089 KOps/s $\color{#35bf28}+0.79\%$
test_dqn_speed[reduce-overhead-backward] 1.0167ms 0.9621ms 1.0394 KOps/s 1.0257 KOps/s $\color{#35bf28}+1.34\%$
test_ddpg_speed[False-None] 3.1142ms 2.8234ms 354.1834 Ops/s 351.5800 Ops/s $\color{#35bf28}+0.74\%$
test_ddpg_speed[False-backward] 4.7051ms 4.1381ms 241.6573 Ops/s 238.2744 Ops/s $\color{#35bf28}+1.42\%$
test_ddpg_speed[True-None] 1.4612ms 1.3400ms 746.2842 Ops/s 734.2056 Ops/s $\color{#35bf28}+1.65\%$
test_ddpg_speed[True-backward] 2.5347ms 2.4199ms 413.2400 Ops/s 383.5263 Ops/s $\textbf{\color{#35bf28}+7.75\%}$
test_ddpg_speed[reduce-overhead-None] 1.5472ms 1.3656ms 732.2649 Ops/s 724.9505 Ops/s $\color{#35bf28}+1.01\%$
test_ddpg_speed[reduce-overhead-backward] 1.9416ms 1.8829ms 531.1026 Ops/s 477.7957 Ops/s $\textbf{\color{#35bf28}+11.16\%}$
test_sac_speed[False-None] 8.3493ms 7.9134ms 126.3672 Ops/s 121.6310 Ops/s $\color{#35bf28}+3.89\%$
test_sac_speed[False-backward] 11.3975ms 10.8853ms 91.8667 Ops/s 88.1497 Ops/s $\color{#35bf28}+4.22\%$
test_sac_speed[True-None] 2.0787ms 1.8528ms 539.7243 Ops/s 528.2034 Ops/s $\color{#35bf28}+2.18\%$
test_sac_speed[True-backward] 3.6957ms 3.5572ms 281.1231 Ops/s 260.0034 Ops/s $\textbf{\color{#35bf28}+8.12\%}$
test_sac_speed[reduce-overhead-None] 20.5177ms 11.7332ms 85.2280 Ops/s 85.8257 Ops/s $\color{#d91a1a}-0.70\%$
test_sac_speed[reduce-overhead-backward] 1.7063ms 1.6458ms 607.6018 Ops/s 541.6473 Ops/s $\textbf{\color{#35bf28}+12.18\%}$
test_redq_speed[False-None] 7.9179ms 7.3492ms 136.0692 Ops/s 132.9132 Ops/s $\color{#35bf28}+2.37\%$
test_redq_speed[False-backward] 11.7336ms 11.1838ms 89.4152 Ops/s 84.9255 Ops/s $\textbf{\color{#35bf28}+5.29\%}$
test_redq_speed[True-None] 2.4336ms 2.3263ms 429.8667 Ops/s 422.0679 Ops/s $\color{#35bf28}+1.85\%$
test_redq_speed[True-backward] 4.5421ms 4.1808ms 239.1863 Ops/s 230.7811 Ops/s $\color{#35bf28}+3.64\%$
test_redq_speed[reduce-overhead-None] 2.4460ms 2.3456ms 426.3221 Ops/s 415.5429 Ops/s $\color{#35bf28}+2.59\%$
test_redq_speed[reduce-overhead-backward] 4.5676ms 4.1865ms 238.8614 Ops/s 230.6122 Ops/s $\color{#35bf28}+3.58\%$
test_redq_deprec_speed[False-None] 9.3648ms 8.9015ms 112.3407 Ops/s 110.9871 Ops/s $\color{#35bf28}+1.22\%$
test_redq_deprec_speed[False-backward] 12.6858ms 12.2190ms 81.8400 Ops/s 80.8072 Ops/s $\color{#35bf28}+1.28\%$
test_redq_deprec_speed[True-None] 2.7465ms 2.6583ms 376.1846 Ops/s 371.8822 Ops/s $\color{#35bf28}+1.16\%$
test_redq_deprec_speed[True-backward] 4.7203ms 4.4567ms 224.3831 Ops/s 226.5212 Ops/s $\color{#d91a1a}-0.94\%$
test_redq_deprec_speed[reduce-overhead-None] 2.6843ms 2.6211ms 381.5209 Ops/s 370.8244 Ops/s $\color{#35bf28}+2.88\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.8825ms 4.4431ms 225.0683 Ops/s 224.2505 Ops/s $\color{#35bf28}+0.36\%$
test_td3_speed[False-None] 7.8779ms 7.8418ms 127.5215 Ops/s 126.2586 Ops/s $\color{#35bf28}+1.00\%$
test_td3_speed[False-backward] 11.0436ms 10.3919ms 96.2290 Ops/s 97.7345 Ops/s $\color{#d91a1a}-1.54\%$
test_td3_speed[True-None] 1.7413ms 1.6678ms 599.6070 Ops/s 585.4200 Ops/s $\color{#35bf28}+2.42\%$
test_td3_speed[True-backward] 3.3896ms 3.3448ms 298.9690 Ops/s 301.4308 Ops/s $\color{#d91a1a}-0.82\%$
test_td3_speed[reduce-overhead-None] 49.1544ms 25.2845ms 39.5499 Ops/s 39.3205 Ops/s $\color{#35bf28}+0.58\%$
test_td3_speed[reduce-overhead-backward] 1.5864ms 1.5214ms 657.3022 Ops/s 694.6871 Ops/s $\textbf{\color{#d91a1a}-5.38\%}$
test_cql_speed[False-None] 17.2509ms 16.5708ms 60.3470 Ops/s 59.7604 Ops/s $\color{#35bf28}+0.98\%$
test_cql_speed[False-backward] 22.4712ms 21.9891ms 45.4772 Ops/s 45.2112 Ops/s $\color{#35bf28}+0.59\%$
test_cql_speed[True-None] 3.5236ms 3.2730ms 305.5346 Ops/s 301.2242 Ops/s $\color{#35bf28}+1.43\%$
test_cql_speed[True-backward] 5.9198ms 5.5026ms 181.7337 Ops/s 172.7758 Ops/s $\textbf{\color{#35bf28}+5.18\%}$
test_cql_speed[reduce-overhead-None] 21.2096ms 12.9337ms 77.3175 Ops/s 77.8052 Ops/s $\color{#d91a1a}-0.63\%$
test_cql_speed[reduce-overhead-backward] 2.1197ms 1.8603ms 537.5390 Ops/s 489.6616 Ops/s $\textbf{\color{#35bf28}+9.78\%}$
test_a2c_speed[False-None] 3.2215ms 3.1295ms 319.5438 Ops/s 300.4619 Ops/s $\textbf{\color{#35bf28}+6.35\%}$
test_a2c_speed[False-backward] 7.0224ms 6.2918ms 158.9369 Ops/s 154.7909 Ops/s $\color{#35bf28}+2.68\%$
test_a2c_speed[True-None] 1.9492ms 1.3465ms 742.6524 Ops/s 724.9722 Ops/s $\color{#35bf28}+2.44\%$
test_a2c_speed[True-backward] 3.0128ms 2.8980ms 345.0600 Ops/s 319.0134 Ops/s $\textbf{\color{#35bf28}+8.16\%}$
test_a2c_speed[reduce-overhead-None] 15.1904ms 8.6283ms 115.8981 Ops/s 116.8312 Ops/s $\color{#d91a1a}-0.80\%$
test_a2c_speed[reduce-overhead-backward] 1.5434ms 1.4562ms 686.7103 Ops/s 675.9391 Ops/s $\color{#35bf28}+1.59\%$
test_ppo_speed[False-None] 4.0600ms 3.6278ms 275.6505 Ops/s 265.5014 Ops/s $\color{#35bf28}+3.82\%$
test_ppo_speed[False-backward] 6.9264ms 6.7233ms 148.7363 Ops/s 145.1431 Ops/s $\color{#35bf28}+2.48\%$
test_ppo_speed[True-None] 1.6138ms 1.4078ms 710.3514 Ops/s 688.7787 Ops/s $\color{#35bf28}+3.13\%$
test_ppo_speed[True-backward] 3.2540ms 3.0370ms 329.2674 Ops/s 321.7373 Ops/s $\color{#35bf28}+2.34\%$
test_ppo_speed[reduce-overhead-None] 1.3775ms 0.9676ms 1.0335 KOps/s 1.0333 KOps/s $\color{#35bf28}+0.02\%$
test_ppo_speed[reduce-overhead-backward] 1.5154ms 1.4165ms 705.9900 Ops/s 639.6247 Ops/s $\textbf{\color{#35bf28}+10.38\%}$
test_reinforce_speed[False-None] 2.3189ms 2.2273ms 448.9641 Ops/s 416.5139 Ops/s $\textbf{\color{#35bf28}+7.79\%}$
test_reinforce_speed[False-backward] 3.6761ms 3.2516ms 307.5448 Ops/s 283.9976 Ops/s $\textbf{\color{#35bf28}+8.29\%}$
test_reinforce_speed[True-None] 1.8436ms 1.2925ms 773.6840 Ops/s 745.3862 Ops/s $\color{#35bf28}+3.80\%$
test_reinforce_speed[True-backward] 2.9748ms 2.9131ms 343.2797 Ops/s 322.8897 Ops/s $\textbf{\color{#35bf28}+6.31\%}$
test_reinforce_speed[reduce-overhead-None] 18.3198ms 9.7692ms 102.3630 Ops/s 104.2179 Ops/s $\color{#d91a1a}-1.78\%$
test_reinforce_speed[reduce-overhead-backward] 1.5702ms 1.5068ms 663.6736 Ops/s 591.4082 Ops/s $\textbf{\color{#35bf28}+12.22\%}$
test_iql_speed[False-None] 9.6591ms 9.1400ms 109.4087 Ops/s 107.4973 Ops/s $\color{#35bf28}+1.78\%$
test_iql_speed[False-backward] 13.6281ms 12.7821ms 78.2346 Ops/s 74.8008 Ops/s $\color{#35bf28}+4.59\%$
test_iql_speed[True-None] 2.7171ms 2.2180ms 450.8503 Ops/s 430.9432 Ops/s $\color{#35bf28}+4.62\%$
test_iql_speed[True-backward] 4.9851ms 4.7363ms 211.1369 Ops/s 203.9229 Ops/s $\color{#35bf28}+3.54\%$
test_iql_speed[reduce-overhead-None] 0.4768s 12.5710ms 79.5479 Ops/s 93.5434 Ops/s $\textbf{\color{#d91a1a}-14.96\%}$
test_iql_speed[reduce-overhead-backward] 2.0630ms 1.9220ms 520.2800 Ops/s 499.4377 Ops/s $\color{#35bf28}+4.17\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.4237ms 6.0868ms 164.2899 Ops/s 162.6105 Ops/s $\color{#35bf28}+1.03\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.6137ms 0.3414ms 2.9290 KOps/s 3.7944 KOps/s $\textbf{\color{#d91a1a}-22.81\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6910ms 0.3232ms 3.0940 KOps/s 4.1654 KOps/s $\textbf{\color{#d91a1a}-25.72\%}$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.4503ms 5.8027ms 172.3348 Ops/s 169.6291 Ops/s $\color{#35bf28}+1.60\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.0035ms 0.3381ms 2.9576 KOps/s 3.4588 KOps/s $\textbf{\color{#d91a1a}-14.49\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.8377ms 0.3211ms 3.1146 KOps/s 3.4663 KOps/s $\textbf{\color{#d91a1a}-10.15\%}$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.7904ms 1.4448ms 692.1500 Ops/s 752.2782 Ops/s $\textbf{\color{#d91a1a}-7.99\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.6694ms 1.2881ms 776.3455 Ops/s 843.0362 Ops/s $\textbf{\color{#d91a1a}-7.91\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.4549ms 5.9888ms 166.9773 Ops/s 164.8118 Ops/s $\color{#35bf28}+1.31\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.8697ms 0.4103ms 2.4375 KOps/s 2.1781 KOps/s $\textbf{\color{#35bf28}+11.91\%}$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.9212ms 0.4647ms 2.1519 KOps/s 2.2416 KOps/s $\color{#d91a1a}-4.00\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.2388ms 5.8023ms 172.3450 Ops/s 167.7819 Ops/s $\color{#35bf28}+2.72\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 2.2656ms 0.3792ms 2.6375 KOps/s 3.8497 KOps/s $\textbf{\color{#d91a1a}-31.49\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8826ms 0.3277ms 3.0515 KOps/s 4.1416 KOps/s $\textbf{\color{#d91a1a}-26.32\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.1605ms 5.7765ms 173.1139 Ops/s 169.2970 Ops/s $\color{#35bf28}+2.25\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 1.8230ms 0.3131ms 3.1938 KOps/s 2.9988 KOps/s $\textbf{\color{#35bf28}+6.50\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5099ms 0.2979ms 3.3573 KOps/s 3.3303 KOps/s $\color{#35bf28}+0.81\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.2440ms 6.0124ms 166.3243 Ops/s 164.4164 Ops/s $\color{#35bf28}+1.16\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.9928ms 0.4222ms 2.3687 KOps/s 2.0463 KOps/s $\textbf{\color{#35bf28}+15.76\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6388ms 0.4057ms 2.4648 KOps/s 2.1538 KOps/s $\textbf{\color{#35bf28}+14.44\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.9196ms 5.3899ms 185.5336 Ops/s 180.3332 Ops/s $\color{#35bf28}+2.88\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 10.2216ms 2.0688ms 483.3703 Ops/s 397.3571 Ops/s $\textbf{\color{#35bf28}+21.65\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 3.4873ms 1.1190ms 893.6805 Ops/s 732.7312 Ops/s $\textbf{\color{#35bf28}+21.97\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4565s 14.5118ms 68.9095 Ops/s 179.4263 Ops/s $\textbf{\color{#d91a1a}-61.59\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 3.8084ms 1.8169ms 550.3987 Ops/s 437.1710 Ops/s $\textbf{\color{#35bf28}+25.90\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 9.3251ms 1.2387ms 807.2949 Ops/s 855.7532 Ops/s $\textbf{\color{#d91a1a}-5.66\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 8.0376ms 5.6217ms 177.8824 Ops/s 31.2508 Ops/s $\textbf{\color{#35bf28}+469.21\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 8.5914ms 2.1966ms 455.2413 Ops/s 430.5805 Ops/s $\textbf{\color{#35bf28}+5.73\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 7.8253ms 1.4067ms 710.9034 Ops/s 784.4842 Ops/s $\textbf{\color{#d91a1a}-9.38\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.2138ms 12.8768ms 77.6590 Ops/s 73.1336 Ops/s $\textbf{\color{#35bf28}+6.19\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 19.0948ms 16.8241ms 59.4384 Ops/s 59.4337 Ops/s $+0.01\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.0931ms 17.6399ms 56.6897 Ops/s 55.4613 Ops/s $\color{#35bf28}+2.21\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 18.7802ms 17.0797ms 58.5489 Ops/s 59.3266 Ops/s $\color{#d91a1a}-1.31\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 18.6364ms 17.6524ms 56.6497 Ops/s 55.4995 Ops/s $\color{#35bf28}+2.07\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 23.0462ms 18.4973ms 54.0618 Ops/s 54.8152 Ops/s $\color{#d91a1a}-1.37\%$

@vmoens vmoens force-pushed the fix-env-ci branch 3 times, most recently from 0f43ee4 to 2ed5d88 Compare February 24, 2025 18:38
@vmoens vmoens force-pushed the fix-env-ci branch 2 times, most recently from 2bb2640 to 0d46818 Compare February 25, 2025 09:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Data Data-related PR, will launch data-related jobs Environments Adds or modifies an environment wrapper
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants