[Feature] Send info dict to the storage device in RBs #2527
Merged
vmoens added a commit that referenced this pull request on Oct 29, 2024:
ghstack-source-id: 4ed60d649b17f96b49f90d234e679937c60a3c32
Pull Request resolved: #2527
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2527
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure, 5 Unrelated Failures
As of commit 9534be2 with merge base 3e4b292:
NEW FAILURE - The following job has failed:
BROKEN TRUNK - The following jobs failed but were present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes.
facebook-github-bot added the CLA Signed label on Oct 29, 2024.
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
3 tasks
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_simple | 0.4109s | 0.4092s | 2.4440 Ops/s | 2.3117 Ops/s | |
test_transformed | 0.6724s | 0.5957s | 1.6788 Ops/s | 1.6480 Ops/s | |
test_serial | 1.3884s | 1.3222s | 0.7563 Ops/s | 0.7611 Ops/s | |
test_parallel | 1.2473s | 1.2413s | 0.8056 Ops/s | 0.7809 Ops/s | |
test_step_mdp_speed[True-True-True-True-True] | 0.2197ms | 26.6928μs | 37.4633 KOps/s | 36.0256 KOps/s | |
test_step_mdp_speed[True-True-True-True-False] | 48.6410μs | 15.4414μs | 64.7611 KOps/s | 63.8274 KOps/s | |
test_step_mdp_speed[True-True-True-False-True] | 0.1317ms | 15.3338μs | 65.2153 KOps/s | 63.2694 KOps/s | |
test_step_mdp_speed[True-True-True-False-False] | 38.4320μs | 8.7807μs | 113.8866 KOps/s | 110.3056 KOps/s | |
test_step_mdp_speed[True-True-False-True-True] | 62.8170μs | 28.5839μs | 34.9847 KOps/s | 33.6793 KOps/s | |
test_step_mdp_speed[True-True-False-True-False] | 40.5960μs | 17.3194μs | 57.7386 KOps/s | 57.0413 KOps/s | |
test_step_mdp_speed[True-True-False-False-True] | 51.5160μs | 17.0536μs | 58.6385 KOps/s | 57.1593 KOps/s | |
test_step_mdp_speed[True-True-False-False-False] | 35.9370μs | 10.4647μs | 95.5596 KOps/s | 92.5107 KOps/s | |
test_step_mdp_speed[True-False-True-True-True] | 82.8450μs | 30.3996μs | 32.8952 KOps/s | 32.0770 KOps/s | |
test_step_mdp_speed[True-False-True-True-False] | 72.8670μs | 18.9137μs | 52.8719 KOps/s | 51.7607 KOps/s | |
test_step_mdp_speed[True-False-True-False-True] | 45.8660μs | 17.0149μs | 58.7720 KOps/s | 56.9593 KOps/s | |
test_step_mdp_speed[True-False-True-False-False] | 37.1590μs | 10.4524μs | 95.6720 KOps/s | 94.4845 KOps/s | |
test_step_mdp_speed[True-False-False-True-True] | 0.1008ms | 31.6887μs | 31.5570 KOps/s | 30.7303 KOps/s | |
test_step_mdp_speed[True-False-False-True-False] | 67.9670μs | 20.4878μs | 48.8096 KOps/s | 48.1721 KOps/s | |
test_step_mdp_speed[True-False-False-False-True] | 48.0200μs | 18.2689μs | 54.7378 KOps/s | 52.3962 KOps/s | |
test_step_mdp_speed[True-False-False-False-False] | 40.0350μs | 12.0101μs | 83.2632 KOps/s | 81.7497 KOps/s | |
test_step_mdp_speed[False-True-True-True-True] | 67.7970μs | 30.2176μs | 33.0933 KOps/s | 32.5358 KOps/s | |
test_step_mdp_speed[False-True-True-True-False] | 82.5510μs | 18.6930μs | 53.4959 KOps/s | 52.2024 KOps/s | |
test_step_mdp_speed[False-True-True-False-True] | 39.7750μs | 19.2550μs | 51.9345 KOps/s | 50.4654 KOps/s | |
test_step_mdp_speed[False-True-True-False-False] | 84.1560μs | 11.6192μs | 86.0642 KOps/s | 83.4259 KOps/s | |
test_step_mdp_speed[False-True-False-True-True] | 96.1490μs | 31.5229μs | 31.7230 KOps/s | 30.2340 KOps/s | |
test_step_mdp_speed[False-True-False-True-False] | 61.7850μs | 20.2417μs | 49.4029 KOps/s | 48.3864 KOps/s | |
test_step_mdp_speed[False-True-False-False-True] | 2.7618ms | 20.7385μs | 48.2195 KOps/s | 47.0076 KOps/s | |
test_step_mdp_speed[False-True-False-False-False] | 46.7870μs | 13.1551μs | 76.0160 KOps/s | 74.5223 KOps/s | |
test_step_mdp_speed[False-False-True-True-True] | 91.4930μs | 33.2585μs | 30.0675 KOps/s | 29.3012 KOps/s | |
test_step_mdp_speed[False-False-True-True-False] | 47.9500μs | 22.0456μs | 45.3604 KOps/s | 44.6418 KOps/s | |
test_step_mdp_speed[False-False-True-False-True] | 56.1250μs | 20.5907μs | 48.5657 KOps/s | 46.6896 KOps/s | |
test_step_mdp_speed[False-False-True-False-False] | 59.8100μs | 12.8962μs | 77.5424 KOps/s | 73.4317 KOps/s | |
test_step_mdp_speed[False-False-False-True-True] | 80.6330μs | 34.4587μs | 29.0202 KOps/s | 28.0006 KOps/s | |
test_step_mdp_speed[False-False-False-True-False] | 58.0690μs | 23.4877μs | 42.5754 KOps/s | 41.7663 KOps/s | |
test_step_mdp_speed[False-False-False-False-True] | 58.1790μs | 22.1496μs | 45.1475 KOps/s | 43.8334 KOps/s | |
test_step_mdp_speed[False-False-False-False-False] | 57.0370μs | 14.6668μs | 68.1814 KOps/s | 67.3915 KOps/s | |
test_values[generalized_advantage_estimate-True-True] | 9.8213ms | 9.4617ms | 105.6893 Ops/s | 106.4141 Ops/s | |
test_values[vec_generalized_advantage_estimate-True-True] | 37.4862ms | 35.4438ms | 28.2137 Ops/s | 27.7855 Ops/s | |
test_values[td0_return_estimate-False-False] | 0.2342ms | 0.1739ms | 5.7500 KOps/s | 5.7123 KOps/s | |
test_values[td1_return_estimate-False-False] | 25.5359ms | 23.8254ms | 41.9720 Ops/s | 41.4845 Ops/s | |
test_values[vec_td1_return_estimate-False-False] | 37.6566ms | 35.6873ms | 28.0212 Ops/s | 27.7114 Ops/s | |
test_values[td_lambda_return_estimate-True-False] | 35.2862ms | 34.4722ms | 29.0089 Ops/s | 28.6611 Ops/s | |
test_values[vec_td_lambda_return_estimate-True-False] | 38.0934ms | 35.5810ms | 28.1049 Ops/s | 27.9226 Ops/s | |
test_gae_speed[generalized_advantage_estimate-False-1-512] | 10.1926ms | 8.3625ms | 119.5812 Ops/s | 119.8936 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 2.1938ms | 1.7818ms | 561.2408 Ops/s | 453.9889 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.4615ms | 0.3533ms | 2.8304 KOps/s | 2.7895 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 48.2477ms | 46.8703ms | 21.3355 Ops/s | 20.2175 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 3.9237ms | 3.0301ms | 330.0215 Ops/s | 329.3062 Ops/s | |
test_dqn_speed[False-None] | 5.9455ms | 1.3336ms | 749.8661 Ops/s | 742.3061 Ops/s | |
test_dqn_speed[False-backward] | 1.8926ms | 1.8131ms | 551.5366 Ops/s | 549.2455 Ops/s | |
test_dqn_speed[True-None] | 1.2532ms | 0.4644ms | 2.1534 KOps/s | 2.1643 KOps/s | |
test_dqn_speed[True-backward] | 1.0050ms | 0.8903ms | 1.1233 KOps/s | 1.1326 KOps/s | |
test_dqn_speed[reduce-overhead-None] | 0.6050ms | 0.4711ms | 2.1228 KOps/s | 2.1505 KOps/s | |
test_dqn_speed[reduce-overhead-backward] | 0.9491ms | 0.8857ms | 1.1291 KOps/s | 1.1288 KOps/s | |
test_ddpg_speed[False-None] | 3.6735ms | 2.7638ms | 361.8206 Ops/s | 358.4483 Ops/s | |
test_ddpg_speed[False-backward] | 4.1167ms | 3.9048ms | 256.0966 Ops/s | 254.8918 Ops/s | |
test_ddpg_speed[True-None] | 1.3838ms | 1.0134ms | 986.8030 Ops/s | 982.7805 Ops/s | |
test_ddpg_speed[True-backward] | 1.9518ms | 1.8916ms | 528.6497 Ops/s | 528.8361 Ops/s | |
test_ddpg_speed[reduce-overhead-None] | 1.1626ms | 1.0129ms | 987.2166 Ops/s | 988.7529 Ops/s | |
test_ddpg_speed[reduce-overhead-backward] | 1.9366ms | 1.8964ms | 527.3169 Ops/s | 527.1304 Ops/s | |
test_sac_speed[False-None] | 8.2383ms | 7.8089ms | 128.0590 Ops/s | 126.9505 Ops/s | |
test_sac_speed[False-backward] | 12.2476ms | 10.5819ms | 94.5014 Ops/s | 94.9021 Ops/s | |
test_sac_speed[True-None] | 2.1423ms | 1.8320ms | 545.8461 Ops/s | 538.3485 Ops/s | |
test_sac_speed[True-backward] | 3.6503ms | 3.5228ms | 283.8690 Ops/s | 278.8408 Ops/s | |
test_sac_speed[reduce-overhead-None] | 2.2534ms | 1.8305ms | 546.3071 Ops/s | 541.0455 Ops/s | |
test_sac_speed[reduce-overhead-backward] | 3.8110ms | 3.5344ms | 282.9311 Ops/s | 280.6653 Ops/s | |
test_redq_speed[False-None] | 14.1837ms | 12.4691ms | 80.1984 Ops/s | 71.5816 Ops/s | |
test_redq_speed[False-backward] | 23.5869ms | 21.7221ms | 46.0361 Ops/s | 44.1702 Ops/s | |
test_redq_speed[True-None] | 5.7105ms | 4.4638ms | 224.0239 Ops/s | 207.5802 Ops/s | |
test_redq_speed[True-backward] | 12.9022ms | 11.7926ms | 84.7992 Ops/s | 79.8016 Ops/s | |
test_redq_speed[reduce-overhead-None] | 5.5038ms | 4.4856ms | 222.9352 Ops/s | 199.2055 Ops/s | |
test_redq_speed[reduce-overhead-backward] | 12.2307ms | 11.7311ms | 85.2438 Ops/s | 77.4540 Ops/s | |
test_redq_deprec_speed[False-None] | 13.4510ms | 12.4080ms | 80.5930 Ops/s | 76.6075 Ops/s | |
test_redq_deprec_speed[False-backward] | 20.2394ms | 18.1111ms | 55.2146 Ops/s | 50.8634 Ops/s | |
test_redq_deprec_speed[True-None] | 4.5017ms | 3.5612ms | 280.8042 Ops/s | 274.1604 Ops/s | |
test_redq_deprec_speed[True-backward] | 8.7983ms | 8.0159ms | 124.7515 Ops/s | 122.0335 Ops/s | |
test_redq_deprec_speed[reduce-overhead-None] | 4.7178ms | 3.5536ms | 281.4028 Ops/s | 273.0734 Ops/s | |
test_redq_deprec_speed[reduce-overhead-backward] | 8.2211ms | 7.8892ms | 126.7562 Ops/s | 122.0617 Ops/s | |
test_td3_speed[False-None] | 8.0042ms | 7.7367ms | 129.2549 Ops/s | 126.8785 Ops/s | |
test_td3_speed[False-backward] | 11.7184ms | 10.1801ms | 98.2305 Ops/s | 96.3768 Ops/s | |
test_td3_speed[True-None] | 1.8620ms | 1.7138ms | 583.5114 Ops/s | 565.8928 Ops/s | |
test_td3_speed[True-backward] | 3.3878ms | 3.3270ms | 300.5695 Ops/s | 291.4255 Ops/s | |
test_td3_speed[reduce-overhead-None] | 1.8068ms | 1.7181ms | 582.0400 Ops/s | 557.6295 Ops/s | |
test_td3_speed[reduce-overhead-backward] | 3.3943ms | 3.3132ms | 301.8214 Ops/s | 285.8219 Ops/s | |
test_cql_speed[False-None] | 44.9391ms | 36.2649ms | 27.5749 Ops/s | 27.3620 Ops/s | |
test_cql_speed[False-backward] | 0.2767s | 51.1013ms | 19.5690 Ops/s | 21.4575 Ops/s | |
test_cql_speed[True-None] | 16.9444ms | 15.7981ms | 63.2988 Ops/s | 64.3498 Ops/s | |
test_cql_speed[True-backward] | 23.4909ms | 22.1310ms | 45.1855 Ops/s | 43.9347 Ops/s | |
test_cql_speed[reduce-overhead-None] | 16.2246ms | 15.4229ms | 64.8386 Ops/s | 62.3311 Ops/s | |
test_cql_speed[reduce-overhead-backward] | 23.1687ms | 21.9593ms | 45.5389 Ops/s | 44.5610 Ops/s | |
test_a2c_speed[False-None] | 8.5970ms | 7.0628ms | 141.5874 Ops/s | 137.7662 Ops/s | |
test_a2c_speed[False-backward] | 16.1906ms | 14.1487ms | 70.6776 Ops/s | 68.4180 Ops/s | |
test_a2c_speed[True-None] | 3.9132ms | 3.2904ms | 303.9107 Ops/s | 300.0172 Ops/s | |
test_a2c_speed[True-backward] | 10.0701ms | 9.6152ms | 104.0017 Ops/s | 102.3440 Ops/s | |
test_a2c_speed[reduce-overhead-None] | 4.3648ms | 3.3622ms | 297.4270 Ops/s | 298.7917 Ops/s | |
test_a2c_speed[reduce-overhead-backward] | 9.8964ms | 9.5808ms | 104.3755 Ops/s | 102.9064 Ops/s | |
test_ppo_speed[False-None] | 9.0911ms | 7.3445ms | 136.1567 Ops/s | 134.8228 Ops/s | |
test_ppo_speed[False-backward] | 15.9226ms | 14.5647ms | 68.6593 Ops/s | 66.0596 Ops/s | |
test_ppo_speed[True-None] | 4.3473ms | 3.6997ms | 270.2929 Ops/s | 267.3561 Ops/s | |
test_ppo_speed[True-backward] | 9.8361ms | 9.4411ms | 105.9197 Ops/s | 104.7688 Ops/s | |
test_ppo_speed[reduce-overhead-None] | 4.3747ms | 3.6859ms | 271.3035 Ops/s | 265.9987 Ops/s | |
test_ppo_speed[reduce-overhead-backward] | 9.7554ms | 9.4770ms | 105.5191 Ops/s | 104.9060 Ops/s | |
test_reinforce_speed[False-None] | 7.3569ms | 6.4067ms | 156.0861 Ops/s | 153.9277 Ops/s | |
test_reinforce_speed[False-backward] | 11.3029ms | 9.6244ms | 103.9030 Ops/s | 102.7136 Ops/s | |
test_reinforce_speed[True-None] | 3.5578ms | 2.9083ms | 343.8387 Ops/s | 375.4933 Ops/s | |
test_reinforce_speed[True-backward] | 9.7562ms | 8.5101ms | 117.5078 Ops/s | 117.5461 Ops/s | |
test_reinforce_speed[reduce-overhead-None] | 3.0892ms | 2.6212ms | 381.5070 Ops/s | 371.8722 Ops/s | |
test_reinforce_speed[reduce-overhead-backward] | 9.0909ms | 8.4699ms | 118.0650 Ops/s | 116.7153 Ops/s | |
test_iql_speed[False-None] | 33.9529ms | 31.8166ms | 31.4301 Ops/s | 30.7306 Ops/s | |
test_iql_speed[False-backward] | 47.0128ms | 44.6892ms | 22.3768 Ops/s | 21.7683 Ops/s | |
test_iql_speed[True-None] | 11.6362ms | 10.4632ms | 95.5735 Ops/s | 93.9577 Ops/s | |
test_iql_speed[True-backward] | 22.7498ms | 21.1695ms | 47.2377 Ops/s | 46.6716 Ops/s | |
test_iql_speed[reduce-overhead-None] | 11.9718ms | 10.4598ms | 95.6044 Ops/s | 93.3842 Ops/s | |
test_iql_speed[reduce-overhead-backward] | 22.9129ms | 21.1937ms | 47.1838 Ops/s | 46.0657 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 7.4980ms | 4.8201ms | 207.4625 Ops/s | 204.2726 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.7587ms | 0.4977ms | 2.0092 KOps/s | 2.0762 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7900ms | 0.4783ms | 2.0909 KOps/s | 2.1591 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 7.1100ms | 4.5881ms | 217.9551 Ops/s | 214.7107 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 2.6461ms | 0.4950ms | 2.0202 KOps/s | 2.0873 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6675ms | 0.4667ms | 2.1425 KOps/s | 2.2159 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 2.4295ms | 1.6274ms | 614.4582 Ops/s | 612.6243 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 2.2780ms | 1.5763ms | 634.4029 Ops/s | 634.9500 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 7.2917ms | 4.6780ms | 213.7650 Ops/s | 209.5792 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.4955ms | 0.6383ms | 1.5667 KOps/s | 1.6060 KOps/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.9909ms | 0.6168ms | 1.6213 KOps/s | 1.6644 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 5.0524ms | 4.5159ms | 221.4404 Ops/s | 216.7544 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.9835ms | 0.5012ms | 1.9953 KOps/s | 2.0661 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.6671ms | 0.4748ms | 2.1062 KOps/s | 2.1748 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 4.8527ms | 4.5426ms | 220.1405 Ops/s | 221.0962 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 2.6267ms | 0.4946ms | 2.0218 KOps/s | 2.0463 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.8227ms | 0.4757ms | 2.1020 KOps/s | 2.2072 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 4.9300ms | 4.6915ms | 213.1518 Ops/s | 207.2505 Ops/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.2117ms | 0.6475ms | 1.5445 KOps/s | 1.6102 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.7515ms | 0.6140ms | 1.6288 KOps/s | 1.6304 KOps/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 7.2267ms | 4.2206ms | 236.9359 Ops/s | 230.5484 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 5.7785ms | 2.2371ms | 447.0165 Ops/s | 443.1257 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 6.1001ms | 1.2778ms | 782.6181 Ops/s | 784.9101 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.3822s | 11.7820ms | 84.8752 Ops/s | 221.7003 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 6.2995ms | 2.3519ms | 425.1925 Ops/s | 445.1053 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 5.4245ms | 1.3281ms | 752.9362 Ops/s | 782.6513 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 5.8949ms | 4.3423ms | 230.2950 Ops/s | 224.7640 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 8.9776ms | 2.4808ms | 403.0939 Ops/s | 413.2690 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 4.8704ms | 1.4510ms | 689.1965 Ops/s | 696.4398 Ops/s | |
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000-100-True] | 45.6838ms | 40.3827ms | 24.7631 Ops/s | 22.0373 Ops/s | |
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000-100-False] | 9.2556ms | 8.7897ms | 113.7697 Ops/s | 109.9488 Ops/s |

Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_simple | 0.7263s | 0.7262s | 1.3771 Ops/s | 1.3916 Ops/s | |
test_transformed | 1.0612s | 0.9846s | 1.0156 Ops/s | 1.0436 Ops/s | |
test_serial | 2.1701s | 2.0911s | 0.4782 Ops/s | 0.4804 Ops/s | |
test_parallel | 2.0851s | 1.9809s | 0.5048 Ops/s | 0.5113 Ops/s | |
test_step_mdp_speed[True-True-True-True-True] | 0.2233ms | 36.4449μs | 27.4387 KOps/s | 27.6337 KOps/s | |
test_step_mdp_speed[True-True-True-True-False] | 63.4110μs | 20.7100μs | 48.2859 KOps/s | 48.2264 KOps/s | |
test_step_mdp_speed[True-True-True-False-True] | 47.6600μs | 20.1077μs | 49.7322 KOps/s | 50.8852 KOps/s | |
test_step_mdp_speed[True-True-True-False-False] | 50.0610μs | 11.7278μs | 85.2675 KOps/s | 86.4735 KOps/s | |
test_step_mdp_speed[True-True-False-True-True] | 70.4010μs | 38.8063μs | 25.7690 KOps/s | 25.9925 KOps/s | |
test_step_mdp_speed[True-True-False-True-False] | 48.4510μs | 22.8370μs | 43.7886 KOps/s | 44.8188 KOps/s | |
test_step_mdp_speed[True-True-False-False-True] | 52.5000μs | 21.6475μs | 46.1948 KOps/s | 46.1649 KOps/s | |
test_step_mdp_speed[True-True-False-False-False] | 42.7410μs | 13.6220μs | 73.4107 KOps/s | 73.5544 KOps/s | |
test_step_mdp_speed[True-False-True-True-True] | 67.6820μs | 41.1139μs | 24.3227 KOps/s | 24.6154 KOps/s | |
test_step_mdp_speed[True-False-True-True-False] | 54.0810μs | 24.9296μs | 40.1130 KOps/s | 40.0467 KOps/s | |
test_step_mdp_speed[True-False-True-False-True] | 50.7810μs | 22.0920μs | 45.2652 KOps/s | 45.3050 KOps/s | |
test_step_mdp_speed[True-False-True-False-False] | 53.6310μs | 13.8083μs | 72.4201 KOps/s | 74.1019 KOps/s | |
test_step_mdp_speed[True-False-False-True-True] | 76.7710μs | 43.0752μs | 23.2152 KOps/s | 24.0770 KOps/s | |
test_step_mdp_speed[True-False-False-True-False] | 57.0210μs | 27.0309μs | 36.9946 KOps/s | 37.7090 KOps/s | |
test_step_mdp_speed[True-False-False-False-True] | 53.7610μs | 24.0000μs | 41.6667 KOps/s | 41.6647 KOps/s | |
test_step_mdp_speed[True-False-False-False-False] | 46.4800μs | 15.7996μs | 63.2928 KOps/s | 63.7994 KOps/s | |
test_step_mdp_speed[False-True-True-True-True] | 98.6420μs | 40.3969μs | 24.7544 KOps/s | 25.1633 KOps/s | |
test_step_mdp_speed[False-True-True-True-False] | 65.2110μs | 24.4477μs | 40.9036 KOps/s | 40.6745 KOps/s | |
test_step_mdp_speed[False-True-True-False-True] | 49.4910μs | 25.7454μs | 38.8419 KOps/s | 39.3588 KOps/s | |
test_step_mdp_speed[False-True-True-False-False] | 42.7810μs | 15.3189μs | 65.2789 KOps/s | 65.8416 KOps/s | |
test_step_mdp_speed[False-True-False-True-True] | 82.9910μs | 41.3356μs | 24.1922 KOps/s | 23.7163 KOps/s | |
test_step_mdp_speed[False-True-False-True-False] | 59.6410μs | 26.2197μs | 38.1393 KOps/s | 37.8834 KOps/s | |
test_step_mdp_speed[False-True-False-False-True] | 3.4657ms | 28.0636μs | 35.6334 KOps/s | 35.6190 KOps/s | |
test_step_mdp_speed[False-True-False-False-False] | 56.9010μs | 17.3258μs | 57.7175 KOps/s | 57.9876 KOps/s | |
test_step_mdp_speed[False-False-True-True-True] | 77.3010μs | 45.1926μs | 22.1275 KOps/s | 22.2767 KOps/s | |
test_step_mdp_speed[False-False-True-True-False] | 62.4610μs | 29.0763μs | 34.3923 KOps/s | 34.6244 KOps/s | |
test_step_mdp_speed[False-False-True-False-True] | 69.7910μs | 28.0167μs | 35.6930 KOps/s | 36.1817 KOps/s | |
test_step_mdp_speed[False-False-True-False-False] | 49.1710μs | 17.5797μs | 56.8837 KOps/s | 58.0651 KOps/s | |
test_step_mdp_speed[False-False-False-True-True] | 0.1409ms | 45.6195μs | 21.9204 KOps/s | 22.1926 KOps/s | |
test_step_mdp_speed[False-False-False-True-False] | 73.0010μs | 30.3763μs | 32.9204 KOps/s | 32.6601 KOps/s | |
test_step_mdp_speed[False-False-False-False-True] | 53.8510μs | 29.4178μs | 33.9931 KOps/s | 34.0869 KOps/s | |
test_step_mdp_speed[False-False-False-False-False] | 46.8010μs | 19.0435μs | 52.5114 KOps/s | 52.1693 KOps/s | |
test_values[generalized_advantage_estimate-True-True] | 25.1251ms | 24.7334ms | 40.4312 Ops/s | 41.3879 Ops/s | |
test_values[vec_generalized_advantage_estimate-True-True] | 0.1199s | 3.2849ms | 304.4205 Ops/s | 346.4279 Ops/s | |
test_values[td0_return_estimate-False-False] | 0.1398ms | 66.5821μs | 15.0191 KOps/s | 15.6020 KOps/s | |
test_values[td1_return_estimate-False-False] | 55.6121ms | 55.2454ms | 18.1011 Ops/s | 18.5022 Ops/s | |
test_values[vec_td1_return_estimate-False-False] | 1.2567ms | 1.0676ms | 936.7160 Ops/s | 944.2852 Ops/s | |
test_values[td_lambda_return_estimate-True-False] | 87.9760ms | 87.5931ms | 11.4164 Ops/s | 11.6570 Ops/s | |
test_values[vec_td_lambda_return_estimate-True-False] | 1.2312ms | 1.0639ms | 939.9374 Ops/s | 947.5478 Ops/s | |
test_gae_speed[generalized_advantage_estimate-False-1-512] | 24.5362ms | 24.3687ms | 41.0362 Ops/s | 39.8636 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 1.0257ms | 0.7407ms | 1.3500 KOps/s | 1.3692 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.7635ms | 0.6554ms | 1.5257 KOps/s | 1.5486 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 1.5185ms | 1.4685ms | 680.9749 Ops/s | 684.8512 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 0.7155ms | 0.6702ms | 1.4921 KOps/s | 1.4883 KOps/s | |
test_dqn_speed[False-None] | 6.8643ms | 1.2916ms | 774.2515 Ops/s | 782.6185 Ops/s | |
test_dqn_speed[False-backward] | 1.8587ms | 1.7784ms | 562.2967 Ops/s | 556.9405 Ops/s | |
test_dqn_speed[True-None] | 0.9012ms | 0.5524ms | 1.8103 KOps/s | 1.6972 KOps/s | |
test_dqn_speed[True-backward] | 1.1204ms | 0.9937ms | 1.0064 KOps/s | 828.2356 Ops/s | |
test_dqn_speed[reduce-overhead-None] | 0.6200ms | 0.5533ms | 1.8073 KOps/s | 1.7104 KOps/s | |
test_dqn_speed[reduce-overhead-backward] | 1.0927ms | 1.0224ms | 978.0989 Ops/s | 1.0106 KOps/s | |
test_ddpg_speed[False-None] | 3.4186ms | 2.6645ms | 375.2986 Ops/s | 374.8383 Ops/s | |
test_ddpg_speed[False-backward] | 3.9713ms | 3.8486ms | 259.8340 Ops/s | 261.8517 Ops/s | |
test_ddpg_speed[True-None] | 1.3801ms | 1.2431ms | 804.4572 Ops/s | 785.3124 Ops/s | |
test_ddpg_speed[True-backward] | 2.2368ms | 2.1662ms | 461.6328 Ops/s | 454.3018 Ops/s | |
test_ddpg_speed[reduce-overhead-None] | 1.3353ms | 1.2196ms | 819.9220 Ops/s | 801.7292 Ops/s | |
test_ddpg_speed[reduce-overhead-backward] | 2.2734ms | 2.1620ms | 462.5391 Ops/s | 455.4313 Ops/s | |
test_sac_speed[False-None] | 8.6948ms | 7.3285ms | 136.4527 Ops/s | 135.3828 Ops/s | |
test_sac_speed[False-backward] | 11.0928ms | 10.4450ms | 95.7393 Ops/s | 96.1315 Ops/s | |
test_sac_speed[True-None] | 2.1879ms | 1.9859ms | 503.5406 Ops/s | 469.6459 Ops/s | |
test_sac_speed[True-backward] | 4.0009ms | 3.9117ms | 255.6434 Ops/s | 249.7056 Ops/s | |
test_sac_speed[reduce-overhead-None] | 2.1054ms | 1.9827ms | 504.3596 Ops/s | 498.5620 Ops/s | |
test_sac_speed[reduce-overhead-backward] | 3.9640ms | 3.8433ms | 260.1928 Ops/s | 255.6252 Ops/s | |
test_redq_speed[False-None] | 15.0083ms | 9.9675ms | 100.3261 Ops/s | 101.8415 Ops/s | |
test_redq_speed[False-backward] | 17.7578ms | 16.9014ms | 59.1668 Ops/s | 60.3767 Ops/s | |
test_redq_speed[True-None] | 3.6974ms | 3.4516ms | 289.7228 Ops/s | 297.6183 Ops/s | |
test_redq_speed[True-backward] | 8.8649ms | 8.5407ms | 117.0860 Ops/s | 122.6544 Ops/s | |
test_redq_speed[reduce-overhead-None] | 3.7682ms | 3.5508ms | 281.6245 Ops/s | 303.6865 Ops/s | |
test_redq_speed[reduce-overhead-backward] | 8.8843ms | 8.5294ms | 117.2411 Ops/s | 123.4396 Ops/s | |
test_redq_deprec_speed[False-None] | 11.0514ms | 10.3165ms | 96.9325 Ops/s | 96.3213 Ops/s | |
test_redq_deprec_speed[False-backward] | 15.5452ms | 15.0763ms | 66.3293 Ops/s | 66.3618 Ops/s | |
test_redq_deprec_speed[True-None] | 3.3975ms | 3.1999ms | 312.5135 Ops/s | 306.4938 Ops/s | |
test_redq_deprec_speed[True-backward] | 7.5170ms | 7.1858ms | 139.1643 Ops/s | 140.2049 Ops/s | |
test_redq_deprec_speed[reduce-overhead-None] | 3.4166ms | 3.2431ms | 308.3462 Ops/s | 304.0174 Ops/s | |
test_redq_deprec_speed[reduce-overhead-backward] | 7.4243ms | 7.1820ms | 139.2376 Ops/s | 140.4250 Ops/s | |
test_td3_speed[False-None] | 33.6415ms | 7.5976ms | 131.6202 Ops/s | 138.2855 Ops/s | |
test_td3_speed[False-backward] | 10.4993ms | 10.1136ms | 98.8767 Ops/s | 99.8938 Ops/s | |
test_td3_speed[True-None] | 1.9138ms | 1.8863ms | 530.1282 Ops/s | 527.4511 Ops/s | |
test_td3_speed[True-backward] | 3.7491ms | 3.6384ms | 274.8427 Ops/s | 274.5771 Ops/s | |
test_td3_speed[reduce-overhead-None] | 1.9018ms | 1.8828ms | 531.1143 Ops/s | 525.8575 Ops/s | |
test_td3_speed[reduce-overhead-backward] | 3.8174ms | 3.6307ms | 275.4253 Ops/s | 273.6260 Ops/s | |
test_cql_speed[False-None] | 27.2321ms | 24.4187ms | 40.9522 Ops/s | 41.4235 Ops/s | |
test_cql_speed[False-backward] | 37.4943ms | 33.8320ms | 29.5578 Ops/s | 30.2939 Ops/s | |
test_cql_speed[True-None] | 11.1230ms | 10.7825ms | 92.7427 Ops/s | 96.8400 Ops/s | |
test_cql_speed[True-backward] | 16.7136ms | 16.4044ms | 60.9593 Ops/s | 66.9371 Ops/s | |
test_cql_speed[reduce-overhead-None] | 11.1284ms | 10.8159ms | 92.4568 Ops/s | 102.3257 Ops/s | |
test_cql_speed[reduce-overhead-backward] | 16.6566ms | 16.2376ms | 61.5854 Ops/s | 66.8687 Ops/s | |
test_a2c_speed[False-None] | 5.3203ms | 5.1188ms | 195.3579 Ops/s | 202.1801 Ops/s | |
test_a2c_speed[False-backward] | 11.8230ms | 11.3362ms | 88.2131 Ops/s | 87.6412 Ops/s | |
test_a2c_speed[True-None] | 3.3072ms | 2.9864ms | 334.8471 Ops/s | 363.3952 Ops/s | |
test_a2c_speed[True-backward] | 8.5906ms | 8.2415ms | 121.3375 Ops/s | 119.7875 Ops/s | |
test_a2c_speed[reduce-overhead-None] | 3.1420ms | 2.9973ms | 333.6377 Ops/s | 361.5097 Ops/s | |
test_a2c_speed[reduce-overhead-backward] | 8.4671ms | 8.1896ms | 122.1067 Ops/s | 127.0568 Ops/s | |
test_ppo_speed[False-None] | 6.1804ms | 5.5281ms | 180.8943 Ops/s | 192.9478 Ops/s | |
test_ppo_speed[False-backward] | 12.3677ms | 12.0388ms | 83.0646 Ops/s | 84.7256 Ops/s | |
test_ppo_speed[True-None] | 3.5909ms | 3.4345ms | 291.1605 Ops/s | 315.8091 Ops/s | |
test_ppo_speed[True-backward] | 8.0854ms | 7.9242ms | 126.1963 Ops/s | 122.3495 Ops/s | |
test_ppo_speed[reduce-overhead-None] | 3.5900ms | 3.4069ms | 293.5192 Ops/s | 293.1329 Ops/s | |
test_ppo_speed[reduce-overhead-backward] | 8.3114ms | 7.9976ms | 125.0373 Ops/s | 126.5402 Ops/s | |
test_reinforce_speed[False-None] | 6.2010ms | 4.3701ms | 228.8298 Ops/s | 229.8840 Ops/s | |
test_reinforce_speed[False-backward] | 7.4849ms | 7.1586ms | 139.6923 Ops/s | 140.4760 Ops/s | |
test_reinforce_speed[True-None] | 2.3422ms | 2.1759ms | 459.5709 Ops/s | 442.3845 Ops/s | |
test_reinforce_speed[True-backward] | 7.1182ms | 6.9098ms | 144.7210 Ops/s | 134.8024 Ops/s | |
test_reinforce_speed[reduce-overhead-None] | 2.4570ms | 2.2033ms | 453.8618 Ops/s | 456.6721 Ops/s | |
test_reinforce_speed[reduce-overhead-backward] | 7.1219ms | 6.9409ms | 144.0729 Ops/s | 147.6085 Ops/s | |
test_iql_speed[False-None] | 19.8780ms | 19.0751ms | 52.4242 Ops/s | 53.8734 Ops/s | |
test_iql_speed[False-backward] | 30.1513ms | 29.2161ms | 34.2277 Ops/s | 34.6587 Ops/s | |
test_iql_speed[True-None] | 7.0462ms | 6.6387ms | 150.6325 Ops/s | 150.0283 Ops/s | |
test_iql_speed[True-backward] | 15.3029ms | 14.8967ms | 67.1289 Ops/s | 66.7119 Ops/s | |
test_iql_speed[reduce-overhead-None] | 6.9653ms | 6.6257ms | 150.9269 Ops/s | 151.2150 Ops/s | |
test_iql_speed[reduce-overhead-backward] | 15.5682ms | 15.0364ms | 66.5052 Ops/s | 66.5518 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 6.4456ms | 6.2840ms | 159.1348 Ops/s | 155.3300 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.9102ms | 0.2925ms | 3.4185 KOps/s | 3.8386 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.5036ms | 0.2695ms | 3.7111 KOps/s | 4.0333 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 6.2559ms | 6.0344ms | 165.7170 Ops/s | 161.1526 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.6201ms | 0.2511ms | 3.9821 KOps/s | 3.8664 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.3786ms | 0.2310ms | 4.3290 KOps/s | 3.4980 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 1.4130ms | 1.2333ms | 810.8270 Ops/s | 787.0803 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 1.3700ms | 1.1751ms | 850.9764 Ops/s | 814.2153 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 6.5344ms | 6.3121ms | 158.4248 Ops/s | 155.4729 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.6823ms | 0.4211ms | 2.3747 KOps/s | 2.3706 KOps/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.7587ms | 0.3945ms | 2.5351 KOps/s | 2.4466 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 6.1695ms | 6.0610ms | 164.9906 Ops/s | 159.6355 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.5786ms | 0.2541ms | 3.9351 KOps/s | 3.4663 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.5822ms | 0.2355ms | 4.2455 KOps/s | 3.7100 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 6.2367ms | 6.0209ms | 166.0877 Ops/s | 163.0772 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.5676ms | 0.2764ms | 3.6182 KOps/s | 2.4458 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 6.7301ms | 0.2404ms | 4.1590 KOps/s | 2.5481 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 6.3949ms | 6.2538ms | 159.9025 Ops/s | 158.0871 Ops/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 2.0567ms | 0.4441ms | 2.2519 KOps/s | 2.4870 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.8122ms | 0.4164ms | 2.4017 KOps/s | 2.6639 KOps/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 0.4040s | 13.3042ms | 75.1640 Ops/s | 186.7863 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 9.3936ms | 2.0071ms | 498.2294 Ops/s | 461.1388 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 7.0521ms | 1.1571ms | 864.2184 Ops/s | 829.4075 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 7.0082ms | 5.3484ms | 186.9731 Ops/s | 188.3212 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 9.5544ms | 2.0333ms | 491.8209 Ops/s | 451.5002 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 6.0986ms | 1.1828ms | 845.4777 Ops/s | 830.6550 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 0.3729s | 12.9092ms | 77.4639 Ops/s | 32.8149 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 6.8601ms | 2.1723ms | 460.3494 Ops/s | 465.1411 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 9.2228ms | 1.3534ms | 738.8611 Ops/s | 743.7961 Ops/s | |
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000-100-True] | 47.4081ms | 45.7946ms | 21.8366 Ops/s | 21.6221 Ops/s | |
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000-100-False] | 10.1031ms | 9.6900ms | 103.1988 Ops/s | 100.8414 Ops/s |
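
The Change column is empty in both tables, so a reader has to compare Ops against Ops on Repo HEAD by hand. The sketch below shows one way to do that, using the `test_simple` row of the first table. It assumes that Ops is the reciprocal of Mean (which the rows appear consistent with) and that the change is measured as a percentage relative to repo HEAD; the helper names are hypothetical and are not part of the benchmark tooling.

```python
def ops_per_second(mean_seconds: float) -> float:
    """Throughput implied by a mean wall-clock time per call (assumed Ops = 1 / Mean)."""
    return 1.0 / mean_seconds


def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percent change of the PR throughput relative to repo HEAD (assumed definition)."""
    return (ops_pr - ops_head) / ops_head * 100.0


# Values copied from the test_simple row of the first table:
# Mean = 0.4092 s, Ops = 2.4440 Ops/s, Ops on Repo HEAD = 2.3117 Ops/s.
implied = ops_per_second(0.4092)
change = relative_change(2.4440, 2.3117)

print(f"implied throughput: {implied:.4f} Ops/s")
print(f"change vs. repo HEAD: {change:+.1f}%")
```

Differences of a few percent are within the run-to-run variation visible elsewhere in the tables, so single-row changes like this one should be read with caution.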
Labels: CLA Signed, enhancement (New feature or request)
Stack from ghstack (oldest at bottom):