[BugFix] Extend RB with lazy stack (revamp) #2454
Merged
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2454
Note: Links to docs will display an error until the docs builds have been completed.
❌ 3 New Failures, 19 Unrelated Failures
As of commit 94d5626 with merge base 1aca00e:
NEW FAILURES - The following jobs have failed:
FLAKY - The following jobs failed but were likely due to flakiness present on trunk:
BROKEN TRUNK - The following jobs failed but were present on the merge base:
👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes.
vmoens added a commit that referenced this pull request Sep 25, 2024
ghstack-source-id: df397d09166d8fb61eceacb5fe8659e0295ca414
Pull Request resolved: #2454
facebook-github-bot added the CLA Signed label Sep 25, 2024
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_single | 59.7008ms | 59.0162ms | 16.9445 Ops/s | 16.8659 Ops/s | |
test_sync | 50.9551ms | 33.5245ms | 29.8289 Ops/s | 30.9393 Ops/s | |
test_async | 60.2890ms | 31.2443ms | 32.0059 Ops/s | 31.9267 Ops/s | |
test_simple | 0.4953s | 0.4242s | 2.3575 Ops/s | 2.4468 Ops/s | |
test_transformed | 0.5593s | 0.5580s | 1.7921 Ops/s | 1.7788 Ops/s | |
test_serial | 1.2679s | 1.2627s | 0.7919 Ops/s | 0.7732 Ops/s | |
test_parallel | 1.2083s | 1.1327s | 0.8829 Ops/s | 0.8875 Ops/s | |
test_step_mdp_speed[True-True-True-True-True] | 0.2043ms | 28.8817μs | 34.6240 KOps/s | 36.4023 KOps/s | |
test_step_mdp_speed[True-True-True-True-False] | 41.6380μs | 16.6464μs | 60.0732 KOps/s | 62.3295 KOps/s | |
test_step_mdp_speed[True-True-True-False-True] | 66.3320μs | 16.1267μs | 62.0089 KOps/s | 63.4118 KOps/s | |
test_step_mdp_speed[True-True-True-False-False] | 34.1640μs | 9.4612μs | 105.6946 KOps/s | 107.4701 KOps/s | |
test_step_mdp_speed[True-True-False-True-True] | 83.3350μs | 30.6884μs | 32.5856 KOps/s | 34.2959 KOps/s | |
test_step_mdp_speed[True-True-False-True-False] | 44.1520μs | 18.4885μs | 54.0877 KOps/s | 56.5557 KOps/s | |
test_step_mdp_speed[True-True-False-False-True] | 52.4080μs | 18.1858μs | 54.9879 KOps/s | 57.8660 KOps/s | |
test_step_mdp_speed[True-True-False-False-False] | 41.8680μs | 11.3398μs | 88.1847 KOps/s | 91.6627 KOps/s | |
test_step_mdp_speed[True-False-True-True-True] | 76.9130μs | 32.1581μs | 31.0964 KOps/s | 32.3224 KOps/s | |
test_step_mdp_speed[True-False-True-True-False] | 67.9380μs | 20.1698μs | 49.5791 KOps/s | 52.1619 KOps/s | |
test_step_mdp_speed[True-False-True-False-True] | 71.4330μs | 17.8931μs | 55.8876 KOps/s | 57.8386 KOps/s | |
test_step_mdp_speed[True-False-True-False-False] | 42.8900μs | 11.2165μs | 89.1547 KOps/s | 92.6168 KOps/s | |
test_step_mdp_speed[True-False-False-True-True] | 79.7990μs | 34.2264μs | 29.2172 KOps/s | 31.1384 KOps/s | |
test_step_mdp_speed[True-False-False-True-False] | 0.1534ms | 22.7357μs | 43.9837 KOps/s | 48.1478 KOps/s | |
test_step_mdp_speed[True-False-False-False-True] | 0.1558ms | 20.4174μs | 48.9778 KOps/s | 53.3092 KOps/s | |
test_step_mdp_speed[True-False-False-False-False] | 46.5270μs | 13.1516μs | 76.0366 KOps/s | 81.0268 KOps/s | |
test_step_mdp_speed[False-True-True-True-True] | 74.9000μs | 32.4615μs | 30.8058 KOps/s | 32.8851 KOps/s | |
test_step_mdp_speed[False-True-True-True-False] | 51.2150μs | 20.3499μs | 49.1403 KOps/s | 51.5348 KOps/s | |
test_step_mdp_speed[False-True-True-False-True] | 58.5900μs | 21.0098μs | 47.5968 KOps/s | 50.5592 KOps/s | |
test_step_mdp_speed[False-True-True-False-False] | 41.1170μs | 12.5358μs | 79.7715 KOps/s | 83.2557 KOps/s | |
test_step_mdp_speed[False-True-False-True-True] | 0.1846ms | 34.1821μs | 29.2551 KOps/s | 31.3256 KOps/s | |
test_step_mdp_speed[False-True-False-True-False] | 68.0670μs | 22.3207μs | 44.8014 KOps/s | 47.6787 KOps/s | |
test_step_mdp_speed[False-True-False-False-True] | 2.7544ms | 22.3982μs | 44.6465 KOps/s | 46.8903 KOps/s | |
test_step_mdp_speed[False-True-False-False-False] | 41.1870μs | 14.1871μs | 70.4865 KOps/s | 73.5938 KOps/s | |
test_step_mdp_speed[False-False-True-True-True] | 76.9930μs | 35.8484μs | 27.8952 KOps/s | 29.5235 KOps/s | |
test_step_mdp_speed[False-False-True-True-False] | 0.1216ms | 23.8869μs | 41.8640 KOps/s | 44.6134 KOps/s | |
test_step_mdp_speed[False-False-True-False-True] | 48.4600μs | 22.3431μs | 44.7565 KOps/s | 47.1149 KOps/s | |
test_step_mdp_speed[False-False-True-False-False] | 40.5660μs | 14.3057μs | 69.9021 KOps/s | 73.9854 KOps/s | |
test_step_mdp_speed[False-False-False-True-True] | 82.4630μs | 37.2956μs | 26.8128 KOps/s | 28.6316 KOps/s | |
test_step_mdp_speed[False-False-False-True-False] | 95.2480μs | 25.1453μs | 39.7689 KOps/s | 42.0943 KOps/s | |
test_step_mdp_speed[False-False-False-False-True] | 58.0990μs | 23.7317μs | 42.1377 KOps/s | 44.4648 KOps/s | |
test_step_mdp_speed[False-False-False-False-False] | 47.9890μs | 15.9290μs | 62.7785 KOps/s | 66.6134 KOps/s | |
test_values[generalized_advantage_estimate-True-True] | 10.5180ms | 9.5965ms | 104.2047 Ops/s | 103.5642 Ops/s | |
test_values[vec_generalized_advantage_estimate-True-True] | 39.1229ms | 35.5597ms | 28.1217 Ops/s | 28.1936 Ops/s | |
test_values[td0_return_estimate-False-False] | 0.2255ms | 0.1654ms | 6.0468 KOps/s | 5.6812 KOps/s | |
test_values[td1_return_estimate-False-False] | 25.9534ms | 23.5516ms | 42.4600 Ops/s | 42.0268 Ops/s | |
test_values[vec_td1_return_estimate-False-False] | 40.8499ms | 35.9967ms | 27.7803 Ops/s | 28.0167 Ops/s | |
test_values[td_lambda_return_estimate-True-False] | 36.1039ms | 34.1112ms | 29.3159 Ops/s | 29.0100 Ops/s | |
test_values[vec_td_lambda_return_estimate-True-False] | 37.9263ms | 35.9013ms | 27.8541 Ops/s | 28.1810 Ops/s | |
test_gae_speed[generalized_advantage_estimate-False-1-512] | 12.2678ms | 8.3839ms | 119.2761 Ops/s | 119.2263 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 2.5708ms | 2.0225ms | 494.4432 Ops/s | 492.8853 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.4166ms | 0.3562ms | 2.8073 KOps/s | 2.8152 KOps/s | |
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 48.1637ms | 46.4082ms | 21.5479 Ops/s | 19.9803 Ops/s | |
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 4.1059ms | 3.0287ms | 330.1788 Ops/s | 330.4802 Ops/s | |
test_dqn_speed[False-None] | 6.3770ms | 1.3175ms | 759.0085 Ops/s | 761.7204 Ops/s | |
test_dqn_speed[False-backward] | 1.8742ms | 1.7881ms | 559.2654 Ops/s | 562.6638 Ops/s | |
test_dqn_speed[True-None] | 0.6517ms | 0.4570ms | 2.1882 KOps/s | 2.1949 KOps/s | |
test_dqn_speed[True-backward] | 0.9377ms | 0.8630ms | 1.1587 KOps/s | 1.1389 KOps/s | |
test_dqn_speed[reduce-overhead-None] | 0.6001ms | 0.4579ms | 2.1839 KOps/s | 2.1754 KOps/s | |
test_dqn_speed[reduce-overhead-backward] | 0.9235ms | 0.8650ms | 1.1560 KOps/s | 1.1520 KOps/s | |
test_ddpg_speed[False-None] | 3.4691ms | 2.7348ms | 365.6627 Ops/s | 363.8044 Ops/s | |
test_ddpg_speed[False-backward] | 4.1752ms | 3.8763ms | 257.9773 Ops/s | 257.3192 Ops/s | |
test_ddpg_speed[True-None] | 1.2307ms | 0.9976ms | 1.0024 KOps/s | 982.4358 Ops/s | |
test_ddpg_speed[True-backward] | 1.9419ms | 1.8706ms | 534.5834 Ops/s | 456.8015 Ops/s | |
test_ddpg_speed[reduce-overhead-None] | 1.3439ms | 0.9853ms | 1.0149 KOps/s | 982.9051 Ops/s | |
test_ddpg_speed[reduce-overhead-backward] | 2.0191ms | 1.8906ms | 528.9437 Ops/s | 529.7543 Ops/s | |
test_sac_speed[False-None] | 9.7064ms | 7.7600ms | 128.8661 Ops/s | 128.1357 Ops/s | |
test_sac_speed[False-backward] | 10.7264ms | 10.3918ms | 96.2294 Ops/s | 94.0116 Ops/s | |
test_sac_speed[True-None] | 2.3036ms | 1.8314ms | 546.0182 Ops/s | 539.9378 Ops/s | |
test_sac_speed[True-backward] | 3.5573ms | 3.4914ms | 286.4140 Ops/s | 273.1227 Ops/s | |
test_sac_speed[reduce-overhead-None] | 2.2958ms | 1.8242ms | 548.1893 Ops/s | 529.5642 Ops/s | |
test_sac_speed[reduce-overhead-backward] | 3.5664ms | 3.4918ms | 286.3885 Ops/s | 278.0525 Ops/s | |
test_redq_speed[False-None] | 14.1767ms | 12.3967ms | 80.6666 Ops/s | 76.1387 Ops/s | |
test_redq_speed[False-backward] | 24.3366ms | 21.8858ms | 45.6916 Ops/s | 44.6358 Ops/s | |
test_redq_speed[True-None] | 5.1236ms | 4.4427ms | 225.0907 Ops/s | 212.7361 Ops/s | |
test_redq_speed[True-backward] | 13.5682ms | 11.7347ms | 85.2175 Ops/s | 78.6893 Ops/s | |
test_redq_speed[reduce-overhead-None] | 5.0364ms | 4.4477ms | 224.8349 Ops/s | 213.0392 Ops/s | |
test_redq_speed[reduce-overhead-backward] | 13.7050ms | 11.7988ms | 84.7544 Ops/s | 82.0781 Ops/s | |
test_redq_deprec_speed[False-None] | 14.2965ms | 12.3491ms | 80.9778 Ops/s | 79.7682 Ops/s | |
test_redq_deprec_speed[False-backward] | 19.5050ms | 18.0120ms | 55.5186 Ops/s | 53.8551 Ops/s | |
test_redq_deprec_speed[True-None] | 4.2065ms | 3.5117ms | 284.7641 Ops/s | 278.6217 Ops/s | |
test_redq_deprec_speed[True-backward] | 8.9014ms | 7.8844ms | 126.8328 Ops/s | 121.7378 Ops/s | |
test_redq_deprec_speed[reduce-overhead-None] | 3.9460ms | 3.5066ms | 285.1773 Ops/s | 276.6437 Ops/s | |
test_redq_deprec_speed[reduce-overhead-backward] | 7.9442ms | 7.8454ms | 127.4626 Ops/s | 122.9452 Ops/s | |
test_td3_speed[False-None] | 32.8361ms | 7.8319ms | 127.6834 Ops/s | 128.2225 Ops/s | |
test_td3_speed[False-backward] | 11.6207ms | 9.9746ms | 100.2551 Ops/s | 97.5034 Ops/s | |
test_td3_speed[True-None] | 2.0948ms | 1.8925ms | 528.3997 Ops/s | 501.5571 Ops/s | |
test_td3_speed[True-backward] | 3.5177ms | 3.4781ms | 287.5171 Ops/s | 275.9944 Ops/s | |
test_td3_speed[reduce-overhead-None] | 2.1102ms | 1.8974ms | 527.0309 Ops/s | 501.6961 Ops/s | |
test_td3_speed[reduce-overhead-backward] | 3.5435ms | 3.4733ms | 287.9105 Ops/s | 279.8449 Ops/s | |
test_cql_speed[False-None] | 37.8800ms | 35.1273ms | 28.4678 Ops/s | 27.6976 Ops/s | |
test_cql_speed[False-backward] | 45.3033ms | 43.8337ms | 22.8135 Ops/s | 21.3169 Ops/s | |
test_cql_speed[True-None] | 16.3302ms | 15.2145ms | 65.7266 Ops/s | 62.7233 Ops/s | |
test_cql_speed[True-backward] | 23.2094ms | 21.5542ms | 46.3946 Ops/s | 42.3446 Ops/s | |
test_cql_speed[reduce-overhead-None] | 18.8605ms | 15.5985ms | 64.1087 Ops/s | 61.7505 Ops/s | |
test_cql_speed[reduce-overhead-backward] | 22.6793ms | 21.4498ms | 46.6206 Ops/s | 45.2072 Ops/s | |
test_a2c_speed[False-None] | 9.1138ms | 7.0078ms | 142.6984 Ops/s | 139.0345 Ops/s | |
test_a2c_speed[False-backward] | 14.9763ms | 13.8517ms | 72.1931 Ops/s | 70.4961 Ops/s | |
test_a2c_speed[True-None] | 3.5670ms | 3.2688ms | 305.9187 Ops/s | 298.1687 Ops/s | |
test_a2c_speed[True-backward] | 10.1304ms | 9.6472ms | 103.6575 Ops/s | 101.7842 Ops/s | |
test_a2c_speed[reduce-overhead-None] | 3.6111ms | 3.2848ms | 304.4354 Ops/s | 300.2094 Ops/s | |
test_a2c_speed[reduce-overhead-backward] | 10.0560ms | 9.6340ms | 103.7995 Ops/s | 99.2324 Ops/s | |
test_ppo_speed[False-None] | 10.0193ms | 7.2986ms | 137.0132 Ops/s | 132.0465 Ops/s | |
test_ppo_speed[False-backward] | 15.7692ms | 14.2016ms | 70.4148 Ops/s | 66.7202 Ops/s | |
test_ppo_speed[True-None] | 4.1647ms | 3.6677ms | 272.6504 Ops/s | 267.3288 Ops/s | |
test_ppo_speed[True-backward] | 9.8184ms | 9.5014ms | 105.2473 Ops/s | 103.9136 Ops/s | |
test_ppo_speed[reduce-overhead-None] | 4.3656ms | 3.6640ms | 272.9255 Ops/s | 264.7567 Ops/s | |
test_ppo_speed[reduce-overhead-backward] | 10.7009ms | 9.5358ms | 104.8683 Ops/s | 103.4278 Ops/s | |
test_reinforce_speed[False-None] | 8.2065ms | 6.3845ms | 156.6287 Ops/s | 154.1112 Ops/s | |
test_reinforce_speed[False-backward] | 10.5049ms | 9.5750ms | 104.4386 Ops/s | 103.1643 Ops/s | |
test_reinforce_speed[True-None] | 3.1674ms | 2.6007ms | 384.5149 Ops/s | 374.4973 Ops/s | |
test_reinforce_speed[True-backward] | 8.8513ms | 8.4804ms | 117.9193 Ops/s | 115.7743 Ops/s | |
test_reinforce_speed[reduce-overhead-None] | 3.0220ms | 2.6061ms | 383.7171 Ops/s | 375.6870 Ops/s | |
test_reinforce_speed[reduce-overhead-backward] | 9.2844ms | 8.5140ms | 117.4534 Ops/s | 116.1830 Ops/s | |
test_iql_speed[False-None] | 32.6484ms | 31.4172ms | 31.8297 Ops/s | 31.9827 Ops/s | |
test_iql_speed[False-backward] | 45.2518ms | 43.7977ms | 22.8322 Ops/s | 22.7793 Ops/s | |
test_iql_speed[True-None] | 15.2359ms | 13.0279ms | 76.7581 Ops/s | 75.1779 Ops/s | |
test_iql_speed[True-backward] | 25.0131ms | 23.7066ms | 42.1824 Ops/s | 40.9696 Ops/s | |
test_iql_speed[reduce-overhead-None] | 14.1194ms | 12.9980ms | 76.9351 Ops/s | 72.3799 Ops/s | |
test_iql_speed[reduce-overhead-backward] | 24.8800ms | 23.6639ms | 42.2585 Ops/s | 40.1365 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 5.3662ms | 4.9981ms | 200.0769 Ops/s | 196.5500 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.1263ms | 0.4723ms | 2.1174 KOps/s | 2.0761 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.6543ms | 0.4461ms | 2.2418 KOps/s | 2.2155 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 7.1788ms | 4.9545ms | 201.8380 Ops/s | 200.8335 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 2.6490ms | 0.4666ms | 2.1433 KOps/s | 2.1227 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6529ms | 0.4426ms | 2.2595 KOps/s | 2.2223 KOps/s | |
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 2.2908ms | 1.5897ms | 629.0635 Ops/s | 619.6046 Ops/s | |
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 2.1048ms | 1.5014ms | 666.0585 Ops/s | 658.6355 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.5653ms | 5.0904ms | 196.4472 Ops/s | 183.0738 Ops/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 2.1770ms | 0.6026ms | 1.6595 KOps/s | 1.6078 KOps/s | |
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.8452ms | 0.5761ms | 1.7357 KOps/s | 1.6838 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 5.3175ms | 5.0013ms | 199.9487 Ops/s | 194.8685 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 2.4039ms | 0.4745ms | 2.1074 KOps/s | 2.0710 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.6265ms | 0.4465ms | 2.2396 KOps/s | 2.1621 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 5.1214ms | 4.8639ms | 205.5975 Ops/s | 196.5083 Ops/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 1.0647ms | 0.4691ms | 2.1317 KOps/s | 2.1144 KOps/s | |
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6736ms | 0.4413ms | 2.2659 KOps/s | 2.2104 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.2305ms | 5.0823ms | 196.7594 Ops/s | 191.0756 Ops/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 2.3155ms | 0.6076ms | 1.6458 KOps/s | 1.6069 KOps/s | |
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.7734ms | 0.5744ms | 1.7411 KOps/s | 1.6971 KOps/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 5.8774ms | 4.1869ms | 238.8411 Ops/s | 231.2569 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 7.8082ms | 2.2320ms | 448.0239 Ops/s | 532.3904 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 1.7339ms | 1.2326ms | 811.2652 Ops/s | 738.9698 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.3481s | 11.0305ms | 90.6578 Ops/s | 238.8695 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 5.9160ms | 2.1681ms | 461.2272 Ops/s | 462.7944 Ops/s | |
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 6.2493ms | 1.4226ms | 702.9195 Ops/s | 775.1260 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 7.8464ms | 4.3686ms | 228.9038 Ops/s | 228.9735 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 3.4092ms | 2.1046ms | 475.1500 Ops/s | 431.2705 Ops/s | |
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 5.9056ms | 1.4571ms | 686.2988 Ops/s | 673.4858 Ops/s |
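The Change column is empty in this capture, so the comparison against Repo HEAD has to be read off the two Ops columns by hand. The sketch below is one way to do that. It assumes the Ops column is the reciprocal of the Mean column (which matches the rows checked here) and defines relative change as (Ops - Ops on HEAD) / Ops on HEAD; that definition is an assumption, not necessarily what the benchmark bot computes. The helper names `ops_per_second` and `relative_change` are hypothetical, and the two rows are copied from the table above.

```python
def ops_per_second(mean_seconds: float) -> float:
    """Throughput implied by a mean runtime, assuming Ops = 1 / Mean."""
    return 1.0 / mean_seconds


def relative_change(ops: float, ops_head: float) -> float:
    """Signed fractional change of this PR's Ops versus Repo HEAD (assumed definition)."""
    return (ops - ops_head) / ops_head


# (mean in seconds, Ops, Ops on Repo HEAD), copied from the table above.
rows = {
    "test_single": (59.0162e-3, 16.9445, 16.8659),
    "test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400]": (
        11.0305e-3,
        90.6578,
        238.8695,
    ),
}

for name, (mean_s, ops, ops_head) in rows.items():
    implied = ops_per_second(mean_s)
    change = relative_change(ops, ops_head)
    print(f"{name}: implied {implied:.4f} Ops/s, change vs HEAD {change:+.2%}")
```

On these two rows the second shows a large drop relative to HEAD, but its Max (0.3481s) is far above its Mean (11.0305ms), so a single slow iteration may account for much of the difference.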
Stack from ghstack (oldest at bottom):