Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Doc] export tutorial, TDM tuto refactoring #994

Merged
merged 15 commits into from
Sep 19, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Sep 16, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 16, 2024
ghstack-source-id: fde5222d755a8c67692505039c97e3bed196989a
Pull Request resolved: #994
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 16, 2024
Copy link

github-actions bot commented Sep 16, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}33$. Worsened: $\large\color{#d91a1a}10$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 47.1780μs 20.1839μs 49.5445 KOps/s 48.1336 KOps/s $\color{#35bf28}+2.93\%$
test_plain_set_stack_nested 56.4250μs 20.8835μs 47.8846 KOps/s 47.8749 KOps/s $\color{#35bf28}+0.02\%$
test_plain_set_nested_inplace 0.2372ms 21.9339μs 45.5916 KOps/s 44.5128 KOps/s $\color{#35bf28}+2.42\%$
test_plain_set_stack_nested_inplace 61.1840μs 21.6992μs 46.0847 KOps/s 44.4508 KOps/s $\color{#35bf28}+3.68\%$
test_items 56.5960μs 4.1328μs 241.9679 KOps/s 236.8258 KOps/s $\color{#35bf28}+2.17\%$
test_items_nested 0.6649ms 0.3652ms 2.7382 KOps/s 2.7354 KOps/s $\color{#35bf28}+0.10\%$
test_items_nested_locked 0.6082ms 0.3642ms 2.7460 KOps/s 2.7484 KOps/s $\color{#d91a1a}-0.09\%$
test_items_nested_leaf 0.1210ms 68.7985μs 14.5352 KOps/s 14.5902 KOps/s $\color{#d91a1a}-0.38\%$
test_items_stack_nested 0.8487ms 0.3740ms 2.6741 KOps/s 2.7066 KOps/s $\color{#d91a1a}-1.20\%$
test_items_stack_nested_leaf 0.1429ms 71.0040μs 14.0837 KOps/s 14.0219 KOps/s $\color{#35bf28}+0.44\%$
test_items_stack_nested_locked 0.5709ms 0.3676ms 2.7201 KOps/s 2.6767 KOps/s $\color{#35bf28}+1.62\%$
test_keys 23.2830μs 3.5058μs 285.2389 KOps/s 285.0128 KOps/s $\color{#35bf28}+0.08\%$
test_keys_nested 0.2565ms 99.2490μs 10.0757 KOps/s 9.7935 KOps/s $\color{#35bf28}+2.88\%$
test_keys_nested_locked 0.7969ms 0.1036ms 9.6489 KOps/s 9.4849 KOps/s $\color{#35bf28}+1.73\%$
test_keys_nested_leaf 0.2930ms 82.2314μs 12.1608 KOps/s 12.0193 KOps/s $\color{#35bf28}+1.18\%$
test_keys_stack_nested 0.2261ms 99.2600μs 10.0745 KOps/s 9.9095 KOps/s $\color{#35bf28}+1.67\%$
test_keys_stack_nested_leaf 0.1348ms 82.2635μs 12.1561 KOps/s 11.9812 KOps/s $\color{#35bf28}+1.46\%$
test_keys_stack_nested_locked 0.2230ms 0.1035ms 9.6657 KOps/s 9.4801 KOps/s $\color{#35bf28}+1.96\%$
test_values 12.6674μs 1.0417μs 959.9691 KOps/s 950.9609 KOps/s $\color{#35bf28}+0.95\%$
test_values_nested 0.1895ms 75.7739μs 13.1972 KOps/s 13.4267 KOps/s $\color{#d91a1a}-1.71\%$
test_values_nested_locked 0.2406ms 75.2877μs 13.2824 KOps/s 13.4175 KOps/s $\color{#d91a1a}-1.01\%$
test_values_nested_leaf 0.1253ms 62.1106μs 16.1003 KOps/s 15.7662 KOps/s $\color{#35bf28}+2.12\%$
test_values_stack_nested 0.2305ms 77.1496μs 12.9618 KOps/s 12.9030 KOps/s $\color{#35bf28}+0.46\%$
test_values_stack_nested_leaf 0.1142ms 60.8644μs 16.4300 KOps/s 15.9352 KOps/s $\color{#35bf28}+3.10\%$
test_values_stack_nested_locked 0.2370ms 76.4645μs 13.0780 KOps/s 13.1891 KOps/s $\color{#d91a1a}-0.84\%$
test_membership 14.6143μs 0.7863μs 1.2719 MOps/s 1.3952 MOps/s $\textbf{\color{#d91a1a}-8.84\%}$
test_membership_nested 63.7980μs 2.7652μs 361.6379 KOps/s 363.5419 KOps/s $\color{#d91a1a}-0.52\%$
test_membership_nested_leaf 41.9080μs 2.7436μs 364.4894 KOps/s 364.6715 KOps/s $\color{#d91a1a}-0.05\%$
test_membership_stacked_nested 24.3160μs 2.7516μs 363.4216 KOps/s 363.7844 KOps/s $\color{#d91a1a}-0.10\%$
test_membership_stacked_nested_leaf 90.2280μs 2.7780μs 359.9758 KOps/s 361.0258 KOps/s $\color{#d91a1a}-0.29\%$
test_membership_nested_last 64.5500μs 3.9592μs 252.5741 KOps/s 250.2351 KOps/s $\color{#35bf28}+0.93\%$
test_membership_nested_leaf_last 41.2770μs 3.9653μs 252.1848 KOps/s 252.0457 KOps/s $\color{#35bf28}+0.06\%$
test_membership_stacked_nested_last 26.2090μs 3.9701μs 251.8841 KOps/s 218.2635 KOps/s $\textbf{\color{#35bf28}+15.40\%}$
test_membership_stacked_nested_leaf_last 81.1510μs 3.9837μs 251.0252 KOps/s 219.3991 KOps/s $\textbf{\color{#35bf28}+14.41\%}$
test_nested_getleaf 59.6200μs 10.7973μs 92.6155 KOps/s 90.8607 KOps/s $\color{#35bf28}+1.93\%$
test_nested_get 0.1078ms 10.3296μs 96.8095 KOps/s 97.5709 KOps/s $\color{#d91a1a}-0.78\%$
test_stacked_getleaf 69.7990μs 10.8691μs 92.0038 KOps/s 93.5081 KOps/s $\color{#d91a1a}-1.61\%$
test_stacked_get 37.7210μs 10.3032μs 97.0569 KOps/s 96.2052 KOps/s $\color{#35bf28}+0.89\%$
test_nested_getitemleaf 0.1094ms 11.0784μs 90.2660 KOps/s 85.4038 KOps/s $\textbf{\color{#35bf28}+5.69\%}$
test_nested_getitem 53.3690μs 10.4384μs 95.8001 KOps/s 93.2110 KOps/s $\color{#35bf28}+2.78\%$
test_stacked_getitemleaf 68.8370μs 11.0044μs 90.8730 KOps/s 88.3722 KOps/s $\color{#35bf28}+2.83\%$
test_stacked_getitem 72.7550μs 10.4881μs 95.3458 KOps/s 93.4772 KOps/s $\color{#35bf28}+2.00\%$
test_lock_nested 88.5799ms 0.5828ms 1.7159 KOps/s 2.0305 KOps/s $\textbf{\color{#d91a1a}-15.49\%}$
test_lock_stack_nested 0.6845ms 0.4536ms 2.2047 KOps/s 2.1638 KOps/s $\color{#35bf28}+1.89\%$
test_unlock_nested 0.1023s 0.5089ms 1.9650 KOps/s 2.3658 KOps/s $\textbf{\color{#d91a1a}-16.94\%}$
test_unlock_stack_nested 0.6043ms 0.3704ms 2.6997 KOps/s 2.5892 KOps/s $\color{#35bf28}+4.27\%$
test_flatten_speed 0.2207ms 87.1248μs 11.4778 KOps/s 11.4020 KOps/s $\color{#35bf28}+0.66\%$
test_unflatten_speed 0.6266ms 0.4630ms 2.1598 KOps/s 2.1403 KOps/s $\color{#35bf28}+0.91\%$
test_common_ops 4.6306ms 1.1123ms 899.0377 Ops/s 876.3026 Ops/s $\color{#35bf28}+2.59\%$
test_creation 41.4540μs 2.0269μs 493.3636 KOps/s 479.3630 KOps/s $\color{#35bf28}+2.92\%$
test_creation_empty 52.9990μs 17.7137μs 56.4534 KOps/s 55.4105 KOps/s $\color{#35bf28}+1.88\%$
test_creation_nested_1 50.1430μs 20.8669μs 47.9229 KOps/s 46.9569 KOps/s $\color{#35bf28}+2.06\%$
test_creation_nested_2 0.1252ms 25.4746μs 39.2548 KOps/s 39.1710 KOps/s $\color{#35bf28}+0.21\%$
test_clone 79.4070μs 16.7330μs 59.7622 KOps/s 58.5851 KOps/s $\color{#35bf28}+2.01\%$
test_getitem[int] 1.1261ms 16.6616μs 60.0184 KOps/s 58.2619 KOps/s $\color{#35bf28}+3.01\%$
test_getitem[slice_int] 0.1470ms 30.3298μs 32.9708 KOps/s 31.0043 KOps/s $\textbf{\color{#35bf28}+6.34\%}$
test_getitem[range] 0.1878ms 57.5869μs 17.3651 KOps/s 17.0547 KOps/s $\color{#35bf28}+1.82\%$
test_getitem[tuple] 0.1372ms 24.8797μs 40.1934 KOps/s 38.7626 KOps/s $\color{#35bf28}+3.69\%$
test_getitem[list] 0.5545ms 53.3289μs 18.7516 KOps/s 18.6444 KOps/s $\color{#35bf28}+0.57\%$
test_setitem_dim[int] 89.9770μs 33.3877μs 29.9511 KOps/s 29.2924 KOps/s $\color{#35bf28}+2.25\%$
test_setitem_dim[slice_int] 0.1203ms 61.4348μs 16.2774 KOps/s 15.8644 KOps/s $\color{#35bf28}+2.60\%$
test_setitem_dim[range] 0.1250ms 83.1913μs 12.0205 KOps/s 11.5643 KOps/s $\color{#35bf28}+3.94\%$
test_setitem_dim[tuple] 0.1195ms 49.2223μs 20.3160 KOps/s 19.8402 KOps/s $\color{#35bf28}+2.40\%$
test_setitem 74.9200μs 29.7005μs 33.6694 KOps/s 32.6934 KOps/s $\color{#35bf28}+2.99\%$
test_set 0.1377ms 29.1127μs 34.3492 KOps/s 32.5075 KOps/s $\textbf{\color{#35bf28}+5.67\%}$
test_set_shared 3.4273ms 0.2147ms 4.6580 KOps/s 4.6656 KOps/s $\color{#d91a1a}-0.16\%$
test_update 0.1745ms 36.0953μs 27.7044 KOps/s 26.2016 KOps/s $\textbf{\color{#35bf28}+5.74\%}$
test_update_nested 1.0830ms 47.1589μs 21.2049 KOps/s 20.4792 KOps/s $\color{#35bf28}+3.54\%$
test_update__nested 0.1593ms 34.7981μs 28.7372 KOps/s 28.2065 KOps/s $\color{#35bf28}+1.88\%$
test_set_nested 76.5620μs 31.4084μs 31.8386 KOps/s 30.6599 KOps/s $\color{#35bf28}+3.84\%$
test_set_nested_new 0.3021ms 36.7548μs 27.2073 KOps/s 26.5279 KOps/s $\color{#35bf28}+2.56\%$
test_select 0.2691ms 54.0335μs 18.5070 KOps/s 18.2926 KOps/s $\color{#35bf28}+1.17\%$
test_select_nested 0.1322ms 59.9112μs 16.6914 KOps/s 16.9941 KOps/s $\color{#d91a1a}-1.78\%$
test_exclude_nested 0.1988ms 74.0048μs 13.5126 KOps/s 13.6148 KOps/s $\color{#d91a1a}-0.75\%$
test_empty[True] 0.4900ms 0.3129ms 3.1962 KOps/s 3.1612 KOps/s $\color{#35bf28}+1.11\%$
test_empty[False] 27.0127μs 1.2064μs 828.9128 KOps/s 802.1806 KOps/s $\color{#35bf28}+3.33\%$
test_unbind_speed 0.4180ms 0.2976ms 3.3599 KOps/s 3.2797 KOps/s $\color{#35bf28}+2.44\%$
test_unbind_speed_stack0 0.5167ms 0.2931ms 3.4119 KOps/s 3.3457 KOps/s $\color{#35bf28}+1.98\%$
test_unbind_speed_stack1 0.1014s 0.8095ms 1.2353 KOps/s 1.3310 KOps/s $\textbf{\color{#d91a1a}-7.19\%}$
test_split 92.7769ms 2.1778ms 459.1873 Ops/s 451.8677 Ops/s $\color{#35bf28}+1.62\%$
test_chunk 2.2860ms 1.9933ms 501.6773 Ops/s 447.9348 Ops/s $\textbf{\color{#35bf28}+12.00\%}$
test_creation[device0] 4.3978ms 0.1195ms 8.3711 KOps/s 8.2206 KOps/s $\color{#35bf28}+1.83\%$
test_creation_from_tensor 0.2358ms 0.1161ms 8.6158 KOps/s 8.4689 KOps/s $\color{#35bf28}+1.73\%$
test_add_one[memmap_tensor0] 0.2732ms 7.3846μs 135.4173 KOps/s 134.1461 KOps/s $\color{#35bf28}+0.95\%$
test_contiguous[memmap_tensor0] 20.6080μs 1.9294μs 518.3042 KOps/s 506.8171 KOps/s $\color{#35bf28}+2.27\%$
test_stack[memmap_tensor0] 85.3890μs 5.5705μs 179.5156 KOps/s 179.8203 KOps/s $\color{#d91a1a}-0.17\%$
test_memmaptd_index 1.1254ms 0.3972ms 2.5173 KOps/s 2.5035 KOps/s $\color{#35bf28}+0.55\%$
test_memmaptd_index_astensor 0.9650ms 0.4738ms 2.1105 KOps/s 2.0788 KOps/s $\color{#35bf28}+1.52\%$
test_memmaptd_index_op 1.9690ms 0.9953ms 1.0047 KOps/s 968.4167 Ops/s $\color{#35bf28}+3.74\%$
test_serialize_model 0.2195s 0.1338s 7.4728 Ops/s 8.4295 Ops/s $\textbf{\color{#d91a1a}-11.35\%}$
test_serialize_model_pickle 0.4467s 0.3861s 2.5900 Ops/s 2.5283 Ops/s $\color{#35bf28}+2.44\%$
test_serialize_weights 0.1301s 0.1187s 8.4211 Ops/s 7.6851 Ops/s $\textbf{\color{#35bf28}+9.58\%}$
test_serialize_weights_returnearly 0.1752s 0.1645s 6.0781 Ops/s 6.4299 Ops/s $\textbf{\color{#d91a1a}-5.47\%}$
test_serialize_weights_pickle 1.0555s 0.6840s 1.4619 Ops/s 1.0954 Ops/s $\textbf{\color{#35bf28}+33.46\%}$
test_serialize_weights_filesystem 0.1449s 0.1397s 7.1591 Ops/s 6.7966 Ops/s $\textbf{\color{#35bf28}+5.33\%}$
test_serialize_model_filesystem 0.1477s 0.1436s 6.9632 Ops/s 6.4427 Ops/s $\textbf{\color{#35bf28}+8.08\%}$
test_reshape_pytree 0.1122ms 38.4097μs 26.0351 KOps/s 25.4902 KOps/s $\color{#35bf28}+2.14\%$
test_reshape_td 0.1087ms 45.0555μs 22.1949 KOps/s 20.1025 KOps/s $\textbf{\color{#35bf28}+10.41\%}$
test_view_pytree 90.4580μs 38.7050μs 25.8365 KOps/s 25.5862 KOps/s $\color{#35bf28}+0.98\%$
test_view_td 0.1568ms 51.8759μs 19.2768 KOps/s 17.6856 KOps/s $\textbf{\color{#35bf28}+9.00\%}$
test_unbind_pytree 0.1494ms 35.7792μs 27.9492 KOps/s 27.5685 KOps/s $\color{#35bf28}+1.38\%$
test_unbind_td 0.3173ms 44.4976μs 22.4731 KOps/s 21.1235 KOps/s $\textbf{\color{#35bf28}+6.39\%}$
test_split_pytree 0.2206ms 37.5565μs 26.6265 KOps/s 25.9082 KOps/s $\color{#35bf28}+2.77\%$
test_split_td 0.5573ms 61.0178μs 16.3887 KOps/s 17.0034 KOps/s $\color{#d91a1a}-3.62\%$
test_add_pytree 0.1410ms 45.6707μs 21.8959 KOps/s 21.4136 KOps/s $\color{#35bf28}+2.25\%$
test_add_td 0.2485ms 81.6390μs 12.2490 KOps/s 11.6876 KOps/s $\color{#35bf28}+4.80\%$
test_compile_add_one_nested[tensordict-compile] 0.1619ms 55.7527μs 17.9364 KOps/s 17.4042 KOps/s $\color{#35bf28}+3.06\%$
test_compile_add_one_nested[tensordict-eager] 0.4106ms 0.1759ms 5.6864 KOps/s 5.5468 KOps/s $\color{#35bf28}+2.52\%$
test_compile_add_one_nested[pytree-compile] 0.1548ms 55.4031μs 18.0495 KOps/s 16.9875 KOps/s $\textbf{\color{#35bf28}+6.25\%}$
test_compile_add_one_nested[pytree-eager] 0.4324ms 0.1417ms 7.0549 KOps/s 7.0361 KOps/s $\color{#35bf28}+0.27\%$
test_compile_copy_nested[tensordict-compile] 0.1116ms 21.4687μs 46.5795 KOps/s 46.8156 KOps/s $\color{#d91a1a}-0.50\%$
test_compile_copy_nested[tensordict-eager] 0.1802ms 67.4325μs 14.8296 KOps/s 14.8395 KOps/s $\color{#d91a1a}-0.07\%$
test_compile_copy_nested[pytree-compile] 0.2230ms 79.3341μs 12.6049 KOps/s 13.1865 KOps/s $\color{#d91a1a}-4.41\%$
test_compile_copy_nested[pytree-eager] 0.1612ms 70.6138μs 14.1615 KOps/s 14.6899 KOps/s $\color{#d91a1a}-3.60\%$
test_compile_add_one_flat[tensordict-compile] 0.3803ms 0.1689ms 5.9190 KOps/s 5.7034 KOps/s $\color{#35bf28}+3.78\%$
test_compile_add_one_flat[tensordict-eager] 0.3866ms 0.1898ms 5.2699 KOps/s 5.2043 KOps/s $\color{#35bf28}+1.26\%$
test_compile_add_one_flat[tensorclass-compile] 0.1270ms 45.1218μs 22.1623 KOps/s 21.0780 KOps/s $\textbf{\color{#35bf28}+5.14\%}$
test_compile_add_one_flat[tensorclass-eager] 0.1761ms 68.7274μs 14.5502 KOps/s 14.2681 KOps/s $\color{#35bf28}+1.98\%$
test_compile_add_one_flat[pytree-compile] 0.3969ms 0.1736ms 5.7615 KOps/s 5.5451 KOps/s $\color{#35bf28}+3.90\%$
test_compile_add_one_flat[pytree-eager] 0.5876ms 0.2827ms 3.5371 KOps/s 3.4441 KOps/s $\color{#35bf28}+2.70\%$
test_compile_add_self_flat[tensordict-eager] 0.4001ms 0.2041ms 4.9006 KOps/s 4.8175 KOps/s $\color{#35bf28}+1.73\%$
test_compile_add_self_flat[tensordict-compile] 0.4545ms 0.1729ms 5.7833 KOps/s 5.7220 KOps/s $\color{#35bf28}+1.07\%$
test_compile_add_self_flat[tensorclass-eager] 0.1973ms 62.6126μs 15.9712 KOps/s 15.8533 KOps/s $\color{#35bf28}+0.74\%$
test_compile_add_self_flat[tensorclass-compile] 0.1017ms 46.2473μs 21.6229 KOps/s 20.7726 KOps/s $\color{#35bf28}+4.09\%$
test_compile_add_self_flat[pytree-eager] 0.4936ms 0.2349ms 4.2573 KOps/s 4.3249 KOps/s $\color{#d91a1a}-1.56\%$
test_compile_add_self_flat[pytree-compile] 0.4359ms 0.1757ms 5.6907 KOps/s 5.5702 KOps/s $\color{#35bf28}+2.16\%$
test_compile_copy_flat[tensordict-compile] 0.2550ms 0.1035ms 9.6619 KOps/s 9.5444 KOps/s $\color{#35bf28}+1.23\%$
test_compile_copy_flat[tensordict-eager] 0.1281ms 56.6205μs 17.6615 KOps/s 17.5741 KOps/s $\color{#35bf28}+0.50\%$
test_compile_copy_flat[pytree-compile] 0.1445ms 78.2263μs 12.7834 KOps/s 12.9312 KOps/s $\color{#d91a1a}-1.14\%$
test_compile_copy_flat[pytree-eager] 0.1959ms 69.9309μs 14.2998 KOps/s 14.4200 KOps/s $\color{#d91a1a}-0.83\%$
test_compile_assign_and_add[tensordict-compile] 0.4548ms 0.1997ms 5.0080 KOps/s 4.9898 KOps/s $\color{#35bf28}+0.36\%$
test_compile_assign_and_add[tensordict-eager] 2.1384ms 1.6298ms 613.5593 Ops/s 593.9931 Ops/s $\color{#35bf28}+3.29\%$
test_compile_assign_and_add[pytree-compile] 0.5247ms 0.1927ms 5.1891 KOps/s 5.1061 KOps/s $\color{#35bf28}+1.63\%$
test_compile_assign_and_add[pytree-eager] 1.2654ms 1.0788ms 926.9460 Ops/s 901.8481 Ops/s $\color{#35bf28}+2.78\%$
test_compile_assign_and_add_stack[compile] 0.8537ms 0.4252ms 2.3521 KOps/s 2.3192 KOps/s $\color{#35bf28}+1.42\%$
test_compile_assign_and_add_stack[eager] 5.8678ms 3.8320ms 260.9625 Ops/s 255.1261 Ops/s $\color{#35bf28}+2.29\%$
test_compile_indexing[tensor-tensordict-compile] 0.1500ms 33.5975μs 29.7641 KOps/s 27.9056 KOps/s $\textbf{\color{#35bf28}+6.66\%}$
test_compile_indexing[tensor-tensordict-eager] 1.2333ms 49.0106μs 20.4038 KOps/s 20.1430 KOps/s $\color{#35bf28}+1.29\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1252ms 28.3746μs 35.2428 KOps/s 31.9796 KOps/s $\textbf{\color{#35bf28}+10.20\%}$
test_compile_indexing[tensor-tensorclass-eager] 0.1137ms 29.5441μs 33.8477 KOps/s 34.4227 KOps/s $\color{#d91a1a}-1.67\%$
test_compile_indexing[tensor-pytree-compile] 0.2393ms 28.0149μs 35.6953 KOps/s 32.1458 KOps/s $\textbf{\color{#35bf28}+11.04\%}$
test_compile_indexing[tensor-pytree-eager] 0.1003ms 29.1846μs 34.2646 KOps/s 34.2730 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_indexing[slice-tensordict-compile] 0.2055ms 72.7787μs 13.7403 KOps/s 13.0934 KOps/s $\color{#35bf28}+4.94\%$
test_compile_indexing[slice-tensordict-eager] 0.5836ms 27.5311μs 36.3226 KOps/s 34.0909 KOps/s $\textbf{\color{#35bf28}+6.55\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1378ms 66.6446μs 15.0050 KOps/s 14.2509 KOps/s $\textbf{\color{#35bf28}+5.29\%}$
test_compile_indexing[slice-tensorclass-eager] 0.1202ms 23.4560μs 42.6331 KOps/s 42.1291 KOps/s $\color{#35bf28}+1.20\%$
test_compile_indexing[slice-pytree-compile] 0.1590ms 66.6854μs 14.9958 KOps/s 14.3513 KOps/s $\color{#35bf28}+4.49\%$
test_compile_indexing[slice-pytree-eager] 0.1209ms 23.2014μs 43.1008 KOps/s 42.3048 KOps/s $\color{#35bf28}+1.88\%$
test_compile_indexing[int-tensordict-compile] 0.1808ms 73.2386μs 13.6540 KOps/s 13.1596 KOps/s $\color{#35bf28}+3.76\%$
test_compile_indexing[int-tensordict-eager] 0.9827ms 27.4056μs 36.4889 KOps/s 34.7475 KOps/s $\textbf{\color{#35bf28}+5.01\%}$
test_compile_indexing[int-tensorclass-compile] 0.1795ms 66.4717μs 15.0440 KOps/s 14.3881 KOps/s $\color{#35bf28}+4.56\%$
test_compile_indexing[int-tensorclass-eager] 0.3214ms 22.8076μs 43.8449 KOps/s 42.6742 KOps/s $\color{#35bf28}+2.74\%$
test_compile_indexing[int-pytree-compile] 0.2163ms 65.6613μs 15.2297 KOps/s 14.3950 KOps/s $\textbf{\color{#35bf28}+5.80\%}$
test_compile_indexing[int-pytree-eager] 84.0460μs 22.9850μs 43.5067 KOps/s 42.9722 KOps/s $\color{#35bf28}+1.24\%$
test_mod_add[eager] 0.1114ms 25.5790μs 39.0946 KOps/s 37.5500 KOps/s $\color{#35bf28}+4.11\%$
test_mod_add[compile] 0.1116ms 37.9702μs 26.3364 KOps/s 24.9118 KOps/s $\textbf{\color{#35bf28}+5.72\%}$
test_mod_add[compile-overhead] 88.0740μs 38.9795μs 25.6545 KOps/s 24.3228 KOps/s $\textbf{\color{#35bf28}+5.48\%}$
test_mod_wrap[eager] 0.4366ms 0.2070ms 4.8304 KOps/s 4.7220 KOps/s $\color{#35bf28}+2.29\%$
test_mod_wrap[compile] 0.4327ms 0.2266ms 4.4136 KOps/s 4.1713 KOps/s $\textbf{\color{#35bf28}+5.81\%}$
test_mod_wrap[compile-overhead] 0.4101ms 0.2264ms 4.4169 KOps/s 4.2508 KOps/s $\color{#35bf28}+3.91\%$
test_mod_wrap_and_backward[eager] 13.9729ms 11.5002ms 86.9552 Ops/s 89.6277 Ops/s $\color{#d91a1a}-2.98\%$
test_mod_wrap_and_backward[compile] 16.8575ms 11.7360ms 85.2080 Ops/s 86.4131 Ops/s $\color{#d91a1a}-1.39\%$
test_mod_wrap_and_backward[compile-overhead] 13.7560ms 11.7254ms 85.2848 Ops/s 88.7364 Ops/s $\color{#d91a1a}-3.89\%$
test_seq_add[eager] 0.2588ms 89.9307μs 11.1197 KOps/s 10.5752 KOps/s $\textbf{\color{#35bf28}+5.15\%}$
test_seq_add[compile] 0.1159ms 63.6952μs 15.6998 KOps/s 15.0978 KOps/s $\color{#35bf28}+3.99\%$
test_seq_add[compile-overhead] 0.1617ms 61.7851μs 16.1851 KOps/s 15.5010 KOps/s $\color{#35bf28}+4.41\%$
test_seq_wrap[eager] 0.7516ms 0.3796ms 2.6344 KOps/s 2.4968 KOps/s $\textbf{\color{#35bf28}+5.51\%}$
test_seq_wrap[compile] 1.2768ms 0.2786ms 3.5898 KOps/s 3.5509 KOps/s $\color{#35bf28}+1.10\%$
test_seq_wrap[compile-overhead] 1.3744ms 0.2669ms 3.7467 KOps/s 3.6263 KOps/s $\color{#35bf28}+3.32\%$
test_func_call_runtime[False-eager] 0.9317ms 0.5045ms 1.9820 KOps/s 1.8817 KOps/s $\textbf{\color{#35bf28}+5.33\%}$
test_func_call_runtime[False-compile] 1.0347ms 0.5049ms 1.9805 KOps/s 1.9595 KOps/s $\color{#35bf28}+1.07\%$
test_func_call_runtime[False-compile-overhead] 0.7018ms 0.5028ms 1.9890 KOps/s 1.9659 KOps/s $\color{#35bf28}+1.18\%$
test_func_call_runtime[True-eager] 1.2653ms 0.7205ms 1.3879 KOps/s 1.3315 KOps/s $\color{#35bf28}+4.24\%$
test_func_call_runtime[True-compile] 0.7813ms 0.5108ms 1.9576 KOps/s 1.9115 KOps/s $\color{#35bf28}+2.41\%$
test_func_call_runtime[True-compile-overhead] 0.7480ms 0.5175ms 1.9322 KOps/s 1.9354 KOps/s $\color{#d91a1a}-0.17\%$
test_func_call_cm_runtime[False-eager] 1.0036ms 0.5026ms 1.9897 KOps/s 1.8871 KOps/s $\textbf{\color{#35bf28}+5.44\%}$
test_func_call_cm_runtime[False-compile] 1.0058ms 0.5114ms 1.9556 KOps/s 1.9694 KOps/s $\color{#d91a1a}-0.70\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6731ms 0.5083ms 1.9673 KOps/s 1.9632 KOps/s $\color{#35bf28}+0.21\%$
test_func_call_cm_runtime[True-eager] 1.3392ms 0.8689ms 1.1508 KOps/s 1.1377 KOps/s $\color{#35bf28}+1.16\%$
test_func_call_cm_runtime[True-compile] 1.1199ms 0.7196ms 1.3896 KOps/s 1.3283 KOps/s $\color{#35bf28}+4.62\%$
test_func_call_cm_runtime[True-compile-overhead] 1.2101ms 0.7262ms 1.3770 KOps/s 1.3209 KOps/s $\color{#35bf28}+4.25\%$
test_vmap_func_call_cm_runtime[eager] 2.5749ms 1.8850ms 530.5072 Ops/s 532.5123 Ops/s $\color{#d91a1a}-0.38\%$
test_vmap_func_call_cm_runtime[compile] 2.7023ms 1.9396ms 515.5669 Ops/s 517.4172 Ops/s $\color{#d91a1a}-0.36\%$
test_vmap_func_call_cm_runtime[compile-overhead] 2.7665ms 1.9344ms 516.9478 Ops/s 516.8820 Ops/s $\color{#35bf28}+0.01\%$
test_distributed 0.2898ms 0.1242ms 8.0547 KOps/s 7.8655 KOps/s $\color{#35bf28}+2.41\%$
test_tdmodule 34.7450μs 18.6118μs 53.7294 KOps/s 54.3643 KOps/s $\color{#d91a1a}-1.17\%$
test_tdmodule_dispatch 57.8670μs 37.0875μs 26.9633 KOps/s 27.4414 KOps/s $\color{#d91a1a}-1.74\%$
test_tdseq 52.2470μs 21.0814μs 47.4351 KOps/s 47.4334 KOps/s $+0.00\%$
test_tdseq_dispatch 64.5000μs 42.3849μs 23.5933 KOps/s 23.8070 KOps/s $\color{#d91a1a}-0.90\%$
test_instantiation_functorch 1.7723ms 1.6024ms 624.0803 Ops/s 620.0295 Ops/s $\color{#35bf28}+0.65\%$
test_instantiation_td 1.8551ms 1.1797ms 847.6682 Ops/s 839.5402 Ops/s $\color{#35bf28}+0.97\%$
test_exec_functorch 0.2893ms 0.1819ms 5.4971 KOps/s 5.3999 KOps/s $\color{#35bf28}+1.80\%$
test_exec_functional_call 0.3634ms 0.1673ms 5.9774 KOps/s 5.6808 KOps/s $\textbf{\color{#35bf28}+5.22\%}$
test_exec_td 0.2650ms 0.1661ms 6.0200 KOps/s 5.7568 KOps/s $\color{#35bf28}+4.57\%$
test_exec_td_decorator 1.2056ms 0.2178ms 4.5918 KOps/s 4.4143 KOps/s $\color{#35bf28}+4.02\%$
test_vmap_mlp_speed[True-True] 1.0851ms 0.6534ms 1.5305 KOps/s 1.5363 KOps/s $\color{#d91a1a}-0.38\%$
test_vmap_mlp_speed[True-False] 0.9940ms 0.6501ms 1.5382 KOps/s 1.5404 KOps/s $\color{#d91a1a}-0.15\%$
test_vmap_mlp_speed[False-True] 0.9147ms 0.4988ms 2.0048 KOps/s 2.0057 KOps/s $\color{#d91a1a}-0.04\%$
test_vmap_mlp_speed[False-False] 0.7855ms 0.4988ms 2.0048 KOps/s 1.9978 KOps/s $\color{#35bf28}+0.35\%$
test_vmap_mlp_speed_decorator[True-True] 1.0268ms 0.6299ms 1.5875 KOps/s 1.5796 KOps/s $\color{#35bf28}+0.50\%$
test_vmap_mlp_speed_decorator[True-False] 0.9259ms 0.6376ms 1.5683 KOps/s 1.5782 KOps/s $\color{#d91a1a}-0.63\%$
test_vmap_mlp_speed_decorator[False-True] 0.8945ms 0.5181ms 1.9301 KOps/s 1.9264 KOps/s $\color{#35bf28}+0.19\%$
test_vmap_mlp_speed_decorator[False-False] 0.8422ms 0.5229ms 1.9125 KOps/s 1.9284 KOps/s $\color{#d91a1a}-0.82\%$
test_to_module_speed[True] 1.5847ms 1.3364ms 748.2679 Ops/s 768.7079 Ops/s $\color{#d91a1a}-2.66\%$
test_to_module_speed[False] 1.8768ms 1.3122ms 762.0712 Ops/s 790.0530 Ops/s $\color{#d91a1a}-3.54\%$
test_tc_init 0.1117ms 44.2332μs 22.6075 KOps/s 22.4207 KOps/s $\color{#35bf28}+0.83\%$
test_tc_init_nested 0.2190ms 88.6583μs 11.2793 KOps/s 10.9260 KOps/s $\color{#35bf28}+3.23\%$
test_tc_first_layer_tensor 82.8250μs 1.5303μs 653.4634 KOps/s 646.4972 KOps/s $\color{#35bf28}+1.08\%$
test_tc_first_layer_nontensor 43.5700μs 4.7226μs 211.7495 KOps/s 210.9379 KOps/s $\color{#35bf28}+0.38\%$
test_tc_second_layer_tensor 23.7640μs 2.8656μs 348.9650 KOps/s 349.4046 KOps/s $\color{#d91a1a}-0.13\%$
test_tc_second_layer_nontensor 0.1157ms 6.0303μs 165.8285 KOps/s 166.3349 KOps/s $\color{#d91a1a}-0.30\%$
test_unbind 0.5170s 14.3864ms 69.5101 Ops/s 72.0720 Ops/s $\color{#d91a1a}-3.55\%$
test_full_like 9.2815ms 8.0127ms 124.8022 Ops/s 132.0116 Ops/s $\textbf{\color{#d91a1a}-5.46\%}$
test_zeros_like 14.9346ms 7.0005ms 142.8476 Ops/s 355.7758 Ops/s $\textbf{\color{#d91a1a}-59.85\%}$
test_ones_like 13.8970ms 8.0099ms 124.8460 Ops/s 299.4325 Ops/s $\textbf{\color{#d91a1a}-58.31\%}$
test_clone 16.7846ms 9.6349ms 103.7898 Ops/s 193.2683 Ops/s $\textbf{\color{#d91a1a}-46.30\%}$
test_squeeze 0.1051ms 12.7076μs 78.6933 KOps/s 78.2757 KOps/s $\color{#35bf28}+0.53\%$
test_unsqueeze 0.1597ms 91.2882μs 10.9543 KOps/s 9.9627 KOps/s $\textbf{\color{#35bf28}+9.95\%}$
test_split 0.4198ms 0.1987ms 5.0315 KOps/s 4.9647 KOps/s $\color{#35bf28}+1.35\%$
test_permute 0.5247ms 0.2279ms 4.3888 KOps/s 4.4296 KOps/s $\color{#d91a1a}-0.92\%$
test_stack 32.4950ms 25.8056ms 38.7513 Ops/s 37.8760 Ops/s $\color{#35bf28}+2.31\%$
test_cat 30.8747ms 25.4695ms 39.2626 Ops/s 36.5354 Ops/s $\textbf{\color{#35bf28}+7.46\%}$

Copy link

github-actions bot commented Sep 16, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}27$. Worsened: $\large\color{#d91a1a}12$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 39.9610μs 12.8008μs 78.1201 KOps/s 69.3434 KOps/s $\textbf{\color{#35bf28}+12.66\%}$
test_plain_set_stack_nested 38.5510μs 13.0227μs 76.7892 KOps/s 67.7642 KOps/s $\textbf{\color{#35bf28}+13.32\%}$
test_plain_set_nested_inplace 50.5510μs 13.7975μs 72.4767 KOps/s 63.9815 KOps/s $\textbf{\color{#35bf28}+13.28\%}$
test_plain_set_stack_nested_inplace 48.6620μs 13.7231μs 72.8699 KOps/s 64.7601 KOps/s $\textbf{\color{#35bf28}+12.52\%}$
test_items 42.4610μs 2.8778μs 347.4896 KOps/s 349.2834 KOps/s $\color{#d91a1a}-0.51\%$
test_items_nested 0.3972ms 0.3279ms 3.0498 KOps/s 3.0646 KOps/s $\color{#d91a1a}-0.48\%$
test_items_nested_locked 0.3917ms 0.3321ms 3.0111 KOps/s 3.0503 KOps/s $\color{#d91a1a}-1.29\%$
test_items_nested_leaf 81.2420μs 55.4735μs 18.0266 KOps/s 18.0670 KOps/s $\color{#d91a1a}-0.22\%$
test_items_stack_nested 0.4090ms 0.3313ms 3.0181 KOps/s 3.0259 KOps/s $\color{#d91a1a}-0.26\%$
test_items_stack_nested_leaf 82.1520μs 57.3747μs 17.4293 KOps/s 17.5928 KOps/s $\color{#d91a1a}-0.93\%$
test_items_stack_nested_locked 0.3903ms 0.3306ms 3.0252 KOps/s 3.0323 KOps/s $\color{#d91a1a}-0.23\%$
test_keys 30.4910μs 3.4228μs 292.1596 KOps/s 289.9364 KOps/s $\color{#35bf28}+0.77\%$
test_keys_nested 84.1620μs 56.3377μs 17.7501 KOps/s 17.6351 KOps/s $\color{#35bf28}+0.65\%$
test_keys_nested_locked 2.8095ms 62.6122μs 15.9713 KOps/s 16.0641 KOps/s $\color{#d91a1a}-0.58\%$
test_keys_nested_leaf 75.7010μs 47.8491μs 20.8990 KOps/s 20.9378 KOps/s $\color{#d91a1a}-0.19\%$
test_keys_stack_nested 0.1001ms 56.1691μs 17.8034 KOps/s 17.7566 KOps/s $\color{#35bf28}+0.26\%$
test_keys_stack_nested_leaf 79.1820μs 48.3694μs 20.6742 KOps/s 20.7848 KOps/s $\color{#d91a1a}-0.53\%$
test_keys_stack_nested_locked 94.5820μs 61.6316μs 16.2254 KOps/s 16.1675 KOps/s $\color{#35bf28}+0.36\%$
test_values 5.6167μs 0.8601μs 1.1627 MOps/s 1.1910 MOps/s $\color{#d91a1a}-2.38\%$
test_values_nested 69.2420μs 40.7025μs 24.5685 KOps/s 24.3536 KOps/s $\color{#35bf28}+0.88\%$
test_values_nested_locked 76.8920μs 42.5411μs 23.5067 KOps/s 23.2606 KOps/s $\color{#35bf28}+1.06\%$
test_values_nested_leaf 62.9410μs 35.4461μs 28.2119 KOps/s 28.0142 KOps/s $\color{#35bf28}+0.71\%$
test_values_stack_nested 74.7020μs 41.9279μs 23.8505 KOps/s 23.8398 KOps/s $\color{#35bf28}+0.04\%$
test_values_stack_nested_leaf 64.2010μs 36.0698μs 27.7240 KOps/s 27.6053 KOps/s $\color{#35bf28}+0.43\%$
test_values_stack_nested_locked 70.0620μs 43.5416μs 22.9665 KOps/s 22.9581 KOps/s $\color{#35bf28}+0.04\%$
test_membership 1.7766μs 0.5010μs 1.9960 MOps/s 1.9607 MOps/s $\color{#35bf28}+1.80\%$
test_membership_nested 16.7505μs 1.8877μs 529.7576 KOps/s 492.5576 KOps/s $\textbf{\color{#35bf28}+7.55\%}$
test_membership_nested_leaf 17.9350μs 1.9056μs 524.7633 KOps/s 513.2682 KOps/s $\color{#35bf28}+2.24\%$
test_membership_stacked_nested 27.1800μs 1.9364μs 516.4311 KOps/s 501.3917 KOps/s $\color{#35bf28}+3.00\%$
test_membership_stacked_nested_leaf 26.8610μs 1.9546μs 511.6165 KOps/s 499.2452 KOps/s $\color{#35bf28}+2.48\%$
test_membership_nested_last 25.8200μs 2.8569μs 350.0261 KOps/s 351.8604 KOps/s $\color{#d91a1a}-0.52\%$
test_membership_nested_leaf_last 31.1700μs 2.8405μs 352.0552 KOps/s 353.4170 KOps/s $\color{#d91a1a}-0.39\%$
test_membership_stacked_nested_last 37.0900μs 3.4941μs 286.1972 KOps/s 267.0709 KOps/s $\textbf{\color{#35bf28}+7.16\%}$
test_membership_stacked_nested_leaf_last 29.2610μs 3.4797μs 287.3846 KOps/s 268.9674 KOps/s $\textbf{\color{#35bf28}+6.85\%}$
test_nested_getleaf 28.8710μs 6.1125μs 163.5982 KOps/s 162.4862 KOps/s $\color{#35bf28}+0.68\%$
test_nested_get 33.0200μs 5.7633μs 173.5129 KOps/s 173.6646 KOps/s $\color{#d91a1a}-0.09\%$
test_stacked_getleaf 36.5110μs 6.0648μs 164.8850 KOps/s 163.7395 KOps/s $\color{#35bf28}+0.70\%$
test_stacked_get 30.2010μs 5.6018μs 178.5138 KOps/s 174.2348 KOps/s $\color{#35bf28}+2.46\%$
test_nested_getitemleaf 26.1110μs 6.0832μs 164.3883 KOps/s 161.6921 KOps/s $\color{#35bf28}+1.67\%$
test_nested_getitem 44.6010μs 5.7565μs 173.7157 KOps/s 173.6560 KOps/s $\color{#35bf28}+0.03\%$
test_stacked_getitemleaf 38.7410μs 6.1566μs 162.4264 KOps/s 164.2526 KOps/s $\color{#d91a1a}-1.11\%$
test_stacked_getitem 42.1710μs 5.7346μs 174.3804 KOps/s 174.4219 KOps/s $\color{#d91a1a}-0.02\%$
test_lock_nested 4.8914ms 0.4112ms 2.4318 KOps/s 2.4394 KOps/s $\color{#d91a1a}-0.31\%$
test_lock_stack_nested 0.4338ms 0.3746ms 2.6692 KOps/s 2.6674 KOps/s $\color{#35bf28}+0.07\%$
test_unlock_nested 0.7635ms 0.3490ms 2.8651 KOps/s 2.8844 KOps/s $\color{#d91a1a}-0.67\%$
test_unlock_stack_nested 0.3572ms 0.3134ms 3.1904 KOps/s 3.1596 KOps/s $\color{#35bf28}+0.97\%$
test_flatten_speed 0.1039ms 68.7929μs 14.5364 KOps/s 14.3497 KOps/s $\color{#35bf28}+1.30\%$
test_unflatten_speed 0.3288ms 0.2871ms 3.4829 KOps/s 3.4668 KOps/s $\color{#35bf28}+0.46\%$
test_common_ops 1.5329ms 1.1914ms 839.3788 Ops/s 814.7842 Ops/s $\color{#35bf28}+3.02\%$
test_creation 34.5110μs 1.4917μs 670.3983 KOps/s 687.5444 KOps/s $\color{#d91a1a}-2.49\%$
test_creation_empty 37.4510μs 13.2834μs 75.2820 KOps/s 60.2168 KOps/s $\textbf{\color{#35bf28}+25.02\%}$
test_creation_nested_1 35.9400μs 14.9884μs 66.7182 KOps/s 55.1730 KOps/s $\textbf{\color{#35bf28}+20.93\%}$
test_creation_nested_2 49.3520μs 17.6226μs 56.7452 KOps/s 48.3404 KOps/s $\textbf{\color{#35bf28}+17.39\%}$
test_clone 1.2965ms 29.0430μs 34.4318 KOps/s 36.3528 KOps/s $\textbf{\color{#d91a1a}-5.28\%}$
test_getitem[int] 98.5490ms 23.2433μs 43.0232 KOps/s 64.3103 KOps/s $\textbf{\color{#d91a1a}-33.10\%}$
test_getitem[slice_int] 0.1209ms 26.5803μs 37.6219 KOps/s 37.9317 KOps/s $\color{#d91a1a}-0.82\%$
test_getitem[range] 0.2306ms 0.1088ms 9.1890 KOps/s 8.8151 KOps/s $\color{#35bf28}+4.24\%$
test_getitem[tuple] 0.1186ms 22.8259μs 43.8098 KOps/s 43.6713 KOps/s $\color{#35bf28}+0.32\%$
test_getitem[list] 0.2057ms 97.8240μs 10.2224 KOps/s 10.2640 KOps/s $\color{#d91a1a}-0.41\%$
test_setitem_dim[int] 85.1520μs 46.2506μs 21.6213 KOps/s 22.7747 KOps/s $\textbf{\color{#d91a1a}-5.06\%}$
test_setitem_dim[slice_int] 0.2090ms 66.7450μs 14.9824 KOps/s 15.1124 KOps/s $\color{#d91a1a}-0.86\%$
test_setitem_dim[range] 0.1578ms 0.1260ms 7.9377 KOps/s 8.0041 KOps/s $\color{#d91a1a}-0.83\%$
test_setitem_dim[tuple] 84.5020μs 59.7386μs 16.7396 KOps/s 17.0398 KOps/s $\color{#d91a1a}-1.76\%$
test_setitem 76.7320μs 39.4053μs 25.3773 KOps/s 24.5273 KOps/s $\color{#35bf28}+3.47\%$
test_set 87.9620μs 39.8712μs 25.0808 KOps/s 25.2005 KOps/s $\color{#d91a1a}-0.48\%$
test_set_shared 0.3417ms 49.4552μs 20.2203 KOps/s 20.1183 KOps/s $\color{#35bf28}+0.51\%$
test_update 78.8020μs 45.5067μs 21.9748 KOps/s 20.5563 KOps/s $\textbf{\color{#35bf28}+6.90\%}$
test_update_nested 92.1830μs 52.7795μs 18.9468 KOps/s 17.0554 KOps/s $\textbf{\color{#35bf28}+11.09\%}$
test_update__nested 99.5220μs 58.3269μs 17.1447 KOps/s 16.1581 KOps/s $\textbf{\color{#35bf28}+6.11\%}$
test_set_nested 79.3220μs 41.1365μs 24.3093 KOps/s 22.2659 KOps/s $\textbf{\color{#35bf28}+9.18\%}$
test_set_nested_new 81.7420μs 44.4927μs 22.4756 KOps/s 20.3418 KOps/s $\textbf{\color{#35bf28}+10.49\%}$
test_select 92.6920μs 57.3830μs 17.4268 KOps/s 15.8898 KOps/s $\textbf{\color{#35bf28}+9.67\%}$
test_select_nested 0.3539ms 41.7620μs 23.9452 KOps/s 23.9488 KOps/s $\color{#d91a1a}-0.02\%$
test_exclude_nested 92.8330μs 58.2206μs 17.1761 KOps/s 17.0415 KOps/s $\color{#35bf28}+0.79\%$
test_empty[True] 0.2877ms 0.2431ms 4.1128 KOps/s 4.1222 KOps/s $\color{#d91a1a}-0.23\%$
test_empty[False] 3.2171μs 0.7460μs 1.3405 MOps/s 1.3578 MOps/s $\color{#d91a1a}-1.28\%$
test_to 69.6520μs 23.6647μs 42.2570 KOps/s 41.8548 KOps/s $\color{#35bf28}+0.96\%$
test_to_nonblocking 48.9510μs 22.9668μs 43.5412 KOps/s 42.9890 KOps/s $\color{#35bf28}+1.28\%$
test_unbind_speed 1.5310ms 0.2708ms 3.6930 KOps/s 3.7138 KOps/s $\color{#d91a1a}-0.56\%$
test_unbind_speed_stack0 0.3252ms 0.2690ms 3.7176 KOps/s 3.6956 KOps/s $\color{#35bf28}+0.59\%$
test_unbind_speed_stack1 0.1046s 0.6970ms 1.4347 KOps/s 1.4145 KOps/s $\color{#35bf28}+1.43\%$
test_split 0.1069s 2.1265ms 470.2473 Ops/s 475.4294 Ops/s $\color{#d91a1a}-1.09\%$
test_chunk 0.1014s 2.1134ms 473.1675 Ops/s 472.8333 Ops/s $\color{#35bf28}+0.07\%$
test_creation[device0] 0.3433ms 0.1302ms 7.6787 KOps/s 7.9690 KOps/s $\color{#d91a1a}-3.64\%$
test_creation_from_tensor 0.3432ms 0.1324ms 7.5541 KOps/s 7.5888 KOps/s $\color{#d91a1a}-0.46\%$
test_add_one[memmap_tensor0] 0.1382ms 8.5327μs 117.1959 KOps/s 117.1978 KOps/s $-0.00\%$
test_contiguous[memmap_tensor0] 31.1400μs 2.1454μs 466.1180 KOps/s 465.7500 KOps/s $\color{#35bf28}+0.08\%$
test_stack[memmap_tensor0] 36.6810μs 6.6561μs 150.2380 KOps/s 154.4188 KOps/s $\color{#d91a1a}-2.71\%$
test_memmaptd_index 1.1487ms 0.4060ms 2.4633 KOps/s 2.4850 KOps/s $\color{#d91a1a}-0.87\%$
test_memmaptd_index_astensor 0.7041ms 0.4639ms 2.1555 KOps/s 2.1725 KOps/s $\color{#d91a1a}-0.78\%$
test_memmaptd_index_op 1.3585ms 0.9538ms 1.0484 KOps/s 1.0014 KOps/s $\color{#35bf28}+4.69\%$
test_serialize_model 0.1309s 0.1298s 7.7047 Ops/s 7.6871 Ops/s $\color{#35bf28}+0.23\%$
test_serialize_model_pickle 1.3499s 1.2129s 0.8245 Ops/s 0.8428 Ops/s $\color{#d91a1a}-2.17\%$
test_serialize_weights 0.1299s 0.1290s 7.7498 Ops/s 7.7328 Ops/s $\color{#35bf28}+0.22\%$
test_serialize_weights_returnearly 0.2528s 57.1195ms 17.5071 Ops/s 17.5319 Ops/s $\color{#d91a1a}-0.14\%$
test_serialize_weights_pickle 1.3495s 1.2116s 0.8253 Ops/s 0.8211 Ops/s $\color{#35bf28}+0.52\%$
test_reshape_pytree 58.6520μs 34.3710μs 29.0943 KOps/s 28.6813 KOps/s $\color{#35bf28}+1.44\%$
test_reshape_td 96.1720μs 40.3839μs 24.7623 KOps/s 24.4680 KOps/s $\color{#35bf28}+1.20\%$
test_view_pytree 62.5020μs 33.6849μs 29.6869 KOps/s 29.0150 KOps/s $\color{#35bf28}+2.32\%$
test_view_td 81.7820μs 44.3160μs 22.5652 KOps/s 21.9265 KOps/s $\color{#35bf28}+2.91\%$
test_unbind_pytree 66.4010μs 33.3257μs 30.0069 KOps/s 29.7597 KOps/s $\color{#35bf28}+0.83\%$
test_unbind_td 0.5266ms 41.9421μs 23.8424 KOps/s 24.1838 KOps/s $\color{#d91a1a}-1.41\%$
test_split_pytree 85.1320μs 46.3851μs 21.5586 KOps/s 22.2253 KOps/s $\color{#d91a1a}-3.00\%$
test_split_td 0.6950ms 55.3470μs 18.0678 KOps/s 18.5178 KOps/s $\color{#d91a1a}-2.43\%$
test_add_pytree 91.4320μs 55.2524μs 18.0988 KOps/s 17.5327 KOps/s $\color{#35bf28}+3.23\%$
test_add_td 0.1219ms 84.9270μs 11.7748 KOps/s 11.3316 KOps/s $\color{#35bf28}+3.91\%$
test_compile_add_one_nested[tensordict-compile] 0.4143ms 0.2126ms 4.7047 KOps/s 4.5624 KOps/s $\color{#35bf28}+3.12\%$
test_compile_add_one_nested[tensordict-eager] 0.2031ms 0.1490ms 6.7128 KOps/s 6.7504 KOps/s $\color{#d91a1a}-0.56\%$
test_compile_add_one_nested[pytree-compile] 0.1831ms 0.1437ms 6.9572 KOps/s 7.0316 KOps/s $\color{#d91a1a}-1.06\%$
test_compile_add_one_nested[pytree-eager] 0.2471ms 0.1827ms 5.4740 KOps/s 5.6609 KOps/s $\color{#d91a1a}-3.30\%$
test_compile_copy_nested[tensordict-compile] 58.9820μs 21.9877μs 45.4799 KOps/s 48.4211 KOps/s $\textbf{\color{#d91a1a}-6.07\%}$
test_compile_copy_nested[tensordict-eager] 78.5820μs 43.5989μs 22.9363 KOps/s 23.1790 KOps/s $\color{#d91a1a}-1.05\%$
test_compile_copy_nested[pytree-compile] 0.2341ms 64.1781μs 15.5817 KOps/s 15.7969 KOps/s $\color{#d91a1a}-1.36\%$
test_compile_copy_nested[pytree-eager] 75.1520μs 50.5636μs 19.7771 KOps/s 20.3009 KOps/s $\color{#d91a1a}-2.58\%$
test_compile_add_one_flat[tensordict-compile] 0.4186ms 0.3156ms 3.1681 KOps/s 3.1789 KOps/s $\color{#d91a1a}-0.34\%$
test_compile_add_one_flat[tensordict-eager] 0.2680ms 0.2062ms 4.8493 KOps/s 4.8476 KOps/s $\color{#35bf28}+0.03\%$
test_compile_add_one_flat[tensorclass-compile] 0.1755ms 0.1264ms 7.9094 KOps/s 7.8916 KOps/s $\color{#35bf28}+0.23\%$
test_compile_add_one_flat[tensorclass-eager] 0.1031ms 60.1304μs 16.6305 KOps/s 16.9330 KOps/s $\color{#d91a1a}-1.79\%$
test_compile_add_one_flat[pytree-compile] 0.3547ms 0.3141ms 3.1832 KOps/s 3.1410 KOps/s $\color{#35bf28}+1.34\%$
test_compile_add_one_flat[pytree-eager] 0.7642ms 0.6125ms 1.6327 KOps/s 1.7039 KOps/s $\color{#d91a1a}-4.18\%$
test_compile_add_self_flat[tensordict-eager] 0.2911ms 0.2434ms 4.1076 KOps/s 4.0402 KOps/s $\color{#35bf28}+1.67\%$
test_compile_add_self_flat[tensordict-compile] 0.4646ms 0.3163ms 3.1615 KOps/s 3.1665 KOps/s $\color{#d91a1a}-0.16\%$
test_compile_add_self_flat[tensorclass-eager] 0.1115ms 70.7594μs 14.1324 KOps/s 14.4894 KOps/s $\color{#d91a1a}-2.46\%$
test_compile_add_self_flat[tensorclass-compile] 0.1858ms 0.1273ms 7.8578 KOps/s 7.8755 KOps/s $\color{#d91a1a}-0.23\%$
test_compile_add_self_flat[pytree-eager] 0.5883ms 0.5094ms 1.9633 KOps/s 1.9822 KOps/s $\color{#d91a1a}-0.95\%$
test_compile_add_self_flat[pytree-compile] 0.3613ms 0.3145ms 3.1799 KOps/s 3.1950 KOps/s $\color{#d91a1a}-0.48\%$
test_compile_copy_flat[tensordict-compile] 64.8520μs 19.2841μs 51.8561 KOps/s 53.6982 KOps/s $\color{#d91a1a}-3.43\%$
test_compile_copy_flat[tensordict-eager] 56.8910μs 27.4241μs 36.4642 KOps/s 36.4837 KOps/s $\color{#d91a1a}-0.05\%$
test_compile_copy_flat[pytree-compile] 0.1021ms 68.2391μs 14.6544 KOps/s 14.5651 KOps/s $\color{#35bf28}+0.61\%$
test_compile_copy_flat[pytree-eager] 81.2820μs 51.0291μs 19.5967 KOps/s 19.4832 KOps/s $\color{#35bf28}+0.58\%$
test_compile_assign_and_add[tensordict-compile] 2.2759ms 0.7970ms 1.2548 KOps/s 1.1364 KOps/s $\textbf{\color{#35bf28}+10.41\%}$
test_compile_assign_and_add[tensordict-eager] 3.4344ms 3.0978ms 322.8115 Ops/s 313.9494 Ops/s $\color{#35bf28}+2.82\%$
test_compile_assign_and_add[pytree-compile] 2.2904ms 0.7957ms 1.2568 KOps/s 1.1706 KOps/s $\textbf{\color{#35bf28}+7.36\%}$
test_compile_assign_and_add[pytree-eager] 3.2515ms 3.1113ms 321.4130 Ops/s 328.9031 Ops/s $\color{#d91a1a}-2.28\%$
test_compile_indexing[tensor-tensordict-compile] 0.1564ms 0.1086ms 9.2120 KOps/s 9.1060 KOps/s $\color{#35bf28}+1.16\%$
test_compile_indexing[tensor-tensordict-eager] 0.1882ms 60.2087μs 16.6089 KOps/s 15.9773 KOps/s $\color{#35bf28}+3.95\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1750ms 0.1024ms 9.7637 KOps/s 9.7084 KOps/s $\color{#35bf28}+0.57\%$
test_compile_indexing[tensor-tensorclass-eager] 0.2279ms 46.1400μs 21.6732 KOps/s 23.6807 KOps/s $\textbf{\color{#d91a1a}-8.48\%}$
test_compile_indexing[tensor-pytree-compile] 0.2763ms 0.1092ms 9.1586 KOps/s 9.6084 KOps/s $\color{#d91a1a}-4.68\%$
test_compile_indexing[tensor-pytree-eager] 89.4030μs 45.5330μs 21.9621 KOps/s 24.0918 KOps/s $\textbf{\color{#d91a1a}-8.84\%}$
test_compile_indexing[slice-tensordict-compile] 0.2622ms 0.1360ms 7.3550 KOps/s 7.3833 KOps/s $\color{#d91a1a}-0.38\%$
test_compile_indexing[slice-tensordict-eager] 0.1542ms 25.0181μs 39.9710 KOps/s 39.9240 KOps/s $\color{#35bf28}+0.12\%$
test_compile_indexing[slice-tensorclass-compile] 0.1752ms 0.1296ms 7.7134 KOps/s 7.7360 KOps/s $\color{#d91a1a}-0.29\%$
test_compile_indexing[slice-tensorclass-eager] 60.6110μs 20.6780μs 48.3606 KOps/s 48.8127 KOps/s $\color{#d91a1a}-0.93\%$
test_compile_indexing[slice-pytree-compile] 0.1843ms 0.1362ms 7.3448 KOps/s 7.6770 KOps/s $\color{#d91a1a}-4.33\%$
test_compile_indexing[slice-pytree-eager] 52.1810μs 20.2986μs 49.2644 KOps/s 49.6839 KOps/s $\color{#d91a1a}-0.84\%$
test_compile_indexing[int-tensordict-compile] 0.1837ms 0.1368ms 7.3114 KOps/s 7.3251 KOps/s $\color{#d91a1a}-0.19\%$
test_compile_indexing[int-tensordict-eager] 0.4598ms 24.8331μs 40.2689 KOps/s 40.6749 KOps/s $\color{#d91a1a}-1.00\%$
test_compile_indexing[int-tensorclass-compile] 0.1828ms 0.1372ms 7.2885 KOps/s 7.6695 KOps/s $\color{#d91a1a}-4.97\%$
test_compile_indexing[int-tensorclass-eager] 0.1229ms 22.6829μs 44.0861 KOps/s 49.5851 KOps/s $\textbf{\color{#d91a1a}-11.09\%}$
test_compile_indexing[int-pytree-compile] 0.1851ms 0.1376ms 7.2655 KOps/s 7.6647 KOps/s $\textbf{\color{#d91a1a}-5.21\%}$
test_compile_indexing[int-pytree-eager] 58.0510μs 20.3410μs 49.1618 KOps/s 49.2015 KOps/s $\color{#d91a1a}-0.08\%$
test_mod_add[eager] 71.7220μs 29.4783μs 33.9233 KOps/s 32.1998 KOps/s $\textbf{\color{#35bf28}+5.35\%}$
test_mod_add[compile] 0.1187ms 69.2600μs 14.4383 KOps/s 13.5930 KOps/s $\textbf{\color{#35bf28}+6.22\%}$
test_mod_add[compile-overhead] 0.2675ms 0.1356ms 7.3768 KOps/s 6.6888 KOps/s $\textbf{\color{#35bf28}+10.29\%}$
test_mod_wrap[eager] 0.3984ms 0.2520ms 3.9683 KOps/s 4.1291 KOps/s $\color{#d91a1a}-3.90\%$
test_mod_wrap[compile] 1.5223ms 0.2941ms 3.4001 KOps/s 3.3974 KOps/s $\color{#35bf28}+0.08\%$
test_mod_wrap[compile-overhead] 7.8456ms 4.1689ms 239.8716 Ops/s 244.7609 Ops/s $\color{#d91a1a}-2.00\%$
test_mod_wrap_and_backward[eager] 1.5086ms 1.4116ms 708.4211 Ops/s 704.5122 Ops/s $\color{#35bf28}+0.55\%$
test_mod_wrap_and_backward[compile] 1.8411ms 1.4097ms 709.3632 Ops/s 715.6648 Ops/s $\color{#d91a1a}-0.88\%$
test_mod_wrap_and_backward[compile-overhead] 1.7344ms 1.0467ms 955.3474 Ops/s 991.6654 Ops/s $\color{#d91a1a}-3.66\%$
test_seq_add[eager] 0.1393ms 92.1889μs 10.8473 KOps/s 10.5417 KOps/s $\color{#35bf28}+2.90\%$
test_seq_add[compile] 0.2400ms 81.9425μs 12.2037 KOps/s 11.9373 KOps/s $\color{#35bf28}+2.23\%$
test_seq_add[compile-overhead] 0.1798ms 0.1128ms 8.8687 KOps/s 8.8898 KOps/s $\color{#d91a1a}-0.24\%$
test_seq_wrap[eager] 0.4568ms 0.3754ms 2.6638 KOps/s 2.6026 KOps/s $\color{#35bf28}+2.35\%$
test_seq_wrap[compile] 0.3618ms 0.3092ms 3.2340 KOps/s 3.1891 KOps/s $\color{#35bf28}+1.41\%$
test_seq_wrap[compile-overhead] 0.2797ms 0.2167ms 4.6154 KOps/s 4.5351 KOps/s $\color{#35bf28}+1.77\%$
test_func_call_runtime[False-eager] 0.8435ms 0.7506ms 1.3322 KOps/s 1.2959 KOps/s $\color{#35bf28}+2.80\%$
test_func_call_runtime[False-compile] 0.8349ms 0.7791ms 1.2836 KOps/s 1.2986 KOps/s $\color{#d91a1a}-1.16\%$
test_func_call_runtime[False-compile-overhead] 0.4021ms 0.3555ms 2.8127 KOps/s 2.8195 KOps/s $\color{#d91a1a}-0.24\%$
test_func_call_runtime[True-eager] 1.1704ms 0.8927ms 1.1202 KOps/s 1.0844 KOps/s $\color{#35bf28}+3.31\%$
test_func_call_runtime[True-compile] 0.9228ms 0.8186ms 1.2216 KOps/s 1.2112 KOps/s $\color{#35bf28}+0.86\%$
test_func_call_runtime[True-compile-overhead] 0.4849ms 0.3923ms 2.5490 KOps/s 2.5743 KOps/s $\color{#d91a1a}-0.98\%$
test_func_call_cm_runtime[False-eager] 0.7863ms 0.7303ms 1.3693 KOps/s 1.3782 KOps/s $\color{#d91a1a}-0.65\%$
test_func_call_cm_runtime[False-compile] 1.0103ms 0.8251ms 1.2119 KOps/s 1.2908 KOps/s $\textbf{\color{#d91a1a}-6.11\%}$
test_func_call_cm_runtime[False-compile-overhead] 0.4003ms 0.3583ms 2.7908 KOps/s 2.8082 KOps/s $\color{#d91a1a}-0.62\%$
test_func_call_cm_runtime[True-eager] 1.3680ms 0.9922ms 1.0079 KOps/s 1.0063 KOps/s $\color{#35bf28}+0.16\%$
test_func_call_cm_runtime[True-compile] 1.0754ms 0.8783ms 1.1386 KOps/s 1.1894 KOps/s $\color{#d91a1a}-4.27\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4689ms 0.4147ms 2.4112 KOps/s 2.3754 KOps/s $\color{#35bf28}+1.51\%$
test_vmap_func_call_cm_runtime[eager] 2.5265ms 2.0566ms 486.2487 Ops/s 482.1671 Ops/s $\color{#35bf28}+0.85\%$
test_vmap_func_call_cm_runtime[compile] 0.9148ms 0.8573ms 1.1664 KOps/s 1.1860 KOps/s $\color{#d91a1a}-1.65\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4708ms 0.4212ms 2.3742 KOps/s 2.3864 KOps/s $\color{#d91a1a}-0.51\%$
test_distributed 15.2093ms 0.2312ms 4.3243 KOps/s 8.6672 KOps/s $\textbf{\color{#d91a1a}-50.11\%}$
test_tdmodule 0.1130ms 13.8824μs 72.0337 KOps/s 62.7924 KOps/s $\textbf{\color{#35bf28}+14.72\%}$
test_tdmodule_dispatch 60.7420μs 26.6233μs 37.5611 KOps/s 32.7317 KOps/s $\textbf{\color{#35bf28}+14.75\%}$
test_tdseq 33.4110μs 14.2978μs 69.9408 KOps/s 61.5302 KOps/s $\textbf{\color{#35bf28}+13.67\%}$
test_tdseq_dispatch 52.5410μs 28.7305μs 34.8063 KOps/s 29.8925 KOps/s $\textbf{\color{#35bf28}+16.44\%}$
test_instantiation_functorch 2.0251ms 1.8702ms 534.7108 Ops/s 540.8377 Ops/s $\color{#d91a1a}-1.13\%$
test_instantiation_td 1.8097ms 1.1937ms 837.7149 Ops/s 842.9529 Ops/s $\color{#d91a1a}-0.62\%$
test_exec_functorch 0.2657ms 0.2081ms 4.8051 KOps/s 4.9570 KOps/s $\color{#d91a1a}-3.06\%$
test_exec_functional_call 0.2698ms 0.2091ms 4.7823 KOps/s 4.8956 KOps/s $\color{#d91a1a}-2.31\%$
test_exec_td 0.2923ms 0.2156ms 4.6376 KOps/s 4.6601 KOps/s $\color{#d91a1a}-0.48\%$
test_exec_td_decorator 0.9269ms 0.2566ms 3.8969 KOps/s 3.8929 KOps/s $\color{#35bf28}+0.10\%$
test_vmap_mlp_speed[True-True] 0.8167ms 0.6783ms 1.4743 KOps/s 1.4608 KOps/s $\color{#35bf28}+0.92\%$
test_vmap_mlp_speed[True-False] 0.7496ms 0.6753ms 1.4808 KOps/s 1.4612 KOps/s $\color{#35bf28}+1.34\%$
test_vmap_mlp_speed[False-True] 0.6903ms 0.5811ms 1.7209 KOps/s 1.7113 KOps/s $\color{#35bf28}+0.56\%$
test_vmap_mlp_speed[False-False] 0.6732ms 0.5922ms 1.6887 KOps/s 1.7189 KOps/s $\color{#d91a1a}-1.75\%$
test_vmap_mlp_speed_decorator[True-True] 1.3022ms 0.6718ms 1.4885 KOps/s 1.4549 KOps/s $\color{#35bf28}+2.31\%$
test_vmap_mlp_speed_decorator[True-False] 0.7708ms 0.6769ms 1.4773 KOps/s 1.4801 KOps/s $\color{#d91a1a}-0.19\%$
test_vmap_mlp_speed_decorator[False-True] 0.8201ms 0.6012ms 1.6632 KOps/s 1.7116 KOps/s $\color{#d91a1a}-2.83\%$
test_vmap_mlp_speed_decorator[False-False] 0.7392ms 0.6096ms 1.6405 KOps/s 1.7276 KOps/s $\textbf{\color{#d91a1a}-5.04\%}$
test_vmap_transformer_speed[True-True] 8.5528ms 8.3495ms 119.7675 Ops/s 121.3128 Ops/s $\color{#d91a1a}-1.27\%$
test_vmap_transformer_speed[True-False] 8.5374ms 8.2973ms 120.5205 Ops/s 121.9030 Ops/s $\color{#d91a1a}-1.13\%$
test_vmap_transformer_speed[False-True] 8.4169ms 8.1578ms 122.5828 Ops/s 125.3422 Ops/s $\color{#d91a1a}-2.20\%$
test_vmap_transformer_speed[False-False] 8.3537ms 8.0940ms 123.5476 Ops/s 125.0049 Ops/s $\color{#d91a1a}-1.17\%$
test_vmap_transformer_speed_decorator[True-True] 19.9549ms 19.3971ms 51.5541 Ops/s 52.2756 Ops/s $\color{#d91a1a}-1.38\%$
test_vmap_transformer_speed_decorator[True-False] 20.5845ms 19.6070ms 51.0021 Ops/s 52.2619 Ops/s $\color{#d91a1a}-2.41\%$
test_vmap_transformer_speed_decorator[False-True] 19.8672ms 19.4737ms 51.3513 Ops/s 52.8644 Ops/s $\color{#d91a1a}-2.86\%$
test_vmap_transformer_speed_decorator[False-False] 19.8616ms 19.5021ms 51.2764 Ops/s 52.2253 Ops/s $\color{#d91a1a}-1.82\%$
test_to_module_speed[True] 1.0426ms 0.9478ms 1.0551 KOps/s 1.0652 KOps/s $\color{#d91a1a}-0.95\%$
test_to_module_speed[False] 1.1777ms 0.9186ms 1.0887 KOps/s 1.0853 KOps/s $\color{#35bf28}+0.31\%$
test_tc_init 59.9210μs 30.4402μs 32.8512 KOps/s 28.2686 KOps/s $\textbf{\color{#35bf28}+16.21\%}$
test_tc_init_nested 92.2620μs 61.5901μs 16.2364 KOps/s 14.3914 KOps/s $\textbf{\color{#35bf28}+12.82\%}$
test_tc_first_layer_tensor 4.6030μs 0.6719μs 1.4882 MOps/s 1.4907 MOps/s $\color{#d91a1a}-0.16\%$
test_tc_first_layer_nontensor 26.5000μs 2.2684μs 440.8435 KOps/s 439.9591 KOps/s $\color{#35bf28}+0.20\%$
test_tc_second_layer_tensor 28.6830μs 1.3719μs 728.8940 KOps/s 727.1100 KOps/s $\color{#35bf28}+0.25\%$
test_tc_second_layer_nontensor 30.6800μs 2.9545μs 338.4703 KOps/s 335.8060 KOps/s $\color{#35bf28}+0.79\%$
test_unbind 0.2100s 12.6292ms 79.1815 Ops/s 89.5697 Ops/s $\textbf{\color{#d91a1a}-11.60\%}$
test_full_like 0.6583ms 0.5749ms 1.7394 KOps/s 1.7337 KOps/s $\color{#35bf28}+0.33\%$
test_zeros_like 0.3339ms 0.1978ms 5.0549 KOps/s 5.0492 KOps/s $\color{#35bf28}+0.11\%$
test_ones_like 0.2502ms 0.1976ms 5.0610 KOps/s 5.0563 KOps/s $\color{#35bf28}+0.09\%$
test_clone 0.5150ms 0.4143ms 2.4139 KOps/s 2.4189 KOps/s $\color{#d91a1a}-0.21\%$
test_squeeze 36.7110μs 9.7847μs 102.2005 KOps/s 102.4445 KOps/s $\color{#d91a1a}-0.24\%$
test_unsqueeze 0.2186ms 73.3370μs 13.6357 KOps/s 13.6685 KOps/s $\color{#d91a1a}-0.24\%$
test_split 0.4506ms 0.1564ms 6.3929 KOps/s 6.2347 KOps/s $\color{#35bf28}+2.54\%$
test_permute 0.2117ms 0.1770ms 5.6492 KOps/s 5.5660 KOps/s $\color{#35bf28}+1.50\%$
test_stack 1.2532ms 0.8692ms 1.1505 KOps/s 1.1242 KOps/s $\color{#35bf28}+2.34\%$
test_cat 1.3139ms 1.2315ms 811.9976 Ops/s 811.6867 Ops/s $\color{#35bf28}+0.04\%$

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 16, 2024
ghstack-source-id: bd18cc26e0c3147d72ac435740e019399c432e3c
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 16, 2024
ghstack-source-id: f547a2e80a99c31062ea4962ef044b4f44064c60
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 16, 2024
ghstack-source-id: 2cf05e277a1b587cccfe75391e9a9b7538bc34b3
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 16, 2024
ghstack-source-id: 48c9a030643f870b810f1e832374b7c612dff4bb
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 16, 2024
ghstack-source-id: 7be68be89000804f864750ab9205bdfe65bb66f5
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: 190c37737e970a4bedd7e3bcdb31a6dafef1fdb7
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: 9ade05abda4c29a60d8f33d1b4951bbfe11f31c1
Pull Request resolved: #994
vmoens added a commit that referenced this pull request Sep 17, 2024
ghstack-source-id: 9ade05abda4c29a60d8f33d1b4951bbfe11f31c1
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 18, 2024
ghstack-source-id: cf56a4f5219c6c31642772f46b36e1f35ddae679
Pull Request resolved: #994
@vmoens vmoens added the documentation Improvements or additions to documentation label Sep 18, 2024
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 18, 2024
ghstack-source-id: 4119db55e22bc41310085c7e8a96bd81c212b826
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 18, 2024
ghstack-source-id: d285166b131eacfe7712a07250442dffba646353
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 19, 2024
ghstack-source-id: 9268402aa425d82b9f4ce301971ea6ffd0cea65b
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 19, 2024
ghstack-source-id: 695b419d6a7a7b504462152a29b2d5e1b9e60843
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 19, 2024
ghstack-source-id: b464acffd2fed4c483dcf163ef53a0dc4807bb91
Pull Request resolved: #994
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 19, 2024
ghstack-source-id: f6e0b2b6779c63948084cb607f45b64f7555c274
Pull Request resolved: #994
@vmoens vmoens merged commit d85a10b into gh/vmoens/21/base Sep 19, 2024
37 of 42 checks passed
vmoens added a commit that referenced this pull request Sep 19, 2024
ghstack-source-id: f6e0b2b6779c63948084cb607f45b64f7555c274
Pull Request resolved: #994
@vmoens vmoens deleted the gh/vmoens/21/head branch September 19, 2024 19:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants