[Doc] export tutorial, TDM tuto refactoring #994
Merged
This was referenced Sep 16, 2024
vmoens added a commit that referenced this pull request (Sep 16, 2024):
ghstack-source-id: fde5222d755a8c67692505039c97e3bed196989a
Pull Request resolved: #994
facebook-github-bot added the CLA Signed label (Sep 16, 2024)
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 47.1780μs | 20.1839μs | 49.5445 KOps/s | 48.1336 KOps/s | |
test_plain_set_stack_nested | 56.4250μs | 20.8835μs | 47.8846 KOps/s | 47.8749 KOps/s | |
test_plain_set_nested_inplace | 0.2372ms | 21.9339μs | 45.5916 KOps/s | 44.5128 KOps/s | |
test_plain_set_stack_nested_inplace | 61.1840μs | 21.6992μs | 46.0847 KOps/s | 44.4508 KOps/s | |
test_items | 56.5960μs | 4.1328μs | 241.9679 KOps/s | 236.8258 KOps/s | |
test_items_nested | 0.6649ms | 0.3652ms | 2.7382 KOps/s | 2.7354 KOps/s | |
test_items_nested_locked | 0.6082ms | 0.3642ms | 2.7460 KOps/s | 2.7484 KOps/s | |
test_items_nested_leaf | 0.1210ms | 68.7985μs | 14.5352 KOps/s | 14.5902 KOps/s | |
test_items_stack_nested | 0.8487ms | 0.3740ms | 2.6741 KOps/s | 2.7066 KOps/s | |
test_items_stack_nested_leaf | 0.1429ms | 71.0040μs | 14.0837 KOps/s | 14.0219 KOps/s | |
test_items_stack_nested_locked | 0.5709ms | 0.3676ms | 2.7201 KOps/s | 2.6767 KOps/s | |
test_keys | 23.2830μs | 3.5058μs | 285.2389 KOps/s | 285.0128 KOps/s | |
test_keys_nested | 0.2565ms | 99.2490μs | 10.0757 KOps/s | 9.7935 KOps/s | |
test_keys_nested_locked | 0.7969ms | 0.1036ms | 9.6489 KOps/s | 9.4849 KOps/s | |
test_keys_nested_leaf | 0.2930ms | 82.2314μs | 12.1608 KOps/s | 12.0193 KOps/s | |
test_keys_stack_nested | 0.2261ms | 99.2600μs | 10.0745 KOps/s | 9.9095 KOps/s | |
test_keys_stack_nested_leaf | 0.1348ms | 82.2635μs | 12.1561 KOps/s | 11.9812 KOps/s | |
test_keys_stack_nested_locked | 0.2230ms | 0.1035ms | 9.6657 KOps/s | 9.4801 KOps/s | |
test_values | 12.6674μs | 1.0417μs | 959.9691 KOps/s | 950.9609 KOps/s | |
test_values_nested | 0.1895ms | 75.7739μs | 13.1972 KOps/s | 13.4267 KOps/s | |
test_values_nested_locked | 0.2406ms | 75.2877μs | 13.2824 KOps/s | 13.4175 KOps/s | |
test_values_nested_leaf | 0.1253ms | 62.1106μs | 16.1003 KOps/s | 15.7662 KOps/s | |
test_values_stack_nested | 0.2305ms | 77.1496μs | 12.9618 KOps/s | 12.9030 KOps/s | |
test_values_stack_nested_leaf | 0.1142ms | 60.8644μs | 16.4300 KOps/s | 15.9352 KOps/s | |
test_values_stack_nested_locked | 0.2370ms | 76.4645μs | 13.0780 KOps/s | 13.1891 KOps/s | |
test_membership | 14.6143μs | 0.7863μs | 1.2719 MOps/s | 1.3952 MOps/s | |
test_membership_nested | 63.7980μs | 2.7652μs | 361.6379 KOps/s | 363.5419 KOps/s | |
test_membership_nested_leaf | 41.9080μs | 2.7436μs | 364.4894 KOps/s | 364.6715 KOps/s | |
test_membership_stacked_nested | 24.3160μs | 2.7516μs | 363.4216 KOps/s | 363.7844 KOps/s | |
test_membership_stacked_nested_leaf | 90.2280μs | 2.7780μs | 359.9758 KOps/s | 361.0258 KOps/s | |
test_membership_nested_last | 64.5500μs | 3.9592μs | 252.5741 KOps/s | 250.2351 KOps/s | |
test_membership_nested_leaf_last | 41.2770μs | 3.9653μs | 252.1848 KOps/s | 252.0457 KOps/s | |
test_membership_stacked_nested_last | 26.2090μs | 3.9701μs | 251.8841 KOps/s | 218.2635 KOps/s | |
test_membership_stacked_nested_leaf_last | 81.1510μs | 3.9837μs | 251.0252 KOps/s | 219.3991 KOps/s | |
test_nested_getleaf | 59.6200μs | 10.7973μs | 92.6155 KOps/s | 90.8607 KOps/s | |
test_nested_get | 0.1078ms | 10.3296μs | 96.8095 KOps/s | 97.5709 KOps/s | |
test_stacked_getleaf | 69.7990μs | 10.8691μs | 92.0038 KOps/s | 93.5081 KOps/s | |
test_stacked_get | 37.7210μs | 10.3032μs | 97.0569 KOps/s | 96.2052 KOps/s | |
test_nested_getitemleaf | 0.1094ms | 11.0784μs | 90.2660 KOps/s | 85.4038 KOps/s | |
test_nested_getitem | 53.3690μs | 10.4384μs | 95.8001 KOps/s | 93.2110 KOps/s | |
test_stacked_getitemleaf | 68.8370μs | 11.0044μs | 90.8730 KOps/s | 88.3722 KOps/s | |
test_stacked_getitem | 72.7550μs | 10.4881μs | 95.3458 KOps/s | 93.4772 KOps/s | |
test_lock_nested | 88.5799ms | 0.5828ms | 1.7159 KOps/s | 2.0305 KOps/s | |
test_lock_stack_nested | 0.6845ms | 0.4536ms | 2.2047 KOps/s | 2.1638 KOps/s | |
test_unlock_nested | 0.1023s | 0.5089ms | 1.9650 KOps/s | 2.3658 KOps/s | |
test_unlock_stack_nested | 0.6043ms | 0.3704ms | 2.6997 KOps/s | 2.5892 KOps/s | |
test_flatten_speed | 0.2207ms | 87.1248μs | 11.4778 KOps/s | 11.4020 KOps/s | |
test_unflatten_speed | 0.6266ms | 0.4630ms | 2.1598 KOps/s | 2.1403 KOps/s | |
test_common_ops | 4.6306ms | 1.1123ms | 899.0377 Ops/s | 876.3026 Ops/s | |
test_creation | 41.4540μs | 2.0269μs | 493.3636 KOps/s | 479.3630 KOps/s | |
test_creation_empty | 52.9990μs | 17.7137μs | 56.4534 KOps/s | 55.4105 KOps/s | |
test_creation_nested_1 | 50.1430μs | 20.8669μs | 47.9229 KOps/s | 46.9569 KOps/s | |
test_creation_nested_2 | 0.1252ms | 25.4746μs | 39.2548 KOps/s | 39.1710 KOps/s | |
test_clone | 79.4070μs | 16.7330μs | 59.7622 KOps/s | 58.5851 KOps/s | |
test_getitem[int] | 1.1261ms | 16.6616μs | 60.0184 KOps/s | 58.2619 KOps/s | |
test_getitem[slice_int] | 0.1470ms | 30.3298μs | 32.9708 KOps/s | 31.0043 KOps/s | |
test_getitem[range] | 0.1878ms | 57.5869μs | 17.3651 KOps/s | 17.0547 KOps/s | |
test_getitem[tuple] | 0.1372ms | 24.8797μs | 40.1934 KOps/s | 38.7626 KOps/s | |
test_getitem[list] | 0.5545ms | 53.3289μs | 18.7516 KOps/s | 18.6444 KOps/s | |
test_setitem_dim[int] | 89.9770μs | 33.3877μs | 29.9511 KOps/s | 29.2924 KOps/s | |
test_setitem_dim[slice_int] | 0.1203ms | 61.4348μs | 16.2774 KOps/s | 15.8644 KOps/s | |
test_setitem_dim[range] | 0.1250ms | 83.1913μs | 12.0205 KOps/s | 11.5643 KOps/s | |
test_setitem_dim[tuple] | 0.1195ms | 49.2223μs | 20.3160 KOps/s | 19.8402 KOps/s | |
test_setitem | 74.9200μs | 29.7005μs | 33.6694 KOps/s | 32.6934 KOps/s | |
test_set | 0.1377ms | 29.1127μs | 34.3492 KOps/s | 32.5075 KOps/s | |
test_set_shared | 3.4273ms | 0.2147ms | 4.6580 KOps/s | 4.6656 KOps/s | |
test_update | 0.1745ms | 36.0953μs | 27.7044 KOps/s | 26.2016 KOps/s | |
test_update_nested | 1.0830ms | 47.1589μs | 21.2049 KOps/s | 20.4792 KOps/s | |
test_update__nested | 0.1593ms | 34.7981μs | 28.7372 KOps/s | 28.2065 KOps/s | |
test_set_nested | 76.5620μs | 31.4084μs | 31.8386 KOps/s | 30.6599 KOps/s | |
test_set_nested_new | 0.3021ms | 36.7548μs | 27.2073 KOps/s | 26.5279 KOps/s | |
test_select | 0.2691ms | 54.0335μs | 18.5070 KOps/s | 18.2926 KOps/s | |
test_select_nested | 0.1322ms | 59.9112μs | 16.6914 KOps/s | 16.9941 KOps/s | |
test_exclude_nested | 0.1988ms | 74.0048μs | 13.5126 KOps/s | 13.6148 KOps/s | |
test_empty[True] | 0.4900ms | 0.3129ms | 3.1962 KOps/s | 3.1612 KOps/s | |
test_empty[False] | 27.0127μs | 1.2064μs | 828.9128 KOps/s | 802.1806 KOps/s | |
test_unbind_speed | 0.4180ms | 0.2976ms | 3.3599 KOps/s | 3.2797 KOps/s | |
test_unbind_speed_stack0 | 0.5167ms | 0.2931ms | 3.4119 KOps/s | 3.3457 KOps/s | |
test_unbind_speed_stack1 | 0.1014s | 0.8095ms | 1.2353 KOps/s | 1.3310 KOps/s | |
test_split | 92.7769ms | 2.1778ms | 459.1873 Ops/s | 451.8677 Ops/s | |
test_chunk | 2.2860ms | 1.9933ms | 501.6773 Ops/s | 447.9348 Ops/s | |
test_creation[device0] | 4.3978ms | 0.1195ms | 8.3711 KOps/s | 8.2206 KOps/s | |
test_creation_from_tensor | 0.2358ms | 0.1161ms | 8.6158 KOps/s | 8.4689 KOps/s | |
test_add_one[memmap_tensor0] | 0.2732ms | 7.3846μs | 135.4173 KOps/s | 134.1461 KOps/s | |
test_contiguous[memmap_tensor0] | 20.6080μs | 1.9294μs | 518.3042 KOps/s | 506.8171 KOps/s | |
test_stack[memmap_tensor0] | 85.3890μs | 5.5705μs | 179.5156 KOps/s | 179.8203 KOps/s | |
test_memmaptd_index | 1.1254ms | 0.3972ms | 2.5173 KOps/s | 2.5035 KOps/s | |
test_memmaptd_index_astensor | 0.9650ms | 0.4738ms | 2.1105 KOps/s | 2.0788 KOps/s | |
test_memmaptd_index_op | 1.9690ms | 0.9953ms | 1.0047 KOps/s | 968.4167 Ops/s | |
test_serialize_model | 0.2195s | 0.1338s | 7.4728 Ops/s | 8.4295 Ops/s | |
test_serialize_model_pickle | 0.4467s | 0.3861s | 2.5900 Ops/s | 2.5283 Ops/s | |
test_serialize_weights | 0.1301s | 0.1187s | 8.4211 Ops/s | 7.6851 Ops/s | |
test_serialize_weights_returnearly | 0.1752s | 0.1645s | 6.0781 Ops/s | 6.4299 Ops/s | |
test_serialize_weights_pickle | 1.0555s | 0.6840s | 1.4619 Ops/s | 1.0954 Ops/s | |
test_serialize_weights_filesystem | 0.1449s | 0.1397s | 7.1591 Ops/s | 6.7966 Ops/s | |
test_serialize_model_filesystem | 0.1477s | 0.1436s | 6.9632 Ops/s | 6.4427 Ops/s | |
test_reshape_pytree | 0.1122ms | 38.4097μs | 26.0351 KOps/s | 25.4902 KOps/s | |
test_reshape_td | 0.1087ms | 45.0555μs | 22.1949 KOps/s | 20.1025 KOps/s | |
test_view_pytree | 90.4580μs | 38.7050μs | 25.8365 KOps/s | 25.5862 KOps/s | |
test_view_td | 0.1568ms | 51.8759μs | 19.2768 KOps/s | 17.6856 KOps/s | |
test_unbind_pytree | 0.1494ms | 35.7792μs | 27.9492 KOps/s | 27.5685 KOps/s | |
test_unbind_td | 0.3173ms | 44.4976μs | 22.4731 KOps/s | 21.1235 KOps/s | |
test_split_pytree | 0.2206ms | 37.5565μs | 26.6265 KOps/s | 25.9082 KOps/s | |
test_split_td | 0.5573ms | 61.0178μs | 16.3887 KOps/s | 17.0034 KOps/s | |
test_add_pytree | 0.1410ms | 45.6707μs | 21.8959 KOps/s | 21.4136 KOps/s | |
test_add_td | 0.2485ms | 81.6390μs | 12.2490 KOps/s | 11.6876 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1619ms | 55.7527μs | 17.9364 KOps/s | 17.4042 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.4106ms | 0.1759ms | 5.6864 KOps/s | 5.5468 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1548ms | 55.4031μs | 18.0495 KOps/s | 16.9875 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.4324ms | 0.1417ms | 7.0549 KOps/s | 7.0361 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1116ms | 21.4687μs | 46.5795 KOps/s | 46.8156 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1802ms | 67.4325μs | 14.8296 KOps/s | 14.8395 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2230ms | 79.3341μs | 12.6049 KOps/s | 13.1865 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1612ms | 70.6138μs | 14.1615 KOps/s | 14.6899 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3803ms | 0.1689ms | 5.9190 KOps/s | 5.7034 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3866ms | 0.1898ms | 5.2699 KOps/s | 5.2043 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1270ms | 45.1218μs | 22.1623 KOps/s | 21.0780 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1761ms | 68.7274μs | 14.5502 KOps/s | 14.2681 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3969ms | 0.1736ms | 5.7615 KOps/s | 5.5451 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5876ms | 0.2827ms | 3.5371 KOps/s | 3.4441 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4001ms | 0.2041ms | 4.9006 KOps/s | 4.8175 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4545ms | 0.1729ms | 5.7833 KOps/s | 5.7220 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1973ms | 62.6126μs | 15.9712 KOps/s | 15.8533 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1017ms | 46.2473μs | 21.6229 KOps/s | 20.7726 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4936ms | 0.2349ms | 4.2573 KOps/s | 4.3249 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.4359ms | 0.1757ms | 5.6907 KOps/s | 5.5702 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2550ms | 0.1035ms | 9.6619 KOps/s | 9.5444 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1281ms | 56.6205μs | 17.6615 KOps/s | 17.5741 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1445ms | 78.2263μs | 12.7834 KOps/s | 12.9312 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1959ms | 69.9309μs | 14.2998 KOps/s | 14.4200 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.4548ms | 0.1997ms | 5.0080 KOps/s | 4.9898 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.1384ms | 1.6298ms | 613.5593 Ops/s | 593.9931 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.5247ms | 0.1927ms | 5.1891 KOps/s | 5.1061 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.2654ms | 1.0788ms | 926.9460 Ops/s | 901.8481 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.8537ms | 0.4252ms | 2.3521 KOps/s | 2.3192 KOps/s | |
test_compile_assign_and_add_stack[eager] | 5.8678ms | 3.8320ms | 260.9625 Ops/s | 255.1261 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1500ms | 33.5975μs | 29.7641 KOps/s | 27.9056 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 1.2333ms | 49.0106μs | 20.4038 KOps/s | 20.1430 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1252ms | 28.3746μs | 35.2428 KOps/s | 31.9796 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1137ms | 29.5441μs | 33.8477 KOps/s | 34.4227 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2393ms | 28.0149μs | 35.6953 KOps/s | 32.1458 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1003ms | 29.1846μs | 34.2646 KOps/s | 34.2730 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2055ms | 72.7787μs | 13.7403 KOps/s | 13.0934 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5836ms | 27.5311μs | 36.3226 KOps/s | 34.0909 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1378ms | 66.6446μs | 15.0050 KOps/s | 14.2509 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1202ms | 23.4560μs | 42.6331 KOps/s | 42.1291 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1590ms | 66.6854μs | 14.9958 KOps/s | 14.3513 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1209ms | 23.2014μs | 43.1008 KOps/s | 42.3048 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1808ms | 73.2386μs | 13.6540 KOps/s | 13.1596 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.9827ms | 27.4056μs | 36.4889 KOps/s | 34.7475 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1795ms | 66.4717μs | 15.0440 KOps/s | 14.3881 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.3214ms | 22.8076μs | 43.8449 KOps/s | 42.6742 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2163ms | 65.6613μs | 15.2297 KOps/s | 14.3950 KOps/s | |
test_compile_indexing[int-pytree-eager] | 84.0460μs | 22.9850μs | 43.5067 KOps/s | 42.9722 KOps/s | |
test_mod_add[eager] | 0.1114ms | 25.5790μs | 39.0946 KOps/s | 37.5500 KOps/s | |
test_mod_add[compile] | 0.1116ms | 37.9702μs | 26.3364 KOps/s | 24.9118 KOps/s | |
test_mod_add[compile-overhead] | 88.0740μs | 38.9795μs | 25.6545 KOps/s | 24.3228 KOps/s | |
test_mod_wrap[eager] | 0.4366ms | 0.2070ms | 4.8304 KOps/s | 4.7220 KOps/s | |
test_mod_wrap[compile] | 0.4327ms | 0.2266ms | 4.4136 KOps/s | 4.1713 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4101ms | 0.2264ms | 4.4169 KOps/s | 4.2508 KOps/s | |
test_mod_wrap_and_backward[eager] | 13.9729ms | 11.5002ms | 86.9552 Ops/s | 89.6277 Ops/s | |
test_mod_wrap_and_backward[compile] | 16.8575ms | 11.7360ms | 85.2080 Ops/s | 86.4131 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 13.7560ms | 11.7254ms | 85.2848 Ops/s | 88.7364 Ops/s | |
test_seq_add[eager] | 0.2588ms | 89.9307μs | 11.1197 KOps/s | 10.5752 KOps/s | |
test_seq_add[compile] | 0.1159ms | 63.6952μs | 15.6998 KOps/s | 15.0978 KOps/s | |
test_seq_add[compile-overhead] | 0.1617ms | 61.7851μs | 16.1851 KOps/s | 15.5010 KOps/s | |
test_seq_wrap[eager] | 0.7516ms | 0.3796ms | 2.6344 KOps/s | 2.4968 KOps/s | |
test_seq_wrap[compile] | 1.2768ms | 0.2786ms | 3.5898 KOps/s | 3.5509 KOps/s | |
test_seq_wrap[compile-overhead] | 1.3744ms | 0.2669ms | 3.7467 KOps/s | 3.6263 KOps/s | |
test_func_call_runtime[False-eager] | 0.9317ms | 0.5045ms | 1.9820 KOps/s | 1.8817 KOps/s | |
test_func_call_runtime[False-compile] | 1.0347ms | 0.5049ms | 1.9805 KOps/s | 1.9595 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.7018ms | 0.5028ms | 1.9890 KOps/s | 1.9659 KOps/s | |
test_func_call_runtime[True-eager] | 1.2653ms | 0.7205ms | 1.3879 KOps/s | 1.3315 KOps/s | |
test_func_call_runtime[True-compile] | 0.7813ms | 0.5108ms | 1.9576 KOps/s | 1.9115 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.7480ms | 0.5175ms | 1.9322 KOps/s | 1.9354 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.0036ms | 0.5026ms | 1.9897 KOps/s | 1.8871 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0058ms | 0.5114ms | 1.9556 KOps/s | 1.9694 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6731ms | 0.5083ms | 1.9673 KOps/s | 1.9632 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.3392ms | 0.8689ms | 1.1508 KOps/s | 1.1377 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.1199ms | 0.7196ms | 1.3896 KOps/s | 1.3283 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2101ms | 0.7262ms | 1.3770 KOps/s | 1.3209 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5749ms | 1.8850ms | 530.5072 Ops/s | 532.5123 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 2.7023ms | 1.9396ms | 515.5669 Ops/s | 517.4172 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 2.7665ms | 1.9344ms | 516.9478 Ops/s | 516.8820 Ops/s | |
test_distributed | 0.2898ms | 0.1242ms | 8.0547 KOps/s | 7.8655 KOps/s | |
test_tdmodule | 34.7450μs | 18.6118μs | 53.7294 KOps/s | 54.3643 KOps/s | |
test_tdmodule_dispatch | 57.8670μs | 37.0875μs | 26.9633 KOps/s | 27.4414 KOps/s | |
test_tdseq | 52.2470μs | 21.0814μs | 47.4351 KOps/s | 47.4334 KOps/s | |
test_tdseq_dispatch | 64.5000μs | 42.3849μs | 23.5933 KOps/s | 23.8070 KOps/s | |
test_instantiation_functorch | 1.7723ms | 1.6024ms | 624.0803 Ops/s | 620.0295 Ops/s | |
test_instantiation_td | 1.8551ms | 1.1797ms | 847.6682 Ops/s | 839.5402 Ops/s | |
test_exec_functorch | 0.2893ms | 0.1819ms | 5.4971 KOps/s | 5.3999 KOps/s | |
test_exec_functional_call | 0.3634ms | 0.1673ms | 5.9774 KOps/s | 5.6808 KOps/s | |
test_exec_td | 0.2650ms | 0.1661ms | 6.0200 KOps/s | 5.7568 KOps/s | |
test_exec_td_decorator | 1.2056ms | 0.2178ms | 4.5918 KOps/s | 4.4143 KOps/s | |
test_vmap_mlp_speed[True-True] | 1.0851ms | 0.6534ms | 1.5305 KOps/s | 1.5363 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.9940ms | 0.6501ms | 1.5382 KOps/s | 1.5404 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.9147ms | 0.4988ms | 2.0048 KOps/s | 2.0057 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.7855ms | 0.4988ms | 2.0048 KOps/s | 1.9978 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.0268ms | 0.6299ms | 1.5875 KOps/s | 1.5796 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9259ms | 0.6376ms | 1.5683 KOps/s | 1.5782 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8945ms | 0.5181ms | 1.9301 KOps/s | 1.9264 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8422ms | 0.5229ms | 1.9125 KOps/s | 1.9284 KOps/s | |
test_to_module_speed[True] | 1.5847ms | 1.3364ms | 748.2679 Ops/s | 768.7079 Ops/s | |
test_to_module_speed[False] | 1.8768ms | 1.3122ms | 762.0712 Ops/s | 790.0530 Ops/s | |
test_tc_init | 0.1117ms | 44.2332μs | 22.6075 KOps/s | 22.4207 KOps/s | |
test_tc_init_nested | 0.2190ms | 88.6583μs | 11.2793 KOps/s | 10.9260 KOps/s | |
test_tc_first_layer_tensor | 82.8250μs | 1.5303μs | 653.4634 KOps/s | 646.4972 KOps/s | |
test_tc_first_layer_nontensor | 43.5700μs | 4.7226μs | 211.7495 KOps/s | 210.9379 KOps/s | |
test_tc_second_layer_tensor | 23.7640μs | 2.8656μs | 348.9650 KOps/s | 349.4046 KOps/s | |
test_tc_second_layer_nontensor | 0.1157ms | 6.0303μs | 165.8285 KOps/s | 166.3349 KOps/s | |
test_unbind | 0.5170s | 14.3864ms | 69.5101 Ops/s | 72.0720 Ops/s | |
test_full_like | 9.2815ms | 8.0127ms | 124.8022 Ops/s | 132.0116 Ops/s | |
test_zeros_like | 14.9346ms | 7.0005ms | 142.8476 Ops/s | 355.7758 Ops/s | |
test_ones_like | 13.8970ms | 8.0099ms | 124.8460 Ops/s | 299.4325 Ops/s | |
test_clone | 16.7846ms | 9.6349ms | 103.7898 Ops/s | 193.2683 Ops/s | |
test_squeeze | 0.1051ms | 12.7076μs | 78.6933 KOps/s | 78.2757 KOps/s | |
test_unsqueeze | 0.1597ms | 91.2882μs | 10.9543 KOps/s | 9.9627 KOps/s | |
test_split | 0.4198ms | 0.1987ms | 5.0315 KOps/s | 4.9647 KOps/s | |
test_permute | 0.5247ms | 0.2279ms | 4.3888 KOps/s | 4.4296 KOps/s | |
test_stack | 32.4950ms | 25.8056ms | 38.7513 Ops/s | 37.8760 Ops/s | |
test_cat | 30.8747ms | 25.4695ms | 39.2626 Ops/s | 36.5354 Ops/s |
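
The Change column is empty in the table above. As a rough aid for reading it, here is a minimal sketch that derives a relative change from the Ops and Ops on Repo HEAD cells of one row. It assumes Change means `(Ops - Ops on Repo HEAD) / Ops on Repo HEAD`; the benchmark bot's actual formula is not shown in this page, and `parse_ops` and `relative_change` are hypothetical helper names, not part of any tool.

```python
# Hypothetical helpers for reading the table; not part of the benchmark bot.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(cell: str) -> float:
    """Convert a cell such as '49.5445 KOps/s' to operations per second."""
    value, unit = cell.split()
    return float(value) * UNIT_SCALE[unit]


def relative_change(row: str) -> float:
    """Assumed definition: (Ops - Ops on Repo HEAD) / Ops on Repo HEAD."""
    cells = [c.strip() for c in row.split("|")]
    ops, head = parse_ops(cells[3]), parse_ops(cells[4])
    return (ops - head) / head


# Row copied from the table above.
row = "test_zeros_like | 14.9346ms | 7.0005ms | 142.8476 Ops/s | 355.7758 Ops/s | |"
print(f"{relative_change(row):+.1%}")  # prints -59.8%
```

Normalising the unit suffix first matters for rows such as `test_memmaptd_index_op`, where one cell is in KOps/s and the other in Ops/s.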
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 39.9610μs | 12.8008μs | 78.1201 KOps/s | 69.3434 KOps/s | |
test_plain_set_stack_nested | 38.5510μs | 13.0227μs | 76.7892 KOps/s | 67.7642 KOps/s | |
test_plain_set_nested_inplace | 50.5510μs | 13.7975μs | 72.4767 KOps/s | 63.9815 KOps/s | |
test_plain_set_stack_nested_inplace | 48.6620μs | 13.7231μs | 72.8699 KOps/s | 64.7601 KOps/s | |
test_items | 42.4610μs | 2.8778μs | 347.4896 KOps/s | 349.2834 KOps/s | |
test_items_nested | 0.3972ms | 0.3279ms | 3.0498 KOps/s | 3.0646 KOps/s | |
test_items_nested_locked | 0.3917ms | 0.3321ms | 3.0111 KOps/s | 3.0503 KOps/s | |
test_items_nested_leaf | 81.2420μs | 55.4735μs | 18.0266 KOps/s | 18.0670 KOps/s | |
test_items_stack_nested | 0.4090ms | 0.3313ms | 3.0181 KOps/s | 3.0259 KOps/s | |
test_items_stack_nested_leaf | 82.1520μs | 57.3747μs | 17.4293 KOps/s | 17.5928 KOps/s | |
test_items_stack_nested_locked | 0.3903ms | 0.3306ms | 3.0252 KOps/s | 3.0323 KOps/s | |
test_keys | 30.4910μs | 3.4228μs | 292.1596 KOps/s | 289.9364 KOps/s | |
test_keys_nested | 84.1620μs | 56.3377μs | 17.7501 KOps/s | 17.6351 KOps/s | |
test_keys_nested_locked | 2.8095ms | 62.6122μs | 15.9713 KOps/s | 16.0641 KOps/s | |
test_keys_nested_leaf | 75.7010μs | 47.8491μs | 20.8990 KOps/s | 20.9378 KOps/s | |
test_keys_stack_nested | 0.1001ms | 56.1691μs | 17.8034 KOps/s | 17.7566 KOps/s | |
test_keys_stack_nested_leaf | 79.1820μs | 48.3694μs | 20.6742 KOps/s | 20.7848 KOps/s | |
test_keys_stack_nested_locked | 94.5820μs | 61.6316μs | 16.2254 KOps/s | 16.1675 KOps/s | |
test_values | 5.6167μs | 0.8601μs | 1.1627 MOps/s | 1.1910 MOps/s | |
test_values_nested | 69.2420μs | 40.7025μs | 24.5685 KOps/s | 24.3536 KOps/s | |
test_values_nested_locked | 76.8920μs | 42.5411μs | 23.5067 KOps/s | 23.2606 KOps/s | |
test_values_nested_leaf | 62.9410μs | 35.4461μs | 28.2119 KOps/s | 28.0142 KOps/s | |
test_values_stack_nested | 74.7020μs | 41.9279μs | 23.8505 KOps/s | 23.8398 KOps/s | |
test_values_stack_nested_leaf | 64.2010μs | 36.0698μs | 27.7240 KOps/s | 27.6053 KOps/s | |
test_values_stack_nested_locked | 70.0620μs | 43.5416μs | 22.9665 KOps/s | 22.9581 KOps/s | |
test_membership | 1.7766μs | 0.5010μs | 1.9960 MOps/s | 1.9607 MOps/s | |
test_membership_nested | 16.7505μs | 1.8877μs | 529.7576 KOps/s | 492.5576 KOps/s | |
test_membership_nested_leaf | 17.9350μs | 1.9056μs | 524.7633 KOps/s | 513.2682 KOps/s | |
test_membership_stacked_nested | 27.1800μs | 1.9364μs | 516.4311 KOps/s | 501.3917 KOps/s | |
test_membership_stacked_nested_leaf | 26.8610μs | 1.9546μs | 511.6165 KOps/s | 499.2452 KOps/s | |
test_membership_nested_last | 25.8200μs | 2.8569μs | 350.0261 KOps/s | 351.8604 KOps/s | |
test_membership_nested_leaf_last | 31.1700μs | 2.8405μs | 352.0552 KOps/s | 353.4170 KOps/s | |
test_membership_stacked_nested_last | 37.0900μs | 3.4941μs | 286.1972 KOps/s | 267.0709 KOps/s | |
test_membership_stacked_nested_leaf_last | 29.2610μs | 3.4797μs | 287.3846 KOps/s | 268.9674 KOps/s | |
test_nested_getleaf | 28.8710μs | 6.1125μs | 163.5982 KOps/s | 162.4862 KOps/s | |
test_nested_get | 33.0200μs | 5.7633μs | 173.5129 KOps/s | 173.6646 KOps/s | |
test_stacked_getleaf | 36.5110μs | 6.0648μs | 164.8850 KOps/s | 163.7395 KOps/s | |
test_stacked_get | 30.2010μs | 5.6018μs | 178.5138 KOps/s | 174.2348 KOps/s | |
test_nested_getitemleaf | 26.1110μs | 6.0832μs | 164.3883 KOps/s | 161.6921 KOps/s | |
test_nested_getitem | 44.6010μs | 5.7565μs | 173.7157 KOps/s | 173.6560 KOps/s | |
test_stacked_getitemleaf | 38.7410μs | 6.1566μs | 162.4264 KOps/s | 164.2526 KOps/s | |
test_stacked_getitem | 42.1710μs | 5.7346μs | 174.3804 KOps/s | 174.4219 KOps/s | |
test_lock_nested | 4.8914ms | 0.4112ms | 2.4318 KOps/s | 2.4394 KOps/s | |
test_lock_stack_nested | 0.4338ms | 0.3746ms | 2.6692 KOps/s | 2.6674 KOps/s | |
test_unlock_nested | 0.7635ms | 0.3490ms | 2.8651 KOps/s | 2.8844 KOps/s | |
test_unlock_stack_nested | 0.3572ms | 0.3134ms | 3.1904 KOps/s | 3.1596 KOps/s | |
test_flatten_speed | 0.1039ms | 68.7929μs | 14.5364 KOps/s | 14.3497 KOps/s | |
test_unflatten_speed | 0.3288ms | 0.2871ms | 3.4829 KOps/s | 3.4668 KOps/s | |
test_common_ops | 1.5329ms | 1.1914ms | 839.3788 Ops/s | 814.7842 Ops/s | |
test_creation | 34.5110μs | 1.4917μs | 670.3983 KOps/s | 687.5444 KOps/s | |
test_creation_empty | 37.4510μs | 13.2834μs | 75.2820 KOps/s | 60.2168 KOps/s | |
test_creation_nested_1 | 35.9400μs | 14.9884μs | 66.7182 KOps/s | 55.1730 KOps/s | |
test_creation_nested_2 | 49.3520μs | 17.6226μs | 56.7452 KOps/s | 48.3404 KOps/s | |
test_clone | 1.2965ms | 29.0430μs | 34.4318 KOps/s | 36.3528 KOps/s | |
test_getitem[int] | 98.5490ms | 23.2433μs | 43.0232 KOps/s | 64.3103 KOps/s | |
test_getitem[slice_int] | 0.1209ms | 26.5803μs | 37.6219 KOps/s | 37.9317 KOps/s | |
test_getitem[range] | 0.2306ms | 0.1088ms | 9.1890 KOps/s | 8.8151 KOps/s | |
test_getitem[tuple] | 0.1186ms | 22.8259μs | 43.8098 KOps/s | 43.6713 KOps/s | |
test_getitem[list] | 0.2057ms | 97.8240μs | 10.2224 KOps/s | 10.2640 KOps/s | |
test_setitem_dim[int] | 85.1520μs | 46.2506μs | 21.6213 KOps/s | 22.7747 KOps/s | |
test_setitem_dim[slice_int] | 0.2090ms | 66.7450μs | 14.9824 KOps/s | 15.1124 KOps/s | |
test_setitem_dim[range] | 0.1578ms | 0.1260ms | 7.9377 KOps/s | 8.0041 KOps/s | |
test_setitem_dim[tuple] | 84.5020μs | 59.7386μs | 16.7396 KOps/s | 17.0398 KOps/s | |
test_setitem | 76.7320μs | 39.4053μs | 25.3773 KOps/s | 24.5273 KOps/s | |
test_set | 87.9620μs | 39.8712μs | 25.0808 KOps/s | 25.2005 KOps/s | |
test_set_shared | 0.3417ms | 49.4552μs | 20.2203 KOps/s | 20.1183 KOps/s | |
test_update | 78.8020μs | 45.5067μs | 21.9748 KOps/s | 20.5563 KOps/s | |
test_update_nested | 92.1830μs | 52.7795μs | 18.9468 KOps/s | 17.0554 KOps/s | |
test_update__nested | 99.5220μs | 58.3269μs | 17.1447 KOps/s | 16.1581 KOps/s | |
test_set_nested | 79.3220μs | 41.1365μs | 24.3093 KOps/s | 22.2659 KOps/s | |
test_set_nested_new | 81.7420μs | 44.4927μs | 22.4756 KOps/s | 20.3418 KOps/s | |
test_select | 92.6920μs | 57.3830μs | 17.4268 KOps/s | 15.8898 KOps/s | |
test_select_nested | 0.3539ms | 41.7620μs | 23.9452 KOps/s | 23.9488 KOps/s | |
test_exclude_nested | 92.8330μs | 58.2206μs | 17.1761 KOps/s | 17.0415 KOps/s | |
test_empty[True] | 0.2877ms | 0.2431ms | 4.1128 KOps/s | 4.1222 KOps/s | |
test_empty[False] | 3.2171μs | 0.7460μs | 1.3405 MOps/s | 1.3578 MOps/s | |
test_to | 69.6520μs | 23.6647μs | 42.2570 KOps/s | 41.8548 KOps/s | |
test_to_nonblocking | 48.9510μs | 22.9668μs | 43.5412 KOps/s | 42.9890 KOps/s | |
test_unbind_speed | 1.5310ms | 0.2708ms | 3.6930 KOps/s | 3.7138 KOps/s | |
test_unbind_speed_stack0 | 0.3252ms | 0.2690ms | 3.7176 KOps/s | 3.6956 KOps/s | |
test_unbind_speed_stack1 | 0.1046s | 0.6970ms | 1.4347 KOps/s | 1.4145 KOps/s | |
test_split | 0.1069s | 2.1265ms | 470.2473 Ops/s | 475.4294 Ops/s | |
test_chunk | 0.1014s | 2.1134ms | 473.1675 Ops/s | 472.8333 Ops/s | |
test_creation[device0] | 0.3433ms | 0.1302ms | 7.6787 KOps/s | 7.9690 KOps/s | |
test_creation_from_tensor | 0.3432ms | 0.1324ms | 7.5541 KOps/s | 7.5888 KOps/s | |
test_add_one[memmap_tensor0] | 0.1382ms | 8.5327μs | 117.1959 KOps/s | 117.1978 KOps/s | |
test_contiguous[memmap_tensor0] | 31.1400μs | 2.1454μs | 466.1180 KOps/s | 465.7500 KOps/s | |
test_stack[memmap_tensor0] | 36.6810μs | 6.6561μs | 150.2380 KOps/s | 154.4188 KOps/s | |
test_memmaptd_index | 1.1487ms | 0.4060ms | 2.4633 KOps/s | 2.4850 KOps/s | |
test_memmaptd_index_astensor | 0.7041ms | 0.4639ms | 2.1555 KOps/s | 2.1725 KOps/s | |
test_memmaptd_index_op | 1.3585ms | 0.9538ms | 1.0484 KOps/s | 1.0014 KOps/s | |
test_serialize_model | 0.1309s | 0.1298s | 7.7047 Ops/s | 7.6871 Ops/s | |
test_serialize_model_pickle | 1.3499s | 1.2129s | 0.8245 Ops/s | 0.8428 Ops/s | |
test_serialize_weights | 0.1299s | 0.1290s | 7.7498 Ops/s | 7.7328 Ops/s | |
test_serialize_weights_returnearly | 0.2528s | 57.1195ms | 17.5071 Ops/s | 17.5319 Ops/s | |
test_serialize_weights_pickle | 1.3495s | 1.2116s | 0.8253 Ops/s | 0.8211 Ops/s | |
test_reshape_pytree | 58.6520μs | 34.3710μs | 29.0943 KOps/s | 28.6813 KOps/s | |
test_reshape_td | 96.1720μs | 40.3839μs | 24.7623 KOps/s | 24.4680 KOps/s | |
test_view_pytree | 62.5020μs | 33.6849μs | 29.6869 KOps/s | 29.0150 KOps/s | |
test_view_td | 81.7820μs | 44.3160μs | 22.5652 KOps/s | 21.9265 KOps/s | |
test_unbind_pytree | 66.4010μs | 33.3257μs | 30.0069 KOps/s | 29.7597 KOps/s | |
test_unbind_td | 0.5266ms | 41.9421μs | 23.8424 KOps/s | 24.1838 KOps/s | |
test_split_pytree | 85.1320μs | 46.3851μs | 21.5586 KOps/s | 22.2253 KOps/s | |
test_split_td | 0.6950ms | 55.3470μs | 18.0678 KOps/s | 18.5178 KOps/s | |
test_add_pytree | 91.4320μs | 55.2524μs | 18.0988 KOps/s | 17.5327 KOps/s | |
test_add_td | 0.1219ms | 84.9270μs | 11.7748 KOps/s | 11.3316 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4143ms | 0.2126ms | 4.7047 KOps/s | 4.5624 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2031ms | 0.1490ms | 6.7128 KOps/s | 6.7504 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1831ms | 0.1437ms | 6.9572 KOps/s | 7.0316 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2471ms | 0.1827ms | 5.4740 KOps/s | 5.6609 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 58.9820μs | 21.9877μs | 45.4799 KOps/s | 48.4211 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 78.5820μs | 43.5989μs | 22.9363 KOps/s | 23.1790 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2341ms | 64.1781μs | 15.5817 KOps/s | 15.7969 KOps/s | |
test_compile_copy_nested[pytree-eager] | 75.1520μs | 50.5636μs | 19.7771 KOps/s | 20.3009 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4186ms | 0.3156ms | 3.1681 KOps/s | 3.1789 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2680ms | 0.2062ms | 4.8493 KOps/s | 4.8476 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1755ms | 0.1264ms | 7.9094 KOps/s | 7.8916 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1031ms | 60.1304μs | 16.6305 KOps/s | 16.9330 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.3547ms | 0.3141ms | 3.1832 KOps/s | 3.1410 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.7642ms | 0.6125ms | 1.6327 KOps/s | 1.7039 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.2911ms | 0.2434ms | 4.1076 KOps/s | 4.0402 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.4646ms | 0.3163ms | 3.1615 KOps/s | 3.1665 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1115ms | 70.7594μs | 14.1324 KOps/s | 14.4894 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1858ms | 0.1273ms | 7.8578 KOps/s | 7.8755 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5883ms | 0.5094ms | 1.9633 KOps/s | 1.9822 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3613ms | 0.3145ms | 3.1799 KOps/s | 3.1950 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 64.8520μs | 19.2841μs | 51.8561 KOps/s | 53.6982 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 56.8910μs | 27.4241μs | 36.4642 KOps/s | 36.4837 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1021ms | 68.2391μs | 14.6544 KOps/s | 14.5651 KOps/s | |
test_compile_copy_flat[pytree-eager] | 81.2820μs | 51.0291μs | 19.5967 KOps/s | 19.4832 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.2759ms | 0.7970ms | 1.2548 KOps/s | 1.1364 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.4344ms | 3.0978ms | 322.8115 Ops/s | 313.9494 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.2904ms | 0.7957ms | 1.2568 KOps/s | 1.1706 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.2515ms | 3.1113ms | 321.4130 Ops/s | 328.9031 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1564ms | 0.1086ms | 9.2120 KOps/s | 9.1060 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.1882ms | 60.2087μs | 16.6089 KOps/s | 15.9773 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1750ms | 0.1024ms | 9.7637 KOps/s | 9.7084 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2279ms | 46.1400μs | 21.6732 KOps/s | 23.6807 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2763ms | 0.1092ms | 9.1586 KOps/s | 9.6084 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 89.4030μs | 45.5330μs | 21.9621 KOps/s | 24.0918 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2622ms | 0.1360ms | 7.3550 KOps/s | 7.3833 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1542ms | 25.0181μs | 39.9710 KOps/s | 39.9240 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1752ms | 0.1296ms | 7.7134 KOps/s | 7.7360 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 60.6110μs | 20.6780μs | 48.3606 KOps/s | 48.8127 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1843ms | 0.1362ms | 7.3448 KOps/s | 7.6770 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 52.1810μs | 20.2986μs | 49.2644 KOps/s | 49.6839 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1837ms | 0.1368ms | 7.3114 KOps/s | 7.3251 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4598ms | 24.8331μs | 40.2689 KOps/s | 40.6749 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1828ms | 0.1372ms | 7.2885 KOps/s | 7.6695 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1229ms | 22.6829μs | 44.0861 KOps/s | 49.5851 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1851ms | 0.1376ms | 7.2655 KOps/s | 7.6647 KOps/s | |
test_compile_indexing[int-pytree-eager] | 58.0510μs | 20.3410μs | 49.1618 KOps/s | 49.2015 KOps/s | |
test_mod_add[eager] | 71.7220μs | 29.4783μs | 33.9233 KOps/s | 32.1998 KOps/s | |
test_mod_add[compile] | 0.1187ms | 69.2600μs | 14.4383 KOps/s | 13.5930 KOps/s | |
test_mod_add[compile-overhead] | 0.2675ms | 0.1356ms | 7.3768 KOps/s | 6.6888 KOps/s | |
test_mod_wrap[eager] | 0.3984ms | 0.2520ms | 3.9683 KOps/s | 4.1291 KOps/s | |
test_mod_wrap[compile] | 1.5223ms | 0.2941ms | 3.4001 KOps/s | 3.3974 KOps/s | |
test_mod_wrap[compile-overhead] | 7.8456ms | 4.1689ms | 239.8716 Ops/s | 244.7609 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5086ms | 1.4116ms | 708.4211 Ops/s | 704.5122 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.8411ms | 1.4097ms | 709.3632 Ops/s | 715.6648 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.7344ms | 1.0467ms | 955.3474 Ops/s | 991.6654 Ops/s | |
test_seq_add[eager] | 0.1393ms | 92.1889μs | 10.8473 KOps/s | 10.5417 KOps/s | |
test_seq_add[compile] | 0.2400ms | 81.9425μs | 12.2037 KOps/s | 11.9373 KOps/s | |
test_seq_add[compile-overhead] | 0.1798ms | 0.1128ms | 8.8687 KOps/s | 8.8898 KOps/s | |
test_seq_wrap[eager] | 0.4568ms | 0.3754ms | 2.6638 KOps/s | 2.6026 KOps/s | |
test_seq_wrap[compile] | 0.3618ms | 0.3092ms | 3.2340 KOps/s | 3.1891 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2797ms | 0.2167ms | 4.6154 KOps/s | 4.5351 KOps/s | |
test_func_call_runtime[False-eager] | 0.8435ms | 0.7506ms | 1.3322 KOps/s | 1.2959 KOps/s | |
test_func_call_runtime[False-compile] | 0.8349ms | 0.7791ms | 1.2836 KOps/s | 1.2986 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4021ms | 0.3555ms | 2.8127 KOps/s | 2.8195 KOps/s | |
test_func_call_runtime[True-eager] | 1.1704ms | 0.8927ms | 1.1202 KOps/s | 1.0844 KOps/s | |
test_func_call_runtime[True-compile] | 0.9228ms | 0.8186ms | 1.2216 KOps/s | 1.2112 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4849ms | 0.3923ms | 2.5490 KOps/s | 2.5743 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.7863ms | 0.7303ms | 1.3693 KOps/s | 1.3782 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.0103ms | 0.8251ms | 1.2119 KOps/s | 1.2908 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4003ms | 0.3583ms | 2.7908 KOps/s | 2.8082 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.3680ms | 0.9922ms | 1.0079 KOps/s | 1.0063 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.0754ms | 0.8783ms | 1.1386 KOps/s | 1.1894 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4689ms | 0.4147ms | 2.4112 KOps/s | 2.3754 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5265ms | 2.0566ms | 486.2487 Ops/s | 482.1671 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9148ms | 0.8573ms | 1.1664 KOps/s | 1.1860 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4708ms | 0.4212ms | 2.3742 KOps/s | 2.3864 KOps/s | |
test_distributed | 15.2093ms | 0.2312ms | 4.3243 KOps/s | 8.6672 KOps/s | |
test_tdmodule | 0.1130ms | 13.8824μs | 72.0337 KOps/s | 62.7924 KOps/s | |
test_tdmodule_dispatch | 60.7420μs | 26.6233μs | 37.5611 KOps/s | 32.7317 KOps/s | |
test_tdseq | 33.4110μs | 14.2978μs | 69.9408 KOps/s | 61.5302 KOps/s | |
test_tdseq_dispatch | 52.5410μs | 28.7305μs | 34.8063 KOps/s | 29.8925 KOps/s | |
test_instantiation_functorch | 2.0251ms | 1.8702ms | 534.7108 Ops/s | 540.8377 Ops/s | |
test_instantiation_td | 1.8097ms | 1.1937ms | 837.7149 Ops/s | 842.9529 Ops/s | |
test_exec_functorch | 0.2657ms | 0.2081ms | 4.8051 KOps/s | 4.9570 KOps/s | |
test_exec_functional_call | 0.2698ms | 0.2091ms | 4.7823 KOps/s | 4.8956 KOps/s | |
test_exec_td | 0.2923ms | 0.2156ms | 4.6376 KOps/s | 4.6601 KOps/s | |
test_exec_td_decorator | 0.9269ms | 0.2566ms | 3.8969 KOps/s | 3.8929 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.8167ms | 0.6783ms | 1.4743 KOps/s | 1.4608 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7496ms | 0.6753ms | 1.4808 KOps/s | 1.4612 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6903ms | 0.5811ms | 1.7209 KOps/s | 1.7113 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6732ms | 0.5922ms | 1.6887 KOps/s | 1.7189 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3022ms | 0.6718ms | 1.4885 KOps/s | 1.4549 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7708ms | 0.6769ms | 1.4773 KOps/s | 1.4801 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8201ms | 0.6012ms | 1.6632 KOps/s | 1.7116 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7392ms | 0.6096ms | 1.6405 KOps/s | 1.7276 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.5528ms | 8.3495ms | 119.7675 Ops/s | 121.3128 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.5374ms | 8.2973ms | 120.5205 Ops/s | 121.9030 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.4169ms | 8.1578ms | 122.5828 Ops/s | 125.3422 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.3537ms | 8.0940ms | 123.5476 Ops/s | 125.0049 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.9549ms | 19.3971ms | 51.5541 Ops/s | 52.2756 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.5845ms | 19.6070ms | 51.0021 Ops/s | 52.2619 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.8672ms | 19.4737ms | 51.3513 Ops/s | 52.8644 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.8616ms | 19.5021ms | 51.2764 Ops/s | 52.2253 Ops/s | |
test_to_module_speed[True] | 1.0426ms | 0.9478ms | 1.0551 KOps/s | 1.0652 KOps/s | |
test_to_module_speed[False] | 1.1777ms | 0.9186ms | 1.0887 KOps/s | 1.0853 KOps/s | |
test_tc_init | 59.9210μs | 30.4402μs | 32.8512 KOps/s | 28.2686 KOps/s | |
test_tc_init_nested | 92.2620μs | 61.5901μs | 16.2364 KOps/s | 14.3914 KOps/s | |
test_tc_first_layer_tensor | 4.6030μs | 0.6719μs | 1.4882 MOps/s | 1.4907 MOps/s | |
test_tc_first_layer_nontensor | 26.5000μs | 2.2684μs | 440.8435 KOps/s | 439.9591 KOps/s | |
test_tc_second_layer_tensor | 28.6830μs | 1.3719μs | 728.8940 KOps/s | 727.1100 KOps/s | |
test_tc_second_layer_nontensor | 30.6800μs | 2.9545μs | 338.4703 KOps/s | 335.8060 KOps/s | |
test_unbind | 0.2100s | 12.6292ms | 79.1815 Ops/s | 89.5697 Ops/s | |
test_full_like | 0.6583ms | 0.5749ms | 1.7394 KOps/s | 1.7337 KOps/s | |
test_zeros_like | 0.3339ms | 0.1978ms | 5.0549 KOps/s | 5.0492 KOps/s | |
test_ones_like | 0.2502ms | 0.1976ms | 5.0610 KOps/s | 5.0563 KOps/s | |
test_clone | 0.5150ms | 0.4143ms | 2.4139 KOps/s | 2.4189 KOps/s | |
test_squeeze | 36.7110μs | 9.7847μs | 102.2005 KOps/s | 102.4445 KOps/s | |
test_unsqueeze | 0.2186ms | 73.3370μs | 13.6357 KOps/s | 13.6685 KOps/s | |
test_split | 0.4506ms | 0.1564ms | 6.3929 KOps/s | 6.2347 KOps/s | |
test_permute | 0.2117ms | 0.1770ms | 5.6492 KOps/s | 5.5660 KOps/s | |
test_stack | 1.2532ms | 0.8692ms | 1.1505 KOps/s | 1.1242 KOps/s | |
test_cat | 1.3139ms | 1.2315ms | 811.9976 Ops/s | 811.6867 Ops/s |
vmoens added a commit that referenced this pull request on Sep 16, 2024
ghstack-source-id: bd18cc26e0c3147d72ac435740e019399c432e3c Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 16, 2024
ghstack-source-id: f547a2e80a99c31062ea4962ef044b4f44064c60 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 16, 2024
ghstack-source-id: 2cf05e277a1b587cccfe75391e9a9b7538bc34b3 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 16, 2024
ghstack-source-id: 48c9a030643f870b810f1e832374b7c612dff4bb Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 16, 2024
ghstack-source-id: 7be68be89000804f864750ab9205bdfe65bb66f5 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 17, 2024
ghstack-source-id: 190c37737e970a4bedd7e3bcdb31a6dafef1fdb7 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 17, 2024
ghstack-source-id: 9ade05abda4c29a60d8f33d1b4951bbfe11f31c1 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 18, 2024
ghstack-source-id: cf56a4f5219c6c31642772f46b36e1f35ddae679 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 18, 2024
ghstack-source-id: 4119db55e22bc41310085c7e8a96bd81c212b826 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 18, 2024
ghstack-source-id: d285166b131eacfe7712a07250442dffba646353 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 19, 2024
ghstack-source-id: 9268402aa425d82b9f4ce301971ea6ffd0cea65b Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 19, 2024
ghstack-source-id: 695b419d6a7a7b504462152a29b2d5e1b9e60843 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 19, 2024
ghstack-source-id: b464acffd2fed4c483dcf163ef53a0dc4807bb91 Pull Request resolved: #994
vmoens added a commit that referenced this pull request on Sep 19, 2024
ghstack-source-id: f6e0b2b6779c63948084cb607f45b64f7555c274 Pull Request resolved: #994
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
documentation
Improvements or additions to documentation
Stack from ghstack (oldest at bottom):