Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CI] Fix windows wheels #1006

Merged
merged 1 commit into from
Sep 23, 2024
Merged

[CI] Fix windows wheels #1006

merged 1 commit into from
Sep 23, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Sep 23, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 23, 2024
ghstack-source-id: e5c9fd8a8534fef623982fe435cadaf0a9c4703a
Pull Request resolved: #1006
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 23, 2024
@vmoens vmoens added the CI label Sep 23, 2024
@vmoens vmoens merged commit 3fb4393 into gh/vmoens/20/base Sep 23, 2024
45 checks passed
@vmoens vmoens deleted the gh/vmoens/20/head branch September 23, 2024 10:49
Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 222. Improved: $\large\color{#35bf28}14$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 44.7130μs 18.9368μs 52.8072 KOps/s 50.6421 KOps/s $\color{#35bf28}+4.28\%$
test_plain_set_stack_nested 47.9200μs 19.1687μs 52.1683 KOps/s 49.2244 KOps/s $\textbf{\color{#35bf28}+5.98\%}$
test_plain_set_nested_inplace 57.8690μs 20.7765μs 48.1314 KOps/s 46.8178 KOps/s $\color{#35bf28}+2.81\%$
test_plain_set_stack_nested_inplace 69.2000μs 20.8835μs 47.8846 KOps/s 46.4384 KOps/s $\color{#35bf28}+3.11\%$
test_items 99.5360μs 4.2003μs 238.0800 KOps/s 243.7985 KOps/s $\color{#d91a1a}-2.35\%$
test_items_nested 0.7000ms 0.3716ms 2.6912 KOps/s 2.7484 KOps/s $\color{#d91a1a}-2.08\%$
test_items_nested_locked 0.4381ms 0.3672ms 2.7236 KOps/s 2.7655 KOps/s $\color{#d91a1a}-1.51\%$
test_items_nested_leaf 0.1470ms 67.7062μs 14.7697 KOps/s 14.4723 KOps/s $\color{#35bf28}+2.05\%$
test_items_stack_nested 0.5172ms 0.3717ms 2.6905 KOps/s 2.7131 KOps/s $\color{#d91a1a}-0.83\%$
test_items_stack_nested_leaf 0.1179ms 71.4127μs 14.0031 KOps/s 13.7042 KOps/s $\color{#35bf28}+2.18\%$
test_items_stack_nested_locked 0.6801ms 0.3849ms 2.5982 KOps/s 2.7249 KOps/s $\color{#d91a1a}-4.65\%$
test_keys 47.6490μs 3.5729μs 279.8863 KOps/s 284.3278 KOps/s $\color{#d91a1a}-1.56\%$
test_keys_nested 0.1420ms 0.1008ms 9.9231 KOps/s 9.6401 KOps/s $\color{#35bf28}+2.94\%$
test_keys_nested_locked 0.7157ms 0.1050ms 9.5247 KOps/s 9.3909 KOps/s $\color{#35bf28}+1.42\%$
test_keys_nested_leaf 0.1617ms 83.6563μs 11.9537 KOps/s 11.6273 KOps/s $\color{#35bf28}+2.81\%$
test_keys_stack_nested 0.1462ms 0.1012ms 9.8784 KOps/s 9.7977 KOps/s $\color{#35bf28}+0.82\%$
test_keys_stack_nested_leaf 0.1628ms 84.9737μs 11.7683 KOps/s 11.7792 KOps/s $\color{#d91a1a}-0.09\%$
test_keys_stack_nested_locked 0.1484ms 0.1070ms 9.3452 KOps/s 9.3297 KOps/s $\color{#35bf28}+0.17\%$
test_values 5.8168μs 1.0428μs 958.9647 KOps/s 951.3652 KOps/s $\color{#35bf28}+0.80\%$
test_values_nested 0.1364ms 75.2114μs 13.2958 KOps/s 13.5008 KOps/s $\color{#d91a1a}-1.52\%$
test_values_nested_locked 0.1371ms 75.2902μs 13.2819 KOps/s 13.5209 KOps/s $\color{#d91a1a}-1.77\%$
test_values_nested_leaf 0.1186ms 61.7561μs 16.1927 KOps/s 15.8430 KOps/s $\color{#35bf28}+2.21\%$
test_values_stack_nested 0.1237ms 76.7305μs 13.0326 KOps/s 13.4494 KOps/s $\color{#d91a1a}-3.10\%$
test_values_stack_nested_leaf 0.1149ms 62.0233μs 16.1230 KOps/s 16.2201 KOps/s $\color{#d91a1a}-0.60\%$
test_values_stack_nested_locked 0.1424ms 76.7714μs 13.0257 KOps/s 13.4528 KOps/s $\color{#d91a1a}-3.17\%$
test_membership 3.6799μs 0.7710μs 1.2970 MOps/s 1.1377 MOps/s $\textbf{\color{#35bf28}+14.01\%}$
test_membership_nested 20.8390μs 2.8336μs 352.9067 KOps/s 342.2286 KOps/s $\color{#35bf28}+3.12\%$
test_membership_nested_leaf 20.1670μs 2.8123μs 355.5763 KOps/s 363.9504 KOps/s $\color{#d91a1a}-2.30\%$
test_membership_stacked_nested 25.6080μs 2.8315μs 353.1673 KOps/s 364.0319 KOps/s $\color{#d91a1a}-2.98\%$
test_membership_stacked_nested_leaf 27.2210μs 2.8732μs 348.0475 KOps/s 362.3386 KOps/s $\color{#d91a1a}-3.94\%$
test_membership_nested_last 22.3620μs 4.0373μs 247.6928 KOps/s 252.4880 KOps/s $\color{#d91a1a}-1.90\%$
test_membership_nested_leaf_last 24.3650μs 4.0076μs 249.5283 KOps/s 250.6667 KOps/s $\color{#d91a1a}-0.45\%$
test_membership_stacked_nested_last 23.3940μs 3.9931μs 250.4294 KOps/s 163.1146 KOps/s $\textbf{\color{#35bf28}+53.53\%}$
test_membership_stacked_nested_leaf_last 29.7760μs 4.0245μs 248.4759 KOps/s 164.0957 KOps/s $\textbf{\color{#35bf28}+51.42\%}$
test_nested_getleaf 40.8260μs 10.6604μs 93.8052 KOps/s 91.6647 KOps/s $\color{#35bf28}+2.34\%$
test_nested_get 31.6000μs 10.2333μs 97.7201 KOps/s 95.0011 KOps/s $\color{#35bf28}+2.86\%$
test_stacked_getleaf 34.6650μs 10.6146μs 94.2096 KOps/s 93.7222 KOps/s $\color{#35bf28}+0.52\%$
test_stacked_get 34.8660μs 10.3025μs 97.0633 KOps/s 95.0708 KOps/s $\color{#35bf28}+2.10\%$
test_nested_getitemleaf 38.2610μs 11.2299μs 89.0476 KOps/s 85.3071 KOps/s $\color{#35bf28}+4.38\%$
test_nested_getitem 32.5500μs 10.5013μs 95.2259 KOps/s 92.5898 KOps/s $\color{#35bf28}+2.85\%$
test_stacked_getitemleaf 34.5650μs 11.1955μs 89.3216 KOps/s 88.3783 KOps/s $\color{#35bf28}+1.07\%$
test_stacked_getitem 30.4460μs 10.4940μs 95.2930 KOps/s 95.3636 KOps/s $\color{#d91a1a}-0.07\%$
test_lock_nested 83.0334ms 0.5709ms 1.7515 KOps/s 2.0399 KOps/s $\textbf{\color{#d91a1a}-14.14\%}$
test_lock_stack_nested 0.8670ms 0.4572ms 2.1870 KOps/s 2.2239 KOps/s $\color{#d91a1a}-1.66\%$
test_unlock_nested 85.0411ms 0.4882ms 2.0482 KOps/s 2.4463 KOps/s $\textbf{\color{#d91a1a}-16.27\%}$
test_unlock_stack_nested 0.5688ms 0.3703ms 2.7009 KOps/s 2.7116 KOps/s $\color{#d91a1a}-0.39\%$
test_flatten_speed 0.1766ms 86.9901μs 11.4956 KOps/s 11.2287 KOps/s $\color{#35bf28}+2.38\%$
test_unflatten_speed 0.8207ms 0.4660ms 2.1459 KOps/s 2.1308 KOps/s $\color{#35bf28}+0.71\%$
test_common_ops 6.2163ms 1.0520ms 950.5513 Ops/s 935.2376 Ops/s $\color{#35bf28}+1.64\%$
test_creation 26.4590μs 2.1387μs 467.5708 KOps/s 463.9625 KOps/s $\color{#35bf28}+0.78\%$
test_creation_empty 42.2500μs 15.6576μs 63.8668 KOps/s 64.6022 KOps/s $\color{#d91a1a}-1.14\%$
test_creation_nested_1 68.9970μs 18.9546μs 52.7576 KOps/s 52.1483 KOps/s $\color{#35bf28}+1.17\%$
test_creation_nested_2 64.9310μs 22.7457μs 43.9643 KOps/s 42.6337 KOps/s $\color{#35bf28}+3.12\%$
test_clone 1.2895ms 17.2401μs 58.0043 KOps/s 56.9877 KOps/s $\color{#35bf28}+1.78\%$
test_getitem[int] 0.8488ms 16.9280μs 59.0739 KOps/s 57.1217 KOps/s $\color{#35bf28}+3.42\%$
test_getitem[slice_int] 0.1344ms 30.5875μs 32.6931 KOps/s 31.7941 KOps/s $\color{#35bf28}+2.83\%$
test_getitem[range] 0.1718ms 57.8640μs 17.2819 KOps/s 17.3671 KOps/s $\color{#d91a1a}-0.49\%$
test_getitem[tuple] 0.1303ms 25.4296μs 39.3242 KOps/s 38.8379 KOps/s $\color{#35bf28}+1.25\%$
test_getitem[list] 0.1767ms 53.0748μs 18.8413 KOps/s 18.6471 KOps/s $\color{#35bf28}+1.04\%$
test_setitem_dim[int] 56.5560μs 31.8156μs 31.4312 KOps/s 30.1866 KOps/s $\color{#35bf28}+4.12\%$
test_setitem_dim[slice_int] 0.1116ms 60.1273μs 16.6314 KOps/s 16.4019 KOps/s $\color{#35bf28}+1.40\%$
test_setitem_dim[range] 0.1796ms 83.6103μs 11.9602 KOps/s 11.7900 KOps/s $\color{#35bf28}+1.44\%$
test_setitem_dim[tuple] 75.7210μs 48.5038μs 20.6169 KOps/s 19.9262 KOps/s $\color{#35bf28}+3.47\%$
test_setitem 69.8300μs 27.4819μs 36.3876 KOps/s 34.9140 KOps/s $\color{#35bf28}+4.22\%$
test_set 0.1394ms 26.7029μs 37.4491 KOps/s 36.9010 KOps/s $\color{#35bf28}+1.49\%$
test_set_shared 1.3008ms 0.2102ms 4.7574 KOps/s 4.7499 KOps/s $\color{#35bf28}+0.16\%$
test_update 0.1439ms 32.8103μs 30.4782 KOps/s 30.7283 KOps/s $\color{#d91a1a}-0.81\%$
test_update_nested 0.1183ms 43.2317μs 23.1312 KOps/s 23.3731 KOps/s $\color{#d91a1a}-1.03\%$
test_update__nested 0.1269ms 34.2247μs 29.2187 KOps/s 28.7363 KOps/s $\color{#35bf28}+1.68\%$
test_set_nested 0.1271ms 29.5338μs 33.8595 KOps/s 33.1862 KOps/s $\color{#35bf28}+2.03\%$
test_set_nested_new 82.0130μs 34.6460μs 28.8633 KOps/s 28.0319 KOps/s $\color{#35bf28}+2.97\%$
test_select 1.2196ms 52.8045μs 18.9378 KOps/s 18.6337 KOps/s $\color{#35bf28}+1.63\%$
test_select_nested 0.1278ms 59.0553μs 16.9333 KOps/s 16.4402 KOps/s $\color{#35bf28}+3.00\%$
test_exclude_nested 0.1597ms 74.9682μs 13.3390 KOps/s 12.9929 KOps/s $\color{#35bf28}+2.66\%$
test_empty[True] 0.4922ms 0.3187ms 3.1375 KOps/s 3.0995 KOps/s $\color{#35bf28}+1.23\%$
test_empty[False] 5.7907μs 1.1991μs 833.9846 KOps/s 836.9779 KOps/s $\color{#d91a1a}-0.36\%$
test_unbind_speed 0.3905ms 0.3060ms 3.2682 KOps/s 3.3133 KOps/s $\color{#d91a1a}-1.36\%$
test_unbind_speed_stack0 0.4326ms 0.2959ms 3.3800 KOps/s 3.4292 KOps/s $\color{#d91a1a}-1.44\%$
test_unbind_speed_stack1 92.6478ms 0.8095ms 1.2353 KOps/s 1.4974 KOps/s $\textbf{\color{#d91a1a}-17.50\%}$
test_split 81.5529ms 2.1617ms 462.5898 Ops/s 456.4482 Ops/s $\color{#35bf28}+1.35\%$
test_chunk 3.1064ms 2.0118ms 497.0630 Ops/s 458.4147 Ops/s $\textbf{\color{#35bf28}+8.43\%}$
test_creation[device0] 0.2409ms 0.1160ms 8.6192 KOps/s 8.5840 KOps/s $\color{#35bf28}+0.41\%$
test_creation_from_tensor 3.0316ms 0.1162ms 8.6090 KOps/s 8.2710 KOps/s $\color{#35bf28}+4.09\%$
test_add_one[memmap_tensor0] 84.1580μs 7.0741μs 141.3602 KOps/s 139.3530 KOps/s $\color{#35bf28}+1.44\%$
test_contiguous[memmap_tensor0] 29.2450μs 1.9232μs 519.9714 KOps/s 524.6382 KOps/s $\color{#d91a1a}-0.89\%$
test_stack[memmap_tensor0] 86.4720μs 5.5649μs 179.6963 KOps/s 179.5274 KOps/s $\color{#35bf28}+0.09\%$
test_memmaptd_index 1.2202ms 0.3944ms 2.5353 KOps/s 2.4104 KOps/s $\textbf{\color{#35bf28}+5.18\%}$
test_memmaptd_index_astensor 0.7410ms 0.4694ms 2.1303 KOps/s 2.0564 KOps/s $\color{#35bf28}+3.59\%$
test_memmaptd_index_op 1.6128ms 0.9517ms 1.0508 KOps/s 1.0253 KOps/s $\color{#35bf28}+2.48\%$
test_serialize_model 0.2143s 0.1292s 7.7381 Ops/s 8.3948 Ops/s $\textbf{\color{#d91a1a}-7.82\%}$
test_serialize_model_pickle 0.4454s 0.3946s 2.5343 Ops/s 2.5505 Ops/s $\color{#d91a1a}-0.63\%$
test_serialize_weights 0.1200s 0.1110s 9.0085 Ops/s 8.6954 Ops/s $\color{#35bf28}+3.60\%$
test_serialize_weights_returnearly 0.1709s 0.1568s 6.3782 Ops/s 6.1699 Ops/s $\color{#35bf28}+3.38\%$
test_serialize_weights_pickle 0.4572s 0.3931s 2.5439 Ops/s 2.2157 Ops/s $\textbf{\color{#35bf28}+14.81\%}$
test_serialize_weights_filesystem 0.2221s 0.1508s 6.6292 Ops/s 6.4131 Ops/s $\color{#35bf28}+3.37\%$
test_serialize_model_filesystem 0.1578s 0.1479s 6.7626 Ops/s 6.5518 Ops/s $\color{#35bf28}+3.22\%$
test_reshape_pytree 80.7510μs 39.3263μs 25.4283 KOps/s 25.0662 KOps/s $\color{#35bf28}+1.44\%$
test_reshape_td 95.1980μs 45.8802μs 21.7959 KOps/s 21.4300 KOps/s $\color{#35bf28}+1.71\%$
test_view_pytree 88.9070μs 38.8748μs 25.7236 KOps/s 25.1099 KOps/s $\color{#35bf28}+2.44\%$
test_view_td 0.2259ms 53.3767μs 18.7348 KOps/s 18.9467 KOps/s $\color{#d91a1a}-1.12\%$
test_unbind_pytree 98.2240μs 36.5398μs 27.3674 KOps/s 28.0528 KOps/s $\color{#d91a1a}-2.44\%$
test_unbind_td 0.3111ms 44.9698μs 22.2371 KOps/s 22.4887 KOps/s $\color{#d91a1a}-1.12\%$
test_split_pytree 95.9680μs 38.2594μs 26.1374 KOps/s 26.4212 KOps/s $\color{#d91a1a}-1.07\%$
test_split_td 0.4490ms 56.8541μs 17.5889 KOps/s 17.0556 KOps/s $\color{#35bf28}+3.13\%$
test_add_pytree 0.1478ms 45.2303μs 22.1091 KOps/s 21.8483 KOps/s $\color{#35bf28}+1.19\%$
test_add_td 0.1529ms 75.0754μs 13.3199 KOps/s 13.3919 KOps/s $\color{#d91a1a}-0.54\%$
test_compile_add_one_nested[tensordict-compile] 0.1409ms 56.5410μs 17.6863 KOps/s 17.9998 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_add_one_nested[tensordict-eager] 0.3690ms 0.1735ms 5.7648 KOps/s 5.6878 KOps/s $\color{#35bf28}+1.35\%$
test_compile_add_one_nested[pytree-compile] 0.1754ms 56.6892μs 17.6400 KOps/s 18.0095 KOps/s $\color{#d91a1a}-2.05\%$
test_compile_add_one_nested[pytree-eager] 0.3375ms 0.1411ms 7.0892 KOps/s 6.9959 KOps/s $\color{#35bf28}+1.33\%$
test_compile_copy_nested[tensordict-compile] 49.5330μs 21.2251μs 47.1141 KOps/s 48.1971 KOps/s $\color{#d91a1a}-2.25\%$
test_compile_copy_nested[tensordict-eager] 0.1529ms 68.2354μs 14.6551 KOps/s 14.6576 KOps/s $\color{#d91a1a}-0.02\%$
test_compile_copy_nested[pytree-compile] 0.1522ms 77.3846μs 12.9225 KOps/s 12.9935 KOps/s $\color{#d91a1a}-0.55\%$
test_compile_copy_nested[pytree-eager] 0.1407ms 70.1505μs 14.2551 KOps/s 14.2634 KOps/s $\color{#d91a1a}-0.06\%$
test_compile_add_one_flat[tensordict-compile] 0.3194ms 0.1738ms 5.7548 KOps/s 5.8389 KOps/s $\color{#d91a1a}-1.44\%$
test_compile_add_one_flat[tensordict-eager] 0.3624ms 0.1873ms 5.3376 KOps/s 5.2144 KOps/s $\color{#35bf28}+2.36\%$
test_compile_add_one_flat[tensorclass-compile] 92.3440μs 45.3314μs 22.0597 KOps/s 21.3321 KOps/s $\color{#35bf28}+3.41\%$
test_compile_add_one_flat[tensorclass-eager] 0.1429ms 68.6043μs 14.5763 KOps/s 14.5541 KOps/s $\color{#35bf28}+0.15\%$
test_compile_add_one_flat[pytree-compile] 0.2401ms 0.1757ms 5.6909 KOps/s 5.7842 KOps/s $\color{#d91a1a}-1.61\%$
test_compile_add_one_flat[pytree-eager] 0.3804ms 0.2850ms 3.5082 KOps/s 3.4228 KOps/s $\color{#35bf28}+2.49\%$
test_compile_add_self_flat[tensordict-eager] 0.4597ms 0.2008ms 4.9797 KOps/s 4.8375 KOps/s $\color{#35bf28}+2.94\%$
test_compile_add_self_flat[tensordict-compile] 0.3389ms 0.1730ms 5.7796 KOps/s 5.7676 KOps/s $\color{#35bf28}+0.21\%$
test_compile_add_self_flat[tensorclass-eager] 0.1420ms 62.4837μs 16.0042 KOps/s 16.0979 KOps/s $\color{#d91a1a}-0.58\%$
test_compile_add_self_flat[tensorclass-compile] 0.1285ms 46.9721μs 21.2892 KOps/s 21.5313 KOps/s $\color{#d91a1a}-1.12\%$
test_compile_add_self_flat[pytree-eager] 0.3135ms 0.2311ms 4.3273 KOps/s 4.2518 KOps/s $\color{#35bf28}+1.78\%$
test_compile_add_self_flat[pytree-compile] 0.3188ms 0.1743ms 5.7376 KOps/s 5.7524 KOps/s $\color{#d91a1a}-0.26\%$
test_compile_copy_flat[tensordict-compile] 0.2272ms 0.1046ms 9.5557 KOps/s 9.6890 KOps/s $\color{#d91a1a}-1.38\%$
test_compile_copy_flat[tensordict-eager] 0.1225ms 58.7439μs 17.0230 KOps/s 17.3698 KOps/s $\color{#d91a1a}-2.00\%$
test_compile_copy_flat[pytree-compile] 0.1706ms 79.9941μs 12.5009 KOps/s 12.6399 KOps/s $\color{#d91a1a}-1.10\%$
test_compile_copy_flat[pytree-eager] 0.1392ms 70.3422μs 14.2162 KOps/s 14.2889 KOps/s $\color{#d91a1a}-0.51\%$
test_compile_assign_and_add[tensordict-compile] 0.3864ms 0.1961ms 5.0988 KOps/s 5.1924 KOps/s $\color{#d91a1a}-1.80\%$
test_compile_assign_and_add[tensordict-eager] 2.7559ms 1.6567ms 603.5985 Ops/s 608.5758 Ops/s $\color{#d91a1a}-0.82\%$
test_compile_assign_and_add[pytree-compile] 0.2820ms 0.1925ms 5.1938 KOps/s 5.2115 KOps/s $\color{#d91a1a}-0.34\%$
test_compile_assign_and_add[pytree-eager] 1.3263ms 1.0972ms 911.4026 Ops/s 916.6134 Ops/s $\color{#d91a1a}-0.57\%$
test_compile_assign_and_add_stack[compile] 0.7336ms 0.4126ms 2.4238 KOps/s 2.4123 KOps/s $\color{#35bf28}+0.48\%$
test_compile_assign_and_add_stack[eager] 3.9482ms 3.5684ms 280.2353 Ops/s 279.1129 Ops/s $\color{#35bf28}+0.40\%$
test_compile_indexing[tensor-tensordict-compile] 90.6500μs 32.7299μs 30.5531 KOps/s 29.6891 KOps/s $\color{#35bf28}+2.91\%$
test_compile_indexing[tensor-tensordict-eager] 0.8593ms 47.3147μs 21.1351 KOps/s 20.6177 KOps/s $\color{#35bf28}+2.51\%$
test_compile_indexing[tensor-tensorclass-compile] 67.5160μs 28.3319μs 35.2959 KOps/s 33.1085 KOps/s $\textbf{\color{#35bf28}+6.61\%}$
test_compile_indexing[tensor-tensorclass-eager] 80.1000μs 29.6884μs 33.6831 KOps/s 33.9617 KOps/s $\color{#d91a1a}-0.82\%$
test_compile_indexing[tensor-pytree-compile] 73.8780μs 28.3346μs 35.2926 KOps/s 33.0056 KOps/s $\textbf{\color{#35bf28}+6.93\%}$
test_compile_indexing[tensor-pytree-eager] 69.1690μs 29.5366μs 33.8563 KOps/s 34.4046 KOps/s $\color{#d91a1a}-1.59\%$
test_compile_indexing[slice-tensordict-compile] 0.1568ms 71.4301μs 13.9997 KOps/s 13.8370 KOps/s $\color{#35bf28}+1.18\%$
test_compile_indexing[slice-tensordict-eager] 0.4469ms 27.4353μs 36.4494 KOps/s 35.7960 KOps/s $\color{#35bf28}+1.83\%$
test_compile_indexing[slice-tensorclass-compile] 0.1493ms 67.1019μs 14.9027 KOps/s 14.5615 KOps/s $\color{#35bf28}+2.34\%$
test_compile_indexing[slice-tensorclass-eager] 78.6970μs 23.4253μs 42.6889 KOps/s 42.6403 KOps/s $\color{#35bf28}+0.11\%$
test_compile_indexing[slice-pytree-compile] 0.1499ms 66.3119μs 15.0803 KOps/s 14.8020 KOps/s $\color{#35bf28}+1.88\%$
test_compile_indexing[slice-pytree-eager] 78.6970μs 23.6530μs 42.2779 KOps/s 42.8773 KOps/s $\color{#d91a1a}-1.40\%$
test_compile_indexing[int-tensordict-compile] 0.1104ms 70.5792μs 14.1685 KOps/s 13.7822 KOps/s $\color{#35bf28}+2.80\%$
test_compile_indexing[int-tensordict-eager] 0.8396ms 26.7017μs 37.4508 KOps/s 36.0477 KOps/s $\color{#35bf28}+3.89\%$
test_compile_indexing[int-tensorclass-compile] 0.1597ms 66.7682μs 14.9772 KOps/s 14.6249 KOps/s $\color{#35bf28}+2.41\%$
test_compile_indexing[int-tensorclass-eager] 64.2000μs 23.2922μs 42.9328 KOps/s 43.6519 KOps/s $\color{#d91a1a}-1.65\%$
test_compile_indexing[int-pytree-compile] 0.1651ms 65.7666μs 15.2053 KOps/s 14.7707 KOps/s $\color{#35bf28}+2.94\%$
test_compile_indexing[int-pytree-eager] 59.1710μs 23.0952μs 43.2990 KOps/s 43.3399 KOps/s $\color{#d91a1a}-0.09\%$
test_mod_add[eager] 59.2310μs 23.3473μs 42.8314 KOps/s 41.8968 KOps/s $\color{#35bf28}+2.23\%$
test_mod_add[compile] 83.9970μs 37.7505μs 26.4897 KOps/s 25.6303 KOps/s $\color{#35bf28}+3.35\%$
test_mod_add[compile-overhead] 84.2080μs 37.9909μs 26.3221 KOps/s 25.2594 KOps/s $\color{#35bf28}+4.21\%$
test_mod_wrap[eager] 0.4255ms 0.1997ms 5.0078 KOps/s 4.9588 KOps/s $\color{#35bf28}+0.99\%$
test_mod_wrap[compile] 0.4287ms 0.2298ms 4.3514 KOps/s 4.3427 KOps/s $\color{#35bf28}+0.20\%$
test_mod_wrap[compile-overhead] 0.4326ms 0.2258ms 4.4296 KOps/s 4.4071 KOps/s $\color{#35bf28}+0.51\%$
test_mod_wrap_and_backward[eager] 12.0379ms 10.5252ms 95.0100 Ops/s 91.8074 Ops/s $\color{#35bf28}+3.49\%$
test_mod_wrap_and_backward[compile] 12.5134ms 10.7037ms 93.4253 Ops/s 85.4859 Ops/s $\textbf{\color{#35bf28}+9.29\%}$
test_mod_wrap_and_backward[compile-overhead] 12.2436ms 10.9116ms 91.6454 Ops/s 85.3843 Ops/s $\textbf{\color{#35bf28}+7.33\%}$
test_seq_add[eager] 0.2770ms 89.0602μs 11.2284 KOps/s 11.6980 KOps/s $\color{#d91a1a}-4.01\%$
test_seq_add[compile] 0.1240ms 63.7468μs 15.6871 KOps/s 15.6001 KOps/s $\color{#35bf28}+0.56\%$
test_seq_add[compile-overhead] 0.1079ms 61.7211μs 16.2019 KOps/s 15.7720 KOps/s $\color{#35bf28}+2.73\%$
test_seq_wrap[eager] 0.5085ms 0.3663ms 2.7300 KOps/s 2.7759 KOps/s $\color{#d91a1a}-1.65\%$
test_seq_wrap[compile] 0.8926ms 0.2619ms 3.8190 KOps/s 3.7634 KOps/s $\color{#35bf28}+1.48\%$
test_seq_wrap[compile-overhead] 1.0321ms 0.2726ms 3.6685 KOps/s 3.7266 KOps/s $\color{#d91a1a}-1.56\%$
test_func_call_runtime[False-eager] 0.6628ms 0.5095ms 1.9627 KOps/s 1.9623 KOps/s $\color{#35bf28}+0.02\%$
test_func_call_runtime[False-compile] 1.0399ms 0.4997ms 2.0014 KOps/s 2.0279 KOps/s $\color{#d91a1a}-1.31\%$
test_func_call_runtime[False-compile-overhead] 1.0299ms 0.4945ms 2.0223 KOps/s 2.0289 KOps/s $\color{#d91a1a}-0.33\%$
test_func_call_runtime[True-eager] 1.2551ms 0.7325ms 1.3651 KOps/s 1.3833 KOps/s $\color{#d91a1a}-1.31\%$
test_func_call_runtime[True-compile] 0.6544ms 0.5038ms 1.9849 KOps/s 1.9591 KOps/s $\color{#35bf28}+1.32\%$
test_func_call_runtime[True-compile-overhead] 0.7126ms 0.5000ms 2.0000 KOps/s 1.9578 KOps/s $\color{#35bf28}+2.15\%$
test_func_call_cm_runtime[False-eager] 0.6285ms 0.5039ms 1.9846 KOps/s 1.9990 KOps/s $\color{#d91a1a}-0.72\%$
test_func_call_cm_runtime[False-compile] 0.6412ms 0.4944ms 2.0225 KOps/s 2.0232 KOps/s $\color{#d91a1a}-0.03\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6065ms 0.4900ms 2.0409 KOps/s 2.0260 KOps/s $\color{#35bf28}+0.73\%$
test_func_call_cm_runtime[True-eager] 1.3381ms 0.8533ms 1.1719 KOps/s 1.1787 KOps/s $\color{#d91a1a}-0.57\%$
test_func_call_cm_runtime[True-compile] 0.8334ms 0.7142ms 1.4001 KOps/s 1.3794 KOps/s $\color{#35bf28}+1.50\%$
test_func_call_cm_runtime[True-compile-overhead] 1.2278ms 0.7156ms 1.3974 KOps/s 1.3781 KOps/s $\color{#35bf28}+1.40\%$
test_vmap_func_call_cm_runtime[eager] 2.3046ms 1.8289ms 546.7767 Ops/s 530.6623 Ops/s $\color{#35bf28}+3.04\%$
test_vmap_func_call_cm_runtime[compile] 2.5831ms 1.8951ms 527.6819 Ops/s 527.2431 Ops/s $\color{#35bf28}+0.08\%$
test_vmap_func_call_cm_runtime[compile-overhead] 2.6141ms 1.8963ms 527.3560 Ops/s 524.5766 Ops/s $\color{#35bf28}+0.53\%$
test_distributed 0.2264ms 0.1241ms 8.0595 KOps/s 7.8874 KOps/s $\color{#35bf28}+2.18\%$
test_tdmodule 51.3860μs 17.0582μs 58.6227 KOps/s 61.4856 KOps/s $\color{#d91a1a}-4.66\%$
test_tdmodule_dispatch 61.7560μs 33.1406μs 30.1745 KOps/s 31.3697 KOps/s $\color{#d91a1a}-3.81\%$
test_tdseq 35.5270μs 18.6097μs 53.7355 KOps/s 53.3325 KOps/s $\color{#35bf28}+0.76\%$
test_tdseq_dispatch 70.3820μs 38.4542μs 26.0049 KOps/s 26.6291 KOps/s $\color{#d91a1a}-2.34\%$
test_instantiation_functorch 1.8847ms 1.5583ms 641.7099 Ops/s 632.1417 Ops/s $\color{#35bf28}+1.51\%$
test_instantiation_td 1.9738ms 1.1664ms 857.3707 Ops/s 852.4408 Ops/s $\color{#35bf28}+0.58\%$
test_exec_functorch 0.3400ms 0.1814ms 5.5142 KOps/s 5.4546 KOps/s $\color{#35bf28}+1.09\%$
test_exec_functional_call 0.3517ms 0.1679ms 5.9552 KOps/s 5.8011 KOps/s $\color{#35bf28}+2.66\%$
test_exec_td 0.2493ms 0.1625ms 6.1523 KOps/s 5.9146 KOps/s $\color{#35bf28}+4.02\%$
test_exec_td_decorator 1.1959ms 0.2190ms 4.5668 KOps/s 4.4069 KOps/s $\color{#35bf28}+3.63\%$
test_vmap_mlp_speed[True-True] 0.9524ms 0.6367ms 1.5707 KOps/s 1.5805 KOps/s $\color{#d91a1a}-0.62\%$
test_vmap_mlp_speed[True-False] 0.7631ms 0.6351ms 1.5746 KOps/s 1.5891 KOps/s $\color{#d91a1a}-0.92\%$
test_vmap_mlp_speed[False-True] 0.7866ms 0.4964ms 2.0144 KOps/s 2.0454 KOps/s $\color{#d91a1a}-1.51\%$
test_vmap_mlp_speed[False-False] 0.6551ms 0.4961ms 2.0157 KOps/s 2.0360 KOps/s $\color{#d91a1a}-1.00\%$
test_vmap_mlp_speed_decorator[True-True] 0.9447ms 0.6187ms 1.6164 KOps/s 1.6328 KOps/s $\color{#d91a1a}-1.00\%$
test_vmap_mlp_speed_decorator[True-False] 0.9460ms 0.6194ms 1.6144 KOps/s 1.6307 KOps/s $\color{#d91a1a}-1.00\%$
test_vmap_mlp_speed_decorator[False-True] 0.7529ms 0.5104ms 1.9591 KOps/s 1.9527 KOps/s $\color{#35bf28}+0.33\%$
test_vmap_mlp_speed_decorator[False-False] 0.9084ms 0.5111ms 1.9564 KOps/s 1.9457 KOps/s $\color{#35bf28}+0.55\%$
test_to_module_speed[True] 2.1121ms 1.3149ms 760.4871 Ops/s 764.8971 Ops/s $\color{#d91a1a}-0.58\%$
test_to_module_speed[False] 1.7615ms 1.2772ms 782.9395 Ops/s 789.1886 Ops/s $\color{#d91a1a}-0.79\%$
test_tc_init 86.7730μs 42.4159μs 23.5761 KOps/s 23.4115 KOps/s $\color{#35bf28}+0.70\%$
test_tc_init_nested 0.1412ms 83.7362μs 11.9423 KOps/s 11.9400 KOps/s $\color{#35bf28}+0.02\%$
test_tc_first_layer_tensor 22.0810μs 1.5527μs 644.0411 KOps/s 654.4185 KOps/s $\color{#d91a1a}-1.59\%$
test_tc_first_layer_nontensor 43.6240μs 4.8070μs 208.0286 KOps/s 213.8229 KOps/s $\color{#d91a1a}-2.71\%$
test_tc_second_layer_tensor 18.2940μs 2.8405μs 352.0521 KOps/s 357.7640 KOps/s $\color{#d91a1a}-1.60\%$
test_tc_second_layer_nontensor 27.8620μs 6.1170μs 163.4795 KOps/s 164.7317 KOps/s $\color{#d91a1a}-0.76\%$
test_unbind 0.4556s 12.9038ms 77.4968 Ops/s 74.7872 Ops/s $\color{#35bf28}+3.62\%$
test_full_like 7.2921ms 6.6309ms 150.8080 Ops/s 149.9393 Ops/s $\color{#35bf28}+0.58\%$
test_zeros_like 3.0221ms 2.5540ms 391.5362 Ops/s 389.3320 Ops/s $\color{#35bf28}+0.57\%$
test_ones_like 3.3844ms 2.9978ms 333.5791 Ops/s 168.5809 Ops/s $\textbf{\color{#35bf28}+97.87\%}$
test_clone 4.8706ms 4.5754ms 218.5613 Ops/s 131.2008 Ops/s $\textbf{\color{#35bf28}+66.59\%}$
test_squeeze 55.7240μs 12.1286μs 82.4499 KOps/s 77.7010 KOps/s $\textbf{\color{#35bf28}+6.11\%}$
test_unsqueeze 0.3330ms 92.9762μs 10.7554 KOps/s 10.8731 KOps/s $\color{#d91a1a}-1.08\%$
test_split 0.3358ms 0.1978ms 5.0551 KOps/s 5.1661 KOps/s $\color{#d91a1a}-2.15\%$
test_permute 0.3238ms 0.2193ms 4.5593 KOps/s 4.4049 KOps/s $\color{#35bf28}+3.51\%$
test_stack 31.4845ms 24.0275ms 41.6190 Ops/s 42.1654 Ops/s $\color{#d91a1a}-1.30\%$
test_cat 27.3639ms 23.2931ms 42.9313 Ops/s 42.5671 Ops/s $\color{#35bf28}+0.86\%$

Copy link

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}40$. Worsened: $\large\color{#d91a1a}10$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 63.4810μs 13.6069μs 73.4923 KOps/s 70.2448 KOps/s $\color{#35bf28}+4.62\%$
test_plain_set_stack_nested 32.1310μs 13.5377μs 73.8677 KOps/s 69.7560 KOps/s $\textbf{\color{#35bf28}+5.89\%}$
test_plain_set_nested_inplace 0.1128ms 14.4472μs 69.2175 KOps/s 66.1771 KOps/s $\color{#35bf28}+4.59\%$
test_plain_set_stack_nested_inplace 44.0410μs 14.6430μs 68.2919 KOps/s 66.5036 KOps/s $\color{#35bf28}+2.69\%$
test_items 29.1810μs 2.8601μs 349.6383 KOps/s 343.1253 KOps/s $\color{#35bf28}+1.90\%$
test_items_nested 0.3551ms 0.3246ms 3.0811 KOps/s 3.0928 KOps/s $\color{#d91a1a}-0.38\%$
test_items_nested_locked 0.3821ms 0.3293ms 3.0363 KOps/s 3.0408 KOps/s $\color{#d91a1a}-0.15\%$
test_items_nested_leaf 80.4520μs 55.5719μs 17.9947 KOps/s 17.8360 KOps/s $\color{#35bf28}+0.89\%$
test_items_stack_nested 0.3962ms 0.3322ms 3.0100 KOps/s 3.0721 KOps/s $\color{#d91a1a}-2.02\%$
test_items_stack_nested_leaf 81.9820μs 57.3410μs 17.4395 KOps/s 17.5510 KOps/s $\color{#d91a1a}-0.63\%$
test_items_stack_nested_locked 0.3668ms 0.3320ms 3.0122 KOps/s 3.0679 KOps/s $\color{#d91a1a}-1.82\%$
test_keys 25.7300μs 3.4233μs 292.1190 KOps/s 284.1720 KOps/s $\color{#35bf28}+2.80\%$
test_keys_nested 0.1171ms 54.3453μs 18.4008 KOps/s 18.2843 KOps/s $\color{#35bf28}+0.64\%$
test_keys_nested_locked 2.6080ms 61.4813μs 16.2651 KOps/s 15.9159 KOps/s $\color{#35bf28}+2.19\%$
test_keys_nested_leaf 80.6820μs 46.8799μs 21.3311 KOps/s 21.6996 KOps/s $\color{#d91a1a}-1.70\%$
test_keys_stack_nested 87.1020μs 56.6573μs 17.6500 KOps/s 17.5877 KOps/s $\color{#35bf28}+0.35\%$
test_keys_stack_nested_leaf 84.9810μs 48.2717μs 20.7161 KOps/s 20.8006 KOps/s $\color{#d91a1a}-0.41\%$
test_keys_stack_nested_locked 92.2520μs 61.8315μs 16.1730 KOps/s 16.1054 KOps/s $\color{#35bf28}+0.42\%$
test_values 4.7052μs 0.8381μs 1.1931 MOps/s 1.1987 MOps/s $\color{#d91a1a}-0.47\%$
test_values_nested 68.4420μs 40.9814μs 24.4013 KOps/s 24.5043 KOps/s $\color{#d91a1a}-0.42\%$
test_values_nested_locked 68.1220μs 42.7624μs 23.3850 KOps/s 23.3332 KOps/s $\color{#35bf28}+0.22\%$
test_values_nested_leaf 62.0410μs 35.7046μs 28.0076 KOps/s 28.2980 KOps/s $\color{#d91a1a}-1.03\%$
test_values_stack_nested 72.5320μs 41.8599μs 23.8892 KOps/s 24.1756 KOps/s $\color{#d91a1a}-1.18\%$
test_values_stack_nested_leaf 66.0410μs 36.0934μs 27.7059 KOps/s 27.9969 KOps/s $\color{#d91a1a}-1.04\%$
test_values_stack_nested_locked 85.2520μs 43.5607μs 22.9565 KOps/s 22.9152 KOps/s $\color{#35bf28}+0.18\%$
test_membership 1.6596μs 0.5041μs 1.9839 MOps/s 2.0047 MOps/s $\color{#d91a1a}-1.04\%$
test_membership_nested 13.4250μs 1.9033μs 525.4054 KOps/s 506.7147 KOps/s $\color{#35bf28}+3.69\%$
test_membership_nested_leaf 11.8637μs 1.8635μs 536.6326 KOps/s 524.5538 KOps/s $\color{#35bf28}+2.30\%$
test_membership_stacked_nested 28.1110μs 2.0147μs 496.3441 KOps/s 513.3812 KOps/s $\color{#d91a1a}-3.32\%$
test_membership_stacked_nested_leaf 18.2100μs 1.9966μs 500.8398 KOps/s 516.6043 KOps/s $\color{#d91a1a}-3.05\%$
test_membership_nested_last 38.7810μs 2.8376μs 352.4159 KOps/s 362.6859 KOps/s $\color{#d91a1a}-2.83\%$
test_membership_nested_leaf_last 29.5910μs 2.8220μs 354.3606 KOps/s 314.5672 KOps/s $\textbf{\color{#35bf28}+12.65\%}$
test_membership_stacked_nested_last 38.7510μs 3.4408μs 290.6321 KOps/s 311.2717 KOps/s $\textbf{\color{#d91a1a}-6.63\%}$
test_membership_stacked_nested_leaf_last 29.8110μs 3.4502μs 289.8402 KOps/s 301.2950 KOps/s $\color{#d91a1a}-3.80\%$
test_nested_getleaf 41.2510μs 6.0454μs 165.4148 KOps/s 165.9031 KOps/s $\color{#d91a1a}-0.29\%$
test_nested_get 43.1110μs 5.7012μs 175.4005 KOps/s 178.0945 KOps/s $\color{#d91a1a}-1.51\%$
test_stacked_getleaf 37.7600μs 6.0526μs 165.2183 KOps/s 165.2471 KOps/s $\color{#d91a1a}-0.02\%$
test_stacked_get 35.3610μs 5.6267μs 177.7232 KOps/s 177.5370 KOps/s $\color{#35bf28}+0.10\%$
test_nested_getitemleaf 41.8710μs 6.1384μs 162.9098 KOps/s 163.4747 KOps/s $\color{#d91a1a}-0.35\%$
test_nested_getitem 29.8710μs 5.7292μs 174.5441 KOps/s 173.9660 KOps/s $\color{#35bf28}+0.33\%$
test_stacked_getitemleaf 53.3710μs 6.0818μs 164.4237 KOps/s 164.0957 KOps/s $\color{#35bf28}+0.20\%$
test_stacked_getitem 32.4910μs 5.6764μs 176.1676 KOps/s 175.4009 KOps/s $\color{#35bf28}+0.44\%$
test_lock_nested 5.1956ms 0.4272ms 2.3406 KOps/s 2.3333 KOps/s $\color{#35bf28}+0.31\%$
test_lock_stack_nested 0.4315ms 0.3826ms 2.6136 KOps/s 2.5613 KOps/s $\color{#35bf28}+2.04\%$
test_unlock_nested 0.7688ms 0.3634ms 2.7518 KOps/s 2.7083 KOps/s $\color{#35bf28}+1.61\%$
test_unlock_stack_nested 0.3820ms 0.3222ms 3.1040 KOps/s 2.9884 KOps/s $\color{#35bf28}+3.87\%$
test_flatten_speed 0.1463ms 68.7798μs 14.5392 KOps/s 14.3412 KOps/s $\color{#35bf28}+1.38\%$
test_unflatten_speed 0.3887ms 0.2816ms 3.5509 KOps/s 3.3793 KOps/s $\textbf{\color{#35bf28}+5.08\%}$
test_common_ops 1.6138ms 1.3334ms 749.9532 Ops/s 797.2723 Ops/s $\textbf{\color{#d91a1a}-5.94\%}$
test_creation 18.4300μs 1.4519μs 688.7613 KOps/s 664.4970 KOps/s $\color{#35bf28}+3.65\%$
test_creation_empty 44.2210μs 15.1836μs 65.8607 KOps/s 61.2116 KOps/s $\textbf{\color{#35bf28}+7.60\%}$
test_creation_nested_1 51.5910μs 16.7571μs 59.6762 KOps/s 56.2776 KOps/s $\textbf{\color{#35bf28}+6.04\%}$
test_creation_nested_2 57.4110μs 20.8856μs 47.8799 KOps/s 46.4192 KOps/s $\color{#35bf28}+3.15\%$
test_clone 67.9520μs 31.0819μs 32.1731 KOps/s 31.4847 KOps/s $\color{#35bf28}+2.19\%$
test_getitem[int] 1.4574ms 18.4070μs 54.3270 KOps/s 61.1068 KOps/s $\textbf{\color{#d91a1a}-11.09\%}$
test_getitem[slice_int] 0.1249ms 31.5412μs 31.7046 KOps/s 33.5998 KOps/s $\textbf{\color{#d91a1a}-5.64\%}$
test_getitem[range] 0.1544ms 0.1087ms 9.2020 KOps/s 9.1263 KOps/s $\color{#35bf28}+0.83\%$
test_getitem[tuple] 0.1233ms 24.4409μs 40.9150 KOps/s 36.5954 KOps/s $\textbf{\color{#35bf28}+11.80\%}$
test_getitem[list] 0.1959ms 98.6328μs 10.1386 KOps/s 9.3388 KOps/s $\textbf{\color{#35bf28}+8.56\%}$
test_setitem_dim[int] 66.6010μs 45.1856μs 22.1309 KOps/s 19.6865 KOps/s $\textbf{\color{#35bf28}+12.42\%}$
test_setitem_dim[slice_int] 0.1010ms 67.9331μs 14.7204 KOps/s 13.4207 KOps/s $\textbf{\color{#35bf28}+9.68\%}$
test_setitem_dim[range] 0.1950ms 0.1271ms 7.8705 KOps/s 7.3444 KOps/s $\textbf{\color{#35bf28}+7.16\%}$
test_setitem_dim[tuple] 87.8240μs 61.5529μs 16.2462 KOps/s 14.8651 KOps/s $\textbf{\color{#35bf28}+9.29\%}$
test_setitem 78.2630μs 41.5912μs 24.0436 KOps/s 21.6002 KOps/s $\textbf{\color{#35bf28}+11.31\%}$
test_set 75.0730μs 40.6988μs 24.5708 KOps/s 22.1352 KOps/s $\textbf{\color{#35bf28}+11.00\%}$
test_set_shared 0.3829ms 52.4528μs 19.0648 KOps/s 17.9363 KOps/s $\textbf{\color{#35bf28}+6.29\%}$
test_update 0.1076ms 54.0112μs 18.5147 KOps/s 17.9205 KOps/s $\color{#35bf28}+3.32\%$
test_update_nested 0.1571ms 57.1210μs 17.5067 KOps/s 15.7238 KOps/s $\textbf{\color{#35bf28}+11.34\%}$
test_update__nested 0.1158ms 59.3361μs 16.8531 KOps/s 15.2228 KOps/s $\textbf{\color{#35bf28}+10.71\%}$
test_set_nested 78.8720μs 43.6407μs 22.9144 KOps/s 20.4984 KOps/s $\textbf{\color{#35bf28}+11.79\%}$
test_set_nested_new 88.4920μs 46.5405μs 21.4866 KOps/s 19.3727 KOps/s $\textbf{\color{#35bf28}+10.91\%}$
test_select 0.1033ms 59.4258μs 16.8277 KOps/s 14.9308 KOps/s $\textbf{\color{#35bf28}+12.70\%}$
test_select_nested 0.6090ms 42.0140μs 23.8016 KOps/s 22.3762 KOps/s $\textbf{\color{#35bf28}+6.37\%}$
test_exclude_nested 99.9820μs 58.2719μs 17.1609 KOps/s 15.5798 KOps/s $\textbf{\color{#35bf28}+10.15\%}$
test_empty[True] 0.3073ms 0.2425ms 4.1239 KOps/s 4.0105 KOps/s $\color{#35bf28}+2.83\%$
test_empty[False] 3.7971μs 0.7491μs 1.3349 MOps/s 1.3294 MOps/s $\color{#35bf28}+0.41\%$
test_to 57.0810μs 24.4862μs 40.8393 KOps/s 41.4734 KOps/s $\color{#d91a1a}-1.53\%$
test_to_nonblocking 59.4610μs 23.3414μs 42.8423 KOps/s 43.6929 KOps/s $\color{#d91a1a}-1.95\%$
test_unbind_speed 0.3401ms 0.2858ms 3.4991 KOps/s 3.5277 KOps/s $\color{#d91a1a}-0.81\%$
test_unbind_speed_stack0 0.3212ms 0.2798ms 3.5744 KOps/s 3.5250 KOps/s $\color{#35bf28}+1.40\%$
test_unbind_speed_stack1 99.1894ms 0.7058ms 1.4167 KOps/s 1.3664 KOps/s $\color{#35bf28}+3.69\%$
test_split 99.7283ms 2.2255ms 449.3361 Ops/s 444.8484 Ops/s $\color{#35bf28}+1.01\%$
test_chunk 0.1004s 2.2185ms 450.7456 Ops/s 445.0396 Ops/s $\color{#35bf28}+1.28\%$
test_creation[device0] 0.3564ms 0.1309ms 7.6371 KOps/s 7.4881 KOps/s $\color{#35bf28}+1.99\%$
test_creation_from_tensor 0.3435ms 0.1335ms 7.4902 KOps/s 7.3920 KOps/s $\color{#35bf28}+1.33\%$
test_add_one[memmap_tensor0] 0.1805ms 10.0853μs 99.1543 KOps/s 100.0849 KOps/s $\color{#d91a1a}-0.93\%$
test_contiguous[memmap_tensor0] 23.8610μs 2.2060μs 453.3013 KOps/s 451.2061 KOps/s $\color{#35bf28}+0.46\%$
test_stack[memmap_tensor0] 36.7810μs 6.9770μs 143.3286 KOps/s 144.9345 KOps/s $\color{#d91a1a}-1.11\%$
test_memmaptd_index 1.2358ms 0.4344ms 2.3018 KOps/s 2.2000 KOps/s $\color{#35bf28}+4.63\%$
test_memmaptd_index_astensor 0.9952ms 0.4960ms 2.0161 KOps/s 1.9734 KOps/s $\color{#35bf28}+2.16\%$
test_memmaptd_index_op 1.4454ms 1.0357ms 965.5028 Ops/s 889.9751 Ops/s $\textbf{\color{#35bf28}+8.49\%}$
test_serialize_model 0.1311s 0.1294s 7.7258 Ops/s 7.6981 Ops/s $\color{#35bf28}+0.36\%$
test_serialize_model_pickle 1.3499s 1.2132s 0.8243 Ops/s 0.8245 Ops/s $\color{#d91a1a}-0.03\%$
test_serialize_weights 0.2292s 0.1431s 6.9886 Ops/s 7.7680 Ops/s $\textbf{\color{#d91a1a}-10.03\%}$
test_serialize_weights_returnearly 0.2315s 58.3685ms 17.1325 Ops/s 18.6339 Ops/s $\textbf{\color{#d91a1a}-8.06\%}$
test_serialize_weights_pickle 1.3716s 1.2168s 0.8218 Ops/s 0.8242 Ops/s $\color{#d91a1a}-0.30\%$
test_reshape_pytree 73.1010μs 36.5332μs 27.3724 KOps/s 28.2297 KOps/s $\color{#d91a1a}-3.04\%$
test_reshape_td 86.8720μs 42.0578μs 23.7768 KOps/s 22.9123 KOps/s $\color{#35bf28}+3.77\%$
test_view_pytree 77.3920μs 36.2243μs 27.6058 KOps/s 27.6801 KOps/s $\color{#d91a1a}-0.27\%$
test_view_td 87.8420μs 47.5227μs 21.0426 KOps/s 21.1736 KOps/s $\color{#d91a1a}-0.62\%$
test_unbind_pytree 63.7320μs 35.1401μs 28.4575 KOps/s 28.4231 KOps/s $\color{#35bf28}+0.12\%$
test_unbind_td 0.4699ms 44.0386μs 22.7073 KOps/s 22.4522 KOps/s $\color{#35bf28}+1.14\%$
test_split_pytree 83.3320μs 46.5235μs 21.4945 KOps/s 20.6429 KOps/s $\color{#35bf28}+4.13\%$
test_split_td 0.6756ms 56.1837μs 17.7988 KOps/s 15.6637 KOps/s $\textbf{\color{#35bf28}+13.63\%}$
test_add_pytree 0.1010ms 58.0567μs 17.2245 KOps/s 16.0947 KOps/s $\textbf{\color{#35bf28}+7.02\%}$
test_add_td 0.1547ms 93.4809μs 10.6974 KOps/s 9.8770 KOps/s $\textbf{\color{#35bf28}+8.31\%}$
test_compile_add_one_nested[tensordict-compile] 0.4186ms 0.2104ms 4.7520 KOps/s 4.7316 KOps/s $\color{#35bf28}+0.43\%$
test_compile_add_one_nested[tensordict-eager] 0.2632ms 0.1496ms 6.6846 KOps/s 6.6750 KOps/s $\color{#35bf28}+0.15\%$
test_compile_add_one_nested[pytree-compile] 0.2303ms 0.1444ms 6.9269 KOps/s 6.6636 KOps/s $\color{#35bf28}+3.95\%$
test_compile_add_one_nested[pytree-eager] 0.2801ms 0.1847ms 5.4130 KOps/s 5.1327 KOps/s $\textbf{\color{#35bf28}+5.46\%}$
test_compile_copy_nested[tensordict-compile] 55.5110μs 21.5953μs 46.3063 KOps/s 47.9459 KOps/s $\color{#d91a1a}-3.42\%$
test_compile_copy_nested[tensordict-eager] 0.1015ms 43.4793μs 22.9995 KOps/s 22.9800 KOps/s $\color{#35bf28}+0.08\%$
test_compile_copy_nested[pytree-compile] 0.2379ms 64.8000μs 15.4321 KOps/s 15.3771 KOps/s $\color{#35bf28}+0.36\%$
test_compile_copy_nested[pytree-eager] 80.4920μs 49.5476μs 20.1826 KOps/s 20.1570 KOps/s $\color{#35bf28}+0.13\%$
test_compile_add_one_flat[tensordict-compile] 0.3671ms 0.3178ms 3.1465 KOps/s 3.0821 KOps/s $\color{#35bf28}+2.09\%$
test_compile_add_one_flat[tensordict-eager] 0.3047ms 0.2068ms 4.8365 KOps/s 4.8600 KOps/s $\color{#d91a1a}-0.48\%$
test_compile_add_one_flat[tensorclass-compile] 0.1727ms 0.1269ms 7.8779 KOps/s 7.5766 KOps/s $\color{#35bf28}+3.98\%$
test_compile_add_one_flat[tensorclass-eager] 0.1020ms 59.6026μs 16.7778 KOps/s 16.2057 KOps/s $\color{#35bf28}+3.53\%$
test_compile_add_one_flat[pytree-compile] 0.4018ms 0.3190ms 3.1349 KOps/s 3.1142 KOps/s $\color{#35bf28}+0.66\%$
test_compile_add_one_flat[pytree-eager] 0.6803ms 0.6342ms 1.5767 KOps/s 1.6294 KOps/s $\color{#d91a1a}-3.24\%$
test_compile_add_self_flat[tensordict-eager] 0.2985ms 0.2470ms 4.0487 KOps/s 4.0625 KOps/s $\color{#d91a1a}-0.34\%$
test_compile_add_self_flat[tensordict-compile] 0.3792ms 0.3171ms 3.1538 KOps/s 3.0949 KOps/s $\color{#35bf28}+1.90\%$
test_compile_add_self_flat[tensorclass-eager] 0.1187ms 70.6753μs 14.1492 KOps/s 13.4423 KOps/s $\textbf{\color{#35bf28}+5.26\%}$
test_compile_add_self_flat[tensorclass-compile] 0.1795ms 0.1272ms 7.8630 KOps/s 7.5168 KOps/s $\color{#35bf28}+4.61\%$
test_compile_add_self_flat[pytree-eager] 0.6077ms 0.5380ms 1.8587 KOps/s 1.8993 KOps/s $\color{#d91a1a}-2.14\%$
test_compile_add_self_flat[pytree-compile] 0.3617ms 0.3189ms 3.1358 KOps/s 3.1076 KOps/s $\color{#35bf28}+0.91\%$
test_compile_copy_flat[tensordict-compile] 86.7420μs 18.0177μs 55.5010 KOps/s 55.1424 KOps/s $\color{#35bf28}+0.65\%$
test_compile_copy_flat[tensordict-eager] 59.3020μs 27.2419μs 36.7081 KOps/s 36.8327 KOps/s $\color{#d91a1a}-0.34\%$
test_compile_copy_flat[pytree-compile] 0.1033ms 69.0248μs 14.4876 KOps/s 14.0632 KOps/s $\color{#35bf28}+3.02\%$
test_compile_copy_flat[pytree-eager] 81.7420μs 51.0322μs 19.5955 KOps/s 19.4354 KOps/s $\color{#35bf28}+0.82\%$
test_compile_assign_and_add[tensordict-compile] 2.3971ms 0.8356ms 1.1967 KOps/s 1.1048 KOps/s $\textbf{\color{#35bf28}+8.31\%}$
test_compile_assign_and_add[tensordict-eager] 3.5001ms 3.2403ms 308.6087 Ops/s 315.1045 Ops/s $\color{#d91a1a}-2.06\%$
test_compile_assign_and_add[pytree-compile] 2.3374ms 0.8190ms 1.2210 KOps/s 1.1184 KOps/s $\textbf{\color{#35bf28}+9.18\%}$
test_compile_assign_and_add[pytree-eager] 3.5223ms 3.3026ms 302.7905 Ops/s 313.6156 Ops/s $\color{#d91a1a}-3.45\%$
test_compile_indexing[tensor-tensordict-compile] 0.1595ms 0.1096ms 9.1273 KOps/s 9.0734 KOps/s $\color{#35bf28}+0.59\%$
test_compile_indexing[tensor-tensordict-eager] 0.1931ms 64.3140μs 15.5487 KOps/s 15.9324 KOps/s $\color{#d91a1a}-2.41\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1685ms 0.1041ms 9.6061 KOps/s 9.7632 KOps/s $\color{#d91a1a}-1.61\%$
test_compile_indexing[tensor-tensorclass-eager] 94.3720μs 45.5921μs 21.9336 KOps/s 23.0484 KOps/s $\color{#d91a1a}-4.84\%$
test_compile_indexing[tensor-pytree-compile] 0.2086ms 0.1071ms 9.3410 KOps/s 9.6575 KOps/s $\color{#d91a1a}-3.28\%$
test_compile_indexing[tensor-pytree-eager] 91.0720μs 45.7576μs 21.8543 KOps/s 22.9277 KOps/s $\color{#d91a1a}-4.68\%$
test_compile_indexing[slice-tensordict-compile] 0.1891ms 0.1442ms 6.9365 KOps/s 7.2452 KOps/s $\color{#d91a1a}-4.26\%$
test_compile_indexing[slice-tensordict-eager] 0.1784ms 25.2662μs 39.5785 KOps/s 38.2639 KOps/s $\color{#35bf28}+3.44\%$
test_compile_indexing[slice-tensorclass-compile] 0.1934ms 0.1358ms 7.3635 KOps/s 7.5929 KOps/s $\color{#d91a1a}-3.02\%$
test_compile_indexing[slice-tensorclass-eager] 56.0920μs 21.0516μs 47.5024 KOps/s 47.1669 KOps/s $\color{#35bf28}+0.71\%$
test_compile_indexing[slice-pytree-compile] 0.2285ms 0.1369ms 7.3054 KOps/s 7.5139 KOps/s $\color{#d91a1a}-2.77\%$
test_compile_indexing[slice-pytree-eager] 52.5710μs 20.9967μs 47.6266 KOps/s 46.5491 KOps/s $\color{#35bf28}+2.31\%$
test_compile_indexing[int-tensordict-compile] 0.1859ms 0.1391ms 7.1911 KOps/s 7.2005 KOps/s $\color{#d91a1a}-0.13\%$
test_compile_indexing[int-tensordict-eager] 0.4965ms 25.4409μs 39.3068 KOps/s 38.1292 KOps/s $\color{#35bf28}+3.09\%$
test_compile_indexing[int-tensorclass-compile] 0.2443ms 0.1322ms 7.5632 KOps/s 7.5324 KOps/s $\color{#35bf28}+0.41\%$
test_compile_indexing[int-tensorclass-eager] 0.1061ms 23.4814μs 42.5869 KOps/s 47.3940 KOps/s $\textbf{\color{#d91a1a}-10.14\%}$
test_compile_indexing[int-pytree-compile] 0.1918ms 0.1356ms 7.3748 KOps/s 7.5388 KOps/s $\color{#d91a1a}-2.18\%$
test_compile_indexing[int-pytree-eager] 64.8810μs 20.7964μs 48.0852 KOps/s 47.5728 KOps/s $\color{#35bf28}+1.08\%$
test_mod_add[eager] 74.6720μs 32.5515μs 30.7205 KOps/s 30.9717 KOps/s $\color{#d91a1a}-0.81\%$
test_mod_add[compile] 0.1211ms 69.8851μs 14.3092 KOps/s 14.1720 KOps/s $\color{#35bf28}+0.97\%$
test_mod_add[compile-overhead] 0.2666ms 0.1354ms 7.3833 KOps/s 6.6415 KOps/s $\textbf{\color{#35bf28}+11.17\%}$
test_mod_wrap[eager] 0.3582ms 0.2502ms 3.9960 KOps/s 4.0579 KOps/s $\color{#d91a1a}-1.52\%$
test_mod_wrap[compile] 1.4336ms 0.2951ms 3.3886 KOps/s 3.3530 KOps/s $\color{#35bf28}+1.06\%$
test_mod_wrap[compile-overhead] 7.5940ms 4.0260ms 248.3841 Ops/s 247.6779 Ops/s $\color{#35bf28}+0.29\%$
test_mod_wrap_and_backward[eager] 1.6395ms 1.3573ms 736.7318 Ops/s 684.9489 Ops/s $\textbf{\color{#35bf28}+7.56\%}$
test_mod_wrap_and_backward[compile] 1.5784ms 1.3167ms 759.4684 Ops/s 697.2193 Ops/s $\textbf{\color{#35bf28}+8.93\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3440ms 0.9122ms 1.0962 KOps/s 937.9654 Ops/s $\textbf{\color{#35bf28}+16.87\%}$
test_seq_add[eager] 0.1837ms 99.5723μs 10.0430 KOps/s 9.9064 KOps/s $\color{#35bf28}+1.38\%$
test_seq_add[compile] 0.6972ms 84.9570μs 11.7707 KOps/s 12.4246 KOps/s $\textbf{\color{#d91a1a}-5.26\%}$
test_seq_add[compile-overhead] 0.1523ms 0.1139ms 8.7827 KOps/s 8.7520 KOps/s $\color{#35bf28}+0.35\%$
test_seq_wrap[eager] 0.4400ms 0.3797ms 2.6339 KOps/s 2.5346 KOps/s $\color{#35bf28}+3.92\%$
test_seq_wrap[compile] 0.3770ms 0.3137ms 3.1874 KOps/s 3.1461 KOps/s $\color{#35bf28}+1.31\%$
test_seq_wrap[compile-overhead] 0.2980ms 0.2255ms 4.4338 KOps/s 4.5007 KOps/s $\color{#d91a1a}-1.49\%$
test_func_call_runtime[False-eager] 0.7895ms 0.7363ms 1.3581 KOps/s 1.2607 KOps/s $\textbf{\color{#35bf28}+7.73\%}$
test_func_call_runtime[False-compile] 1.0881ms 0.7846ms 1.2745 KOps/s 1.2374 KOps/s $\color{#35bf28}+3.00\%$
test_func_call_runtime[False-compile-overhead] 0.4004ms 0.3592ms 2.7842 KOps/s 2.7035 KOps/s $\color{#35bf28}+2.98\%$
test_func_call_runtime[True-eager] 0.9825ms 0.8977ms 1.1139 KOps/s 1.0908 KOps/s $\color{#35bf28}+2.12\%$
test_func_call_runtime[True-compile] 0.8810ms 0.8211ms 1.2179 KOps/s 1.1769 KOps/s $\color{#35bf28}+3.49\%$
test_func_call_runtime[True-compile-overhead] 0.5222ms 0.3936ms 2.5404 KOps/s 2.5060 KOps/s $\color{#35bf28}+1.38\%$
test_func_call_cm_runtime[False-eager] 0.8111ms 0.7338ms 1.3628 KOps/s 1.2920 KOps/s $\textbf{\color{#35bf28}+5.48\%}$
test_func_call_cm_runtime[False-compile] 0.8983ms 0.7808ms 1.2807 KOps/s 1.2350 KOps/s $\color{#35bf28}+3.70\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4092ms 0.3616ms 2.7654 KOps/s 2.6960 KOps/s $\color{#35bf28}+2.57\%$
test_func_call_cm_runtime[True-eager] 1.0928ms 0.9998ms 1.0002 KOps/s 985.2992 Ops/s $\color{#35bf28}+1.51\%$
test_func_call_cm_runtime[True-compile] 0.9444ms 0.8482ms 1.1790 KOps/s 1.1570 KOps/s $\color{#35bf28}+1.91\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4587ms 0.4162ms 2.4027 KOps/s 2.3613 KOps/s $\color{#35bf28}+1.75\%$
test_vmap_func_call_cm_runtime[eager] 2.5270ms 2.0705ms 482.9859 Ops/s 477.6399 Ops/s $\color{#35bf28}+1.12\%$
test_vmap_func_call_cm_runtime[compile] 0.9553ms 0.9004ms 1.1106 KOps/s 1.1386 KOps/s $\color{#d91a1a}-2.46\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4813ms 0.4241ms 2.3578 KOps/s 2.3404 KOps/s $\color{#35bf28}+0.74\%$
test_distributed 4.0519ms 0.2345ms 4.2637 KOps/s 8.5092 KOps/s $\textbf{\color{#d91a1a}-49.89\%}$
test_tdmodule 51.4710μs 13.6870μs 73.0623 KOps/s 65.9930 KOps/s $\textbf{\color{#35bf28}+10.71\%}$
test_tdmodule_dispatch 67.0810μs 27.1556μs 36.8249 KOps/s 33.4452 KOps/s $\textbf{\color{#35bf28}+10.11\%}$
test_tdseq 36.5010μs 14.8007μs 67.5644 KOps/s 62.1798 KOps/s $\textbf{\color{#35bf28}+8.66\%}$
test_tdseq_dispatch 60.3710μs 29.8890μs 33.4571 KOps/s 30.6796 KOps/s $\textbf{\color{#35bf28}+9.05\%}$
test_instantiation_functorch 2.0184ms 1.8692ms 534.9903 Ops/s 533.0794 Ops/s $\color{#35bf28}+0.36\%$
test_instantiation_td 1.8120ms 1.1972ms 835.2787 Ops/s 826.1761 Ops/s $\color{#35bf28}+1.10\%$
test_exec_functorch 0.2555ms 0.2108ms 4.7433 KOps/s 4.7114 KOps/s $\color{#35bf28}+0.68\%$
test_exec_functional_call 0.2491ms 0.2127ms 4.7015 KOps/s 4.6776 KOps/s $\color{#35bf28}+0.51\%$
test_exec_td 0.2883ms 0.2177ms 4.5938 KOps/s 4.5017 KOps/s $\color{#35bf28}+2.04\%$
test_exec_td_decorator 1.1504ms 0.2578ms 3.8795 KOps/s 3.7730 KOps/s $\color{#35bf28}+2.82\%$
test_vmap_mlp_speed[True-True] 0.7589ms 0.6845ms 1.4610 KOps/s 1.4384 KOps/s $\color{#35bf28}+1.57\%$
test_vmap_mlp_speed[True-False] 0.7594ms 0.6856ms 1.4585 KOps/s 1.4446 KOps/s $\color{#35bf28}+0.96\%$
test_vmap_mlp_speed[False-True] 0.6633ms 0.5778ms 1.7306 KOps/s 1.7150 KOps/s $\color{#35bf28}+0.91\%$
test_vmap_mlp_speed[False-False] 0.6644ms 0.5794ms 1.7260 KOps/s 1.7108 KOps/s $\color{#35bf28}+0.89\%$
test_vmap_mlp_speed_decorator[True-True] 1.2546ms 0.6889ms 1.4516 KOps/s 1.4733 KOps/s $\color{#d91a1a}-1.48\%$
test_vmap_mlp_speed_decorator[True-False] 0.8105ms 0.6727ms 1.4865 KOps/s 1.4716 KOps/s $\color{#35bf28}+1.01\%$
test_vmap_mlp_speed_decorator[False-True] 0.7175ms 0.5925ms 1.6877 KOps/s 1.6701 KOps/s $\color{#35bf28}+1.06\%$
test_vmap_mlp_speed_decorator[False-False] 0.7428ms 0.5912ms 1.6916 KOps/s 1.6695 KOps/s $\color{#35bf28}+1.32\%$
test_vmap_transformer_speed[True-True] 8.6380ms 8.3264ms 120.1000 Ops/s 118.3316 Ops/s $\color{#35bf28}+1.49\%$
test_vmap_transformer_speed[True-False] 8.7237ms 8.3480ms 119.7895 Ops/s 118.6359 Ops/s $\color{#35bf28}+0.97\%$
test_vmap_transformer_speed[False-True] 8.4908ms 8.1897ms 122.1045 Ops/s 121.3994 Ops/s $\color{#35bf28}+0.58\%$
test_vmap_transformer_speed[False-False] 8.4950ms 8.2172ms 121.6961 Ops/s 121.6025 Ops/s $\color{#35bf28}+0.08\%$
test_vmap_transformer_speed_decorator[True-True] 20.2277ms 19.7980ms 50.5102 Ops/s 51.0782 Ops/s $\color{#d91a1a}-1.11\%$
test_vmap_transformer_speed_decorator[True-False] 20.3576ms 19.9424ms 50.1443 Ops/s 51.0792 Ops/s $\color{#d91a1a}-1.83\%$
test_vmap_transformer_speed_decorator[False-True] 20.0880ms 19.6075ms 51.0008 Ops/s 51.5615 Ops/s $\color{#d91a1a}-1.09\%$
test_vmap_transformer_speed_decorator[False-False] 19.5095ms 19.3674ms 51.6332 Ops/s 51.4700 Ops/s $\color{#35bf28}+0.32\%$
test_to_module_speed[True] 1.1663ms 0.9378ms 1.0664 KOps/s 1.0613 KOps/s $\color{#35bf28}+0.48\%$
test_to_module_speed[False] 1.3257ms 0.9130ms 1.0953 KOps/s 1.0800 KOps/s $\color{#35bf28}+1.42\%$
test_tc_init 65.2420μs 34.0931μs 29.3315 KOps/s 27.5807 KOps/s $\textbf{\color{#35bf28}+6.35\%}$
test_tc_init_nested 0.1148ms 71.0185μs 14.0808 KOps/s 13.6939 KOps/s $\color{#35bf28}+2.83\%$
test_tc_first_layer_tensor 9.0330μs 0.6781μs 1.4747 MOps/s 1.4601 MOps/s $\color{#35bf28}+1.00\%$
test_tc_first_layer_nontensor 24.9110μs 2.2574μs 442.9782 KOps/s 440.4209 KOps/s $\color{#35bf28}+0.58\%$
test_tc_second_layer_tensor 34.3007μs 1.3877μs 720.6210 KOps/s 733.9660 KOps/s $\color{#d91a1a}-1.82\%$
test_tc_second_layer_nontensor 28.1810μs 2.9573μs 338.1428 KOps/s 338.8215 KOps/s $\color{#d91a1a}-0.20\%$
test_unbind 0.2025s 13.0979ms 76.3479 Ops/s 96.9112 Ops/s $\textbf{\color{#d91a1a}-21.22\%}$
test_full_like 0.6634ms 0.5715ms 1.7498 KOps/s 1.7334 KOps/s $\color{#35bf28}+0.94\%$
test_zeros_like 0.2590ms 0.1979ms 5.0533 KOps/s 5.0508 KOps/s $\color{#35bf28}+0.05\%$
test_ones_like 0.2331ms 0.1978ms 5.0556 KOps/s 5.0561 KOps/s $\color{#d91a1a}-0.01\%$
test_clone 0.4444ms 0.4148ms 2.4108 KOps/s 2.4198 KOps/s $\color{#d91a1a}-0.37\%$
test_squeeze 31.6510μs 10.0525μs 99.4778 KOps/s 99.0518 KOps/s $\color{#35bf28}+0.43\%$
test_unsqueeze 0.2306ms 75.6948μs 13.2109 KOps/s 13.3296 KOps/s $\color{#d91a1a}-0.89\%$
test_split 0.4376ms 0.1599ms 6.2527 KOps/s 6.2792 KOps/s $\color{#d91a1a}-0.42\%$
test_permute 0.2859ms 0.1802ms 5.5486 KOps/s 5.3772 KOps/s $\color{#35bf28}+3.19\%$
test_stack 1.2673ms 0.8572ms 1.1666 KOps/s 1.1390 KOps/s $\color{#35bf28}+2.43\%$
test_cat 1.2550ms 1.2314ms 812.0593 Ops/s 811.6231 Ops/s $\color{#35bf28}+0.05\%$

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CI CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants