Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Doc] Better docstring for to_module #1081

Merged
merged 2 commits into from
Nov 7, 2024
Merged

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Nov 7, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 7, 2024
ghstack-source-id: b371e60bc0512b9f5e10d463a10a1317a0e46aef
Pull Request resolved: #1081
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Nov 7, 2024
Copy link

github-actions bot commented Nov 7, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}29$. Worsened: $\large\color{#d91a1a}15$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 49.8130μs 17.5506μs 56.9781 KOps/s 54.0663 KOps/s $\textbf{\color{#35bf28}+5.39\%}$
test_plain_set_stack_nested 60.8540μs 17.9053μs 55.8495 KOps/s 54.2545 KOps/s $\color{#35bf28}+2.94\%$
test_plain_set_nested_inplace 52.8690μs 19.4339μs 51.4564 KOps/s 48.8952 KOps/s $\textbf{\color{#35bf28}+5.24\%}$
test_plain_set_stack_nested_inplace 73.5480μs 19.2654μs 51.9064 KOps/s 48.8275 KOps/s $\textbf{\color{#35bf28}+6.31\%}$
test_items 45.8560μs 4.1816μs 239.1444 KOps/s 238.6413 KOps/s $\color{#35bf28}+0.21\%$
test_items_nested 0.4801ms 0.3430ms 2.9151 KOps/s 2.8978 KOps/s $\color{#35bf28}+0.60\%$
test_items_nested_locked 0.5701ms 0.3414ms 2.9289 KOps/s 2.8828 KOps/s $\color{#35bf28}+1.60\%$
test_items_nested_leaf 0.1492ms 72.3802μs 13.8159 KOps/s 13.9092 KOps/s $\color{#d91a1a}-0.67\%$
test_items_stack_nested 0.5499ms 0.3465ms 2.8858 KOps/s 2.8610 KOps/s $\color{#35bf28}+0.86\%$
test_items_stack_nested_leaf 0.1941ms 74.9450μs 13.3431 KOps/s 13.1854 KOps/s $\color{#35bf28}+1.20\%$
test_items_stack_nested_locked 0.5215ms 0.3440ms 2.9066 KOps/s 2.8444 KOps/s $\color{#35bf28}+2.19\%$
test_keys 22.7420μs 3.4953μs 286.0991 KOps/s 285.4294 KOps/s $\color{#35bf28}+0.23\%$
test_keys_nested 0.2072ms 0.1344ms 7.4420 KOps/s 7.4360 KOps/s $\color{#35bf28}+0.08\%$
test_keys_nested_locked 0.7012ms 0.1410ms 7.0908 KOps/s 7.1410 KOps/s $\color{#d91a1a}-0.70\%$
test_keys_nested_leaf 0.2650ms 0.1168ms 8.5626 KOps/s 8.6409 KOps/s $\color{#d91a1a}-0.91\%$
test_keys_stack_nested 0.2309ms 0.1362ms 7.3430 KOps/s 7.4095 KOps/s $\color{#d91a1a}-0.90\%$
test_keys_stack_nested_leaf 0.2254ms 0.1177ms 8.4975 KOps/s 8.6275 KOps/s $\color{#d91a1a}-1.51\%$
test_keys_stack_nested_locked 0.2386ms 0.1405ms 7.1152 KOps/s 7.1275 KOps/s $\color{#d91a1a}-0.17\%$
test_values 17.8132μs 1.0328μs 968.2692 KOps/s 955.5119 KOps/s $\color{#35bf28}+1.34\%$
test_values_nested 0.1088ms 54.3828μs 18.3882 KOps/s 17.4656 KOps/s $\textbf{\color{#35bf28}+5.28\%}$
test_values_nested_locked 0.1059ms 54.5629μs 18.3275 KOps/s 18.1024 KOps/s $\color{#35bf28}+1.24\%$
test_values_nested_leaf 0.1386ms 59.5209μs 16.8008 KOps/s 16.7230 KOps/s $\color{#35bf28}+0.47\%$
test_values_stack_nested 0.1593ms 55.8650μs 17.9003 KOps/s 17.8217 KOps/s $\color{#35bf28}+0.44\%$
test_values_stack_nested_leaf 0.1492ms 60.8313μs 16.4389 KOps/s 16.4631 KOps/s $\color{#d91a1a}-0.15\%$
test_values_stack_nested_locked 0.1225ms 56.1802μs 17.7999 KOps/s 16.9179 KOps/s $\textbf{\color{#35bf28}+5.21\%}$
test_membership 11.7520μs 0.9180μs 1.0894 MOps/s 1.1232 MOps/s $\color{#d91a1a}-3.01\%$
test_membership_nested 46.8170μs 2.7728μs 360.6423 KOps/s 363.7396 KOps/s $\color{#d91a1a}-0.85\%$
test_membership_nested_leaf 22.9930μs 2.7908μs 358.3188 KOps/s 354.7271 KOps/s $\color{#35bf28}+1.01\%$
test_membership_stacked_nested 36.9490μs 2.8255μs 353.9243 KOps/s 362.8172 KOps/s $\color{#d91a1a}-2.45\%$
test_membership_stacked_nested_leaf 31.2280μs 2.8240μs 354.1123 KOps/s 357.3986 KOps/s $\color{#d91a1a}-0.92\%$
test_membership_nested_last 59.9840μs 4.2520μs 235.1853 KOps/s 252.2620 KOps/s $\textbf{\color{#d91a1a}-6.77\%}$
test_membership_nested_leaf_last 39.4930μs 4.3106μs 231.9862 KOps/s 245.6611 KOps/s $\textbf{\color{#d91a1a}-5.57\%}$
test_membership_stacked_nested_last 26.7600μs 4.9138μs 203.5101 KOps/s 245.1732 KOps/s $\textbf{\color{#d91a1a}-16.99\%}$
test_membership_stacked_nested_leaf_last 68.8580μs 4.9233μs 203.1176 KOps/s 245.0806 KOps/s $\textbf{\color{#d91a1a}-17.12\%}$
test_nested_getleaf 49.0220μs 10.9396μs 91.4111 KOps/s 92.5061 KOps/s $\color{#d91a1a}-1.18\%$
test_nested_get 47.7780μs 10.4453μs 95.7369 KOps/s 98.6142 KOps/s $\color{#d91a1a}-2.92\%$
test_stacked_getleaf 48.6810μs 10.9610μs 91.2327 KOps/s 93.6414 KOps/s $\color{#d91a1a}-2.57\%$
test_stacked_get 44.3020μs 10.1917μs 98.1192 KOps/s 98.6584 KOps/s $\color{#d91a1a}-0.55\%$
test_nested_getitemleaf 52.9850μs 11.5126μs 86.8611 KOps/s 89.7512 KOps/s $\color{#d91a1a}-3.22\%$
test_nested_getitem 38.3910μs 10.6031μs 94.3117 KOps/s 95.9230 KOps/s $\color{#d91a1a}-1.68\%$
test_stacked_getitemleaf 57.5470μs 11.4181μs 87.5806 KOps/s 90.2318 KOps/s $\color{#d91a1a}-2.94\%$
test_stacked_getitem 0.1772ms 10.8705μs 91.9923 KOps/s 96.3128 KOps/s $\color{#d91a1a}-4.49\%$
test_lock_nested 0.8546ms 0.4355ms 2.2962 KOps/s 1.7585 KOps/s $\textbf{\color{#35bf28}+30.58\%}$
test_lock_stack_nested 0.7610ms 0.4082ms 2.4500 KOps/s 2.3909 KOps/s $\color{#35bf28}+2.47\%$
test_unlock_nested 0.7159ms 0.3553ms 2.8142 KOps/s 2.7238 KOps/s $\color{#35bf28}+3.32\%$
test_unlock_stack_nested 0.5208ms 0.3268ms 3.0596 KOps/s 2.9846 KOps/s $\color{#35bf28}+2.51\%$
test_flatten_speed 0.2184ms 92.9880μs 10.7541 KOps/s 10.8859 KOps/s $\color{#d91a1a}-1.21\%$
test_unflatten_speed 0.6188ms 0.4864ms 2.0560 KOps/s 2.1402 KOps/s $\color{#d91a1a}-3.93\%$
test_common_ops 3.6923ms 0.7578ms 1.3195 KOps/s 1.2357 KOps/s $\textbf{\color{#35bf28}+6.79\%}$
test_creation 24.9160μs 2.0600μs 485.4318 KOps/s 495.2630 KOps/s $\color{#d91a1a}-1.99\%$
test_creation_empty 60.2920μs 10.0719μs 99.2857 KOps/s 78.6042 KOps/s $\textbf{\color{#35bf28}+26.31\%}$
test_creation_nested_1 47.3180μs 12.7633μs 78.3498 KOps/s 64.8678 KOps/s $\textbf{\color{#35bf28}+20.78\%}$
test_creation_nested_2 45.8860μs 16.8606μs 59.3099 KOps/s 50.4119 KOps/s $\textbf{\color{#35bf28}+17.65\%}$
test_clone 0.2161ms 13.2740μs 75.3352 KOps/s 73.8505 KOps/s $\color{#35bf28}+2.01\%$
test_getitem[int] 1.3984ms 12.6370μs 79.1330 KOps/s 80.6695 KOps/s $\color{#d91a1a}-1.90\%$
test_getitem[slice_int] 0.1609ms 23.1045μs 43.2817 KOps/s 41.7705 KOps/s $\color{#35bf28}+3.62\%$
test_getitem[range] 0.1710ms 49.3542μs 20.2617 KOps/s 20.2721 KOps/s $\color{#d91a1a}-0.05\%$
test_getitem[tuple] 0.1569ms 19.5053μs 51.2682 KOps/s 50.3715 KOps/s $\color{#35bf28}+1.78\%$
test_getitem[list] 0.1913ms 45.1015μs 22.1722 KOps/s 22.4502 KOps/s $\color{#d91a1a}-1.24\%$
test_setitem_dim[int] 50.3240μs 25.9370μs 38.5550 KOps/s 38.9056 KOps/s $\color{#d91a1a}-0.90\%$
test_setitem_dim[slice_int] 92.9030μs 51.1989μs 19.5317 KOps/s 19.2921 KOps/s $\color{#35bf28}+1.24\%$
test_setitem_dim[range] 0.1163ms 75.2757μs 13.2845 KOps/s 13.3292 KOps/s $\color{#d91a1a}-0.34\%$
test_setitem_dim[tuple] 83.1150μs 41.2217μs 24.2591 KOps/s 24.5561 KOps/s $\color{#d91a1a}-1.21\%$
test_setitem 0.2833ms 19.7399μs 50.6588 KOps/s 47.1863 KOps/s $\textbf{\color{#35bf28}+7.36\%}$
test_set 0.1019ms 19.2269μs 52.0105 KOps/s 47.9265 KOps/s $\textbf{\color{#35bf28}+8.52\%}$
test_set_shared 1.1779ms 0.1688ms 5.9246 KOps/s 5.7491 KOps/s $\color{#35bf28}+3.05\%$
test_update 0.3212ms 21.5324μs 46.4416 KOps/s 41.0834 KOps/s $\textbf{\color{#35bf28}+13.04\%}$
test_update_nested 0.2995ms 31.8080μs 31.4387 KOps/s 29.7482 KOps/s $\textbf{\color{#35bf28}+5.68\%}$
test_update__nested 0.4913ms 33.9851μs 29.4246 KOps/s 30.1625 KOps/s $\color{#d91a1a}-2.45\%$
test_set_nested 0.1384ms 21.3289μs 46.8847 KOps/s 43.7974 KOps/s $\textbf{\color{#35bf28}+7.05\%}$
test_set_nested_new 0.2347ms 26.0840μs 38.3377 KOps/s 36.7357 KOps/s $\color{#35bf28}+4.36\%$
test_select 0.2397ms 41.9553μs 23.8349 KOps/s 22.5031 KOps/s $\textbf{\color{#35bf28}+5.92\%}$
test_select_nested 0.1278ms 60.2183μs 16.6062 KOps/s 16.5401 KOps/s $\color{#35bf28}+0.40\%$
test_exclude_nested 0.1497ms 75.7749μs 13.1970 KOps/s 13.2655 KOps/s $\color{#d91a1a}-0.52\%$
test_empty[True] 0.5663ms 0.3516ms 2.8439 KOps/s 2.8665 KOps/s $\color{#d91a1a}-0.79\%$
test_empty[False] 10.8028μs 1.2196μs 819.9355 KOps/s 825.3186 KOps/s $\color{#d91a1a}-0.65\%$
test_unbind_speed 0.4868ms 0.2598ms 3.8498 KOps/s 3.7749 KOps/s $\color{#35bf28}+1.99\%$
test_unbind_speed_stack0 0.3686ms 0.2573ms 3.8863 KOps/s 3.8601 KOps/s $\color{#35bf28}+0.68\%$
test_unbind_speed_stack1 0.1084s 0.7638ms 1.3092 KOps/s 1.4226 KOps/s $\textbf{\color{#d91a1a}-7.97\%}$
test_split 0.1024s 1.7128ms 583.8244 Ops/s 567.9099 Ops/s $\color{#35bf28}+2.80\%$
test_chunk 0.1074s 1.7273ms 578.9476 Ops/s 572.6750 Ops/s $\color{#35bf28}+1.10\%$
test_consolidate_njt[False-None] 10.2027ms 8.2223ms 121.6203 Ops/s 123.5135 Ops/s $\color{#d91a1a}-1.53\%$
test_creation[device0] 0.2279ms 90.1938μs 11.0872 KOps/s 10.7463 KOps/s $\color{#35bf28}+3.17\%$
test_creation_from_tensor 3.3960ms 93.8928μs 10.6504 KOps/s 10.6348 KOps/s $\color{#35bf28}+0.15\%$
test_add_one[memmap_tensor0] 0.2140ms 4.7937μs 208.6065 KOps/s 209.5380 KOps/s $\color{#d91a1a}-0.44\%$
test_contiguous[memmap_tensor0] 22.8430μs 0.5225μs 1.9139 MOps/s 2.0031 MOps/s $\color{#d91a1a}-4.45\%$
test_stack[memmap_tensor0] 64.3890μs 3.5179μs 284.2635 KOps/s 285.2734 KOps/s $\color{#d91a1a}-0.35\%$
test_memmaptd_index 0.4573ms 0.2380ms 4.2012 KOps/s 4.2302 KOps/s $\color{#d91a1a}-0.69\%$
test_memmaptd_index_astensor 0.5968ms 0.3166ms 3.1583 KOps/s 3.1855 KOps/s $\color{#d91a1a}-0.85\%$
test_memmaptd_index_op 1.0086ms 0.5730ms 1.7452 KOps/s 1.6314 KOps/s $\textbf{\color{#35bf28}+6.97\%}$
test_serialize_model 0.1264s 0.1206s 8.2931 Ops/s 7.4973 Ops/s $\textbf{\color{#35bf28}+10.62\%}$
test_serialize_model_pickle 0.4581s 0.3948s 2.5330 Ops/s 2.4497 Ops/s $\color{#35bf28}+3.40\%$
test_serialize_weights 0.1236s 0.1154s 8.6620 Ops/s 8.5325 Ops/s $\color{#35bf28}+1.52\%$
test_serialize_weights_returnearly 0.1812s 0.1627s 6.1455 Ops/s 6.3704 Ops/s $\color{#d91a1a}-3.53\%$
test_serialize_weights_pickle 0.5389s 0.4572s 2.1873 Ops/s 2.3326 Ops/s $\textbf{\color{#d91a1a}-6.23\%}$
test_serialize_weights_filesystem 0.1465s 0.1412s 7.0806 Ops/s 6.4058 Ops/s $\textbf{\color{#35bf28}+10.53\%}$
test_serialize_model_filesystem 0.1556s 0.1481s 6.7518 Ops/s 6.5250 Ops/s $\color{#35bf28}+3.48\%$
test_reshape_pytree 60.8830μs 27.6313μs 36.1909 KOps/s 37.3735 KOps/s $\color{#d91a1a}-3.16\%$
test_reshape_td 75.6610μs 32.7001μs 30.5809 KOps/s 31.3540 KOps/s $\color{#d91a1a}-2.47\%$
test_view_pytree 61.8550μs 27.6686μs 36.1420 KOps/s 36.9584 KOps/s $\color{#d91a1a}-2.21\%$
test_view_td 93.1640μs 37.6927μs 26.5303 KOps/s 27.3290 KOps/s $\color{#d91a1a}-2.92\%$
test_unbind_pytree 88.5270μs 29.8190μs 33.5356 KOps/s 33.1491 KOps/s $\color{#35bf28}+1.17\%$
test_unbind_td 0.3075ms 38.0548μs 26.2779 KOps/s 26.1992 KOps/s $\color{#35bf28}+0.30\%$
test_split_pytree 97.9090μs 30.1387μs 33.1799 KOps/s 33.9493 KOps/s $\color{#d91a1a}-2.27\%$
test_split_td 0.1964ms 43.0886μs 23.2080 KOps/s 23.0043 KOps/s $\color{#35bf28}+0.89\%$
test_add_pytree 81.0300μs 35.7449μs 27.9760 KOps/s 27.6405 KOps/s $\color{#35bf28}+1.21\%$
test_add_td 0.1544ms 54.3371μs 18.4036 KOps/s 17.8856 KOps/s $\color{#35bf28}+2.90\%$
test_compile_add_one_nested[tensordict-compile] 0.1507ms 65.4212μs 15.2856 KOps/s 16.0987 KOps/s $\textbf{\color{#d91a1a}-5.05\%}$
test_compile_add_one_nested[tensordict-eager] 3.9819ms 0.1627ms 6.1448 KOps/s 6.2951 KOps/s $\color{#d91a1a}-2.39\%$
test_compile_add_one_nested[pytree-compile] 0.1024ms 47.8144μs 20.9142 KOps/s 21.9625 KOps/s $\color{#d91a1a}-4.77\%$
test_compile_add_one_nested[pytree-eager] 0.2729ms 0.1182ms 8.4627 KOps/s 8.3981 KOps/s $\color{#35bf28}+0.77\%$
test_compile_copy_nested[tensordict-compile] 75.0400μs 26.1462μs 38.2464 KOps/s 39.4508 KOps/s $\color{#d91a1a}-3.05\%$
test_compile_copy_nested[tensordict-eager] 0.1268ms 55.3901μs 18.0538 KOps/s 18.5824 KOps/s $\color{#d91a1a}-2.84\%$
test_compile_copy_nested[pytree-compile] 0.1382ms 79.4086μs 12.5931 KOps/s 12.4735 KOps/s $\color{#35bf28}+0.96\%$
test_compile_copy_nested[pytree-eager] 0.1489ms 69.6911μs 14.3490 KOps/s 14.6863 KOps/s $\color{#d91a1a}-2.30\%$
test_compile_add_one_flat[tensordict-compile] 0.2301ms 0.1078ms 9.2766 KOps/s 9.5248 KOps/s $\color{#d91a1a}-2.61\%$
test_compile_add_one_flat[tensordict-eager] 0.4192ms 0.2064ms 4.8461 KOps/s 5.0910 KOps/s $\color{#d91a1a}-4.81\%$
test_compile_add_one_flat[tensorclass-compile] 0.1127ms 46.6525μs 21.4351 KOps/s 22.8150 KOps/s $\textbf{\color{#d91a1a}-6.05\%}$
test_compile_add_one_flat[tensorclass-eager] 0.4853ms 62.7955μs 15.9247 KOps/s 16.3140 KOps/s $\color{#d91a1a}-2.39\%$
test_compile_add_one_flat[pytree-compile] 0.2240ms 0.1047ms 9.5486 KOps/s 9.7184 KOps/s $\color{#d91a1a}-1.75\%$
test_compile_add_one_flat[pytree-eager] 0.2965ms 0.2050ms 4.8769 KOps/s 4.9536 KOps/s $\color{#d91a1a}-1.55\%$
test_compile_add_self_flat[tensordict-eager] 0.4777ms 0.2139ms 4.6755 KOps/s 4.7806 KOps/s $\color{#d91a1a}-2.20\%$
test_compile_add_self_flat[tensordict-compile] 0.2144ms 0.1095ms 9.1298 KOps/s 9.6198 KOps/s $\textbf{\color{#d91a1a}-5.09\%}$
test_compile_add_self_flat[tensorclass-eager] 0.1831ms 56.7361μs 17.6255 KOps/s 18.3096 KOps/s $\color{#d91a1a}-3.74\%$
test_compile_add_self_flat[tensorclass-compile] 0.1049ms 48.8583μs 20.4674 KOps/s 21.7528 KOps/s $\textbf{\color{#d91a1a}-5.91\%}$
test_compile_add_self_flat[pytree-eager] 1.5854ms 0.1623ms 6.1620 KOps/s 6.2750 KOps/s $\color{#d91a1a}-1.80\%$
test_compile_add_self_flat[pytree-compile] 0.2213ms 0.1052ms 9.5098 KOps/s 9.6778 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_copy_flat[tensordict-compile] 97.9900μs 21.4633μs 46.5912 KOps/s 48.5734 KOps/s $\color{#d91a1a}-4.08\%$
test_compile_copy_flat[tensordict-eager] 0.1422ms 59.7777μs 16.7287 KOps/s 16.4445 KOps/s $\color{#35bf28}+1.73\%$
test_compile_copy_flat[pytree-compile] 0.1309ms 81.6634μs 12.2454 KOps/s 11.9483 KOps/s $\color{#35bf28}+2.49\%$
test_compile_copy_flat[pytree-eager] 0.1234ms 69.4373μs 14.4015 KOps/s 14.3524 KOps/s $\color{#35bf28}+0.34\%$
test_compile_assign_and_add[tensordict-compile] 0.2876ms 0.2086ms 4.7943 KOps/s 4.8502 KOps/s $\color{#d91a1a}-1.15\%$
test_compile_assign_and_add[tensordict-eager] 2.5319ms 1.2830ms 779.4467 Ops/s 785.8980 Ops/s $\color{#d91a1a}-0.82\%$
test_compile_assign_and_add[pytree-compile] 0.3037ms 0.2015ms 4.9630 KOps/s 4.9832 KOps/s $\color{#d91a1a}-0.40\%$
test_compile_assign_and_add[pytree-eager] 0.9633ms 0.7818ms 1.2792 KOps/s 1.2915 KOps/s $\color{#d91a1a}-0.96\%$
test_compile_assign_and_add_stack[compile] 0.5707ms 0.4638ms 2.1563 KOps/s 2.2093 KOps/s $\color{#d91a1a}-2.40\%$
test_compile_assign_and_add_stack[eager] 2.8386ms 2.5922ms 385.7701 Ops/s 361.2210 Ops/s $\textbf{\color{#35bf28}+6.80\%}$
test_compile_indexing[tensor-tensordict-compile] 84.6170μs 36.8170μs 27.1614 KOps/s 28.0999 KOps/s $\color{#d91a1a}-3.34\%$
test_compile_indexing[tensor-tensordict-eager] 0.5706ms 32.4191μs 30.8460 KOps/s 29.4447 KOps/s $\color{#35bf28}+4.76\%$
test_compile_indexing[tensor-tensorclass-compile] 0.1159ms 31.4960μs 31.7501 KOps/s 33.9692 KOps/s $\textbf{\color{#d91a1a}-6.53\%}$
test_compile_indexing[tensor-tensorclass-eager] 63.9290μs 23.0774μs 43.3324 KOps/s 42.8696 KOps/s $\color{#35bf28}+1.08\%$
test_compile_indexing[tensor-pytree-compile] 88.1950μs 31.7550μs 31.4911 KOps/s 33.8619 KOps/s $\textbf{\color{#d91a1a}-7.00\%}$
test_compile_indexing[tensor-pytree-eager] 65.7430μs 23.3383μs 42.8481 KOps/s 43.3492 KOps/s $\color{#d91a1a}-1.16\%$
test_compile_indexing[slice-tensordict-compile] 0.1350ms 52.5903μs 19.0149 KOps/s 19.3753 KOps/s $\color{#d91a1a}-1.86\%$
test_compile_indexing[slice-tensordict-eager] 0.5318ms 18.9217μs 52.8493 KOps/s 50.6763 KOps/s $\color{#35bf28}+4.29\%$
test_compile_indexing[slice-tensorclass-compile] 0.1266ms 46.7740μs 21.3794 KOps/s 22.1668 KOps/s $\color{#d91a1a}-3.55\%$
test_compile_indexing[slice-tensorclass-eager] 58.8390μs 18.6439μs 53.6369 KOps/s 53.0871 KOps/s $\color{#35bf28}+1.04\%$
test_compile_indexing[slice-pytree-compile] 0.1279ms 47.4838μs 21.0598 KOps/s 22.1778 KOps/s $\textbf{\color{#d91a1a}-5.04\%}$
test_compile_indexing[slice-pytree-eager] 55.0330μs 18.6504μs 53.6183 KOps/s 52.6962 KOps/s $\color{#35bf28}+1.75\%$
test_compile_indexing[int-tensordict-compile] 0.1168ms 54.3168μs 18.4105 KOps/s 19.1777 KOps/s $\color{#d91a1a}-4.00\%$
test_compile_indexing[int-tensordict-eager] 1.0045ms 19.1763μs 52.1478 KOps/s 50.8369 KOps/s $\color{#35bf28}+2.58\%$
test_compile_indexing[int-tensorclass-compile] 0.1122ms 47.8061μs 20.9178 KOps/s 22.0854 KOps/s $\textbf{\color{#d91a1a}-5.29\%}$
test_compile_indexing[int-tensorclass-eager] 66.4540μs 18.6043μs 53.7510 KOps/s 52.4952 KOps/s $\color{#35bf28}+2.39\%$
test_compile_indexing[int-pytree-compile] 0.1020ms 47.3336μs 21.1266 KOps/s 22.0413 KOps/s $\color{#d91a1a}-4.15\%$
test_compile_indexing[int-pytree-eager] 56.3750μs 18.7556μs 53.3175 KOps/s 53.7103 KOps/s $\color{#d91a1a}-0.73\%$
test_mod_add[eager] 76.8530μs 26.6111μs 37.5783 KOps/s 35.7512 KOps/s $\textbf{\color{#35bf28}+5.11\%}$
test_mod_add[compile] 0.1053ms 45.1663μs 22.1404 KOps/s 22.2956 KOps/s $\color{#d91a1a}-0.70\%$
test_mod_add[compile-overhead] 99.4650μs 46.2105μs 21.6401 KOps/s 22.3412 KOps/s $\color{#d91a1a}-3.14\%$
test_mod_wrap[eager] 0.4102ms 0.2181ms 4.5855 KOps/s 4.5699 KOps/s $\color{#35bf28}+0.34\%$
test_mod_wrap[compile] 2.0324ms 0.2061ms 4.8510 KOps/s 4.7756 KOps/s $\color{#35bf28}+1.58\%$
test_mod_wrap[compile-overhead] 2.2640ms 0.2052ms 4.8725 KOps/s 4.7480 KOps/s $\color{#35bf28}+2.62\%$
test_mod_wrap_and_backward[eager] 14.9992ms 11.7977ms 84.7623 Ops/s 77.4870 Ops/s $\textbf{\color{#35bf28}+9.39\%}$
test_mod_wrap_and_backward[compile] 15.1285ms 12.7165ms 78.6380 Ops/s 80.2305 Ops/s $\color{#d91a1a}-1.98\%$
test_mod_wrap_and_backward[compile-overhead] 14.0872ms 12.4155ms 80.5443 Ops/s 71.9551 Ops/s $\textbf{\color{#35bf28}+11.94\%}$
test_seq_add[eager] 0.2001ms 91.8497μs 10.8874 KOps/s 10.6522 KOps/s $\color{#35bf28}+2.21\%$
test_seq_add[compile] 0.1375ms 61.3962μs 16.2877 KOps/s 16.8422 KOps/s $\color{#d91a1a}-3.29\%$
test_seq_add[compile-overhead] 0.1256ms 59.3923μs 16.8372 KOps/s 16.7464 KOps/s $\color{#35bf28}+0.54\%$
test_seq_wrap[eager] 0.6791ms 0.3898ms 2.5652 KOps/s 2.3792 KOps/s $\textbf{\color{#35bf28}+7.82\%}$
test_seq_wrap[compile] 1.3945ms 0.2292ms 4.3631 KOps/s 4.4324 KOps/s $\color{#d91a1a}-1.56\%$
test_seq_wrap[compile-overhead] 0.3620ms 0.2285ms 4.3771 KOps/s 4.3953 KOps/s $\color{#d91a1a}-0.41\%$
test_func_call_runtime[False-eager] 1.3516ms 0.5532ms 1.8077 KOps/s 1.8341 KOps/s $\color{#d91a1a}-1.44\%$
test_func_call_runtime[False-compile] 0.7772ms 0.4386ms 2.2799 KOps/s 2.3196 KOps/s $\color{#d91a1a}-1.71\%$
test_func_call_runtime[False-compile-overhead] 0.5601ms 0.4372ms 2.2871 KOps/s 2.3055 KOps/s $\color{#d91a1a}-0.80\%$
test_func_call_runtime[True-eager] 1.0915ms 0.7577ms 1.3198 KOps/s 1.3056 KOps/s $\color{#35bf28}+1.08\%$
test_func_call_runtime[True-compile] 0.6387ms 0.4766ms 2.0984 KOps/s 2.1107 KOps/s $\color{#d91a1a}-0.58\%$
test_func_call_runtime[True-compile-overhead] 0.9779ms 0.4806ms 2.0809 KOps/s 2.1162 KOps/s $\color{#d91a1a}-1.67\%$
test_func_call_cm_runtime[False-eager] 0.8306ms 0.5486ms 1.8229 KOps/s 1.8112 KOps/s $\color{#35bf28}+0.64\%$
test_func_call_cm_runtime[False-compile] 0.5149ms 0.4371ms 2.2879 KOps/s 2.3017 KOps/s $\color{#d91a1a}-0.60\%$
test_func_call_cm_runtime[False-compile-overhead] 0.6089ms 0.4364ms 2.2917 KOps/s 2.3101 KOps/s $\color{#d91a1a}-0.80\%$
test_func_call_cm_runtime[True-eager] 1.4501ms 0.9058ms 1.1040 KOps/s 1.0986 KOps/s $\color{#35bf28}+0.49\%$
test_func_call_cm_runtime[True-compile] 0.6628ms 0.5027ms 1.9891 KOps/s 1.9937 KOps/s $\color{#d91a1a}-0.23\%$
test_func_call_cm_runtime[True-compile-overhead] 0.6839ms 0.5049ms 1.9807 KOps/s 1.9793 KOps/s $\color{#35bf28}+0.07\%$
test_vmap_func_call_cm_runtime[eager] 3.2953ms 1.9314ms 517.7599 Ops/s 520.5382 Ops/s $\color{#d91a1a}-0.53\%$
test_vmap_func_call_cm_runtime[compile] 1.2525ms 0.5346ms 1.8705 KOps/s 1.9035 KOps/s $\color{#d91a1a}-1.73\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.7396ms 0.5291ms 1.8899 KOps/s 1.9288 KOps/s $\color{#d91a1a}-2.02\%$
test_distributed 0.2731ms 0.1278ms 7.8253 KOps/s 7.7412 KOps/s $\color{#35bf28}+1.09\%$
test_tdmodule 63.5580μs 19.5331μs 51.1953 KOps/s 49.8686 KOps/s $\color{#35bf28}+2.66\%$
test_tdmodule_dispatch 55.4730μs 36.0428μs 27.7448 KOps/s 25.5304 KOps/s $\textbf{\color{#35bf28}+8.67\%}$
test_tdseq 48.4000μs 21.4407μs 46.6402 KOps/s 44.8196 KOps/s $\color{#35bf28}+4.06\%$
test_tdseq_dispatch 75.9810μs 42.0492μs 23.7816 KOps/s 22.2966 KOps/s $\textbf{\color{#35bf28}+6.66\%}$
test_instantiation_functorch 1.9593ms 1.5661ms 638.5215 Ops/s 642.0066 Ops/s $\color{#d91a1a}-0.54\%$
test_exec_functorch 0.2834ms 0.1835ms 5.4491 KOps/s 5.4208 KOps/s $\color{#35bf28}+0.52\%$
test_exec_functional_call 0.3521ms 0.1802ms 5.5480 KOps/s 5.5590 KOps/s $\color{#d91a1a}-0.20\%$
test_exec_td_decorator 0.5355ms 0.2309ms 4.3311 KOps/s 4.2451 KOps/s $\color{#35bf28}+2.03\%$
test_vmap_mlp_speed_decorator[True-True] 0.8639ms 0.6475ms 1.5443 KOps/s 1.5477 KOps/s $\color{#d91a1a}-0.22\%$
test_vmap_mlp_speed_decorator[True-False] 1.0114ms 0.6482ms 1.5427 KOps/s 1.5510 KOps/s $\color{#d91a1a}-0.54\%$
test_vmap_mlp_speed_decorator[False-True] 1.9909ms 0.5313ms 1.8823 KOps/s 1.9045 KOps/s $\color{#d91a1a}-1.16\%$
test_vmap_mlp_speed_decorator[False-False] 0.8404ms 0.5306ms 1.8846 KOps/s 1.8994 KOps/s $\color{#d91a1a}-0.78\%$
test_to_module_speed[True] 1.5563ms 1.2830ms 779.4063 Ops/s 770.1618 Ops/s $\color{#35bf28}+1.20\%$
test_to_module_speed[False] 1.5249ms 1.2543ms 797.2430 Ops/s 777.1090 Ops/s $\color{#35bf28}+2.59\%$
test_tc_init 92.9420μs 46.3708μs 21.5653 KOps/s 21.1816 KOps/s $\color{#35bf28}+1.81\%$
test_tc_init_nested 0.1596ms 92.9522μs 10.7582 KOps/s 10.8271 KOps/s $\color{#d91a1a}-0.64\%$
test_tc_first_layer_tensor 17.6430μs 1.5357μs 651.1480 KOps/s 662.7365 KOps/s $\color{#d91a1a}-1.75\%$
test_tc_first_layer_nontensor 26.8700μs 4.7939μs 208.6003 KOps/s 214.6707 KOps/s $\color{#d91a1a}-2.83\%$
test_tc_second_layer_tensor 36.3880μs 2.8826μs 346.9061 KOps/s 361.1499 KOps/s $\color{#d91a1a}-3.94\%$
test_tc_second_layer_nontensor 30.0560μs 6.0816μs 164.4304 KOps/s 168.6973 KOps/s $\color{#d91a1a}-2.53\%$
test_unbind 0.2343s 12.0860ms 82.7403 Ops/s 72.6631 Ops/s $\textbf{\color{#35bf28}+13.87\%}$
test_full_like 8.7902ms 7.4238ms 134.7023 Ops/s 116.3857 Ops/s $\textbf{\color{#35bf28}+15.74\%}$
test_zeros_like 3.5237ms 2.8892ms 346.1205 Ops/s 326.2871 Ops/s $\textbf{\color{#35bf28}+6.08\%}$
test_ones_like 4.4139ms 3.3520ms 298.3336 Ops/s 294.8646 Ops/s $\color{#35bf28}+1.18\%$
test_clone 6.3590ms 5.2548ms 190.3025 Ops/s 181.2576 Ops/s $\color{#35bf28}+4.99\%$
test_squeeze 62.9270μs 12.3228μs 81.1506 KOps/s 80.1847 KOps/s $\color{#35bf28}+1.20\%$
test_unsqueeze 0.3278ms 90.0997μs 11.0988 KOps/s 11.4872 KOps/s $\color{#d91a1a}-3.38\%$
test_split 1.3443ms 0.1913ms 5.2262 KOps/s 5.3388 KOps/s $\color{#d91a1a}-2.11\%$
test_permute 0.2971ms 0.2171ms 4.6059 KOps/s 4.6401 KOps/s $\color{#d91a1a}-0.74\%$
test_stack 29.8336ms 26.1786ms 38.1991 Ops/s 39.7827 Ops/s $\color{#d91a1a}-3.98\%$
test_cat 32.1630ms 26.1071ms 38.3038 Ops/s 40.5263 Ops/s $\textbf{\color{#d91a1a}-5.48\%}$

Copy link

github-actions bot commented Nov 7, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 229. Improved: $\large\color{#35bf28}36$. Worsened: $\large\color{#d91a1a}14$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 36.8910μs 9.8350μs 101.6780 KOps/s 85.9992 KOps/s $\textbf{\color{#35bf28}+18.23\%}$
test_plain_set_stack_nested 27.4500μs 9.8248μs 101.7834 KOps/s 85.5705 KOps/s $\textbf{\color{#35bf28}+18.95\%}$
test_plain_set_nested_inplace 34.7600μs 10.6957μs 93.4951 KOps/s 79.9603 KOps/s $\textbf{\color{#35bf28}+16.93\%}$
test_plain_set_stack_nested_inplace 37.9200μs 10.6443μs 93.9469 KOps/s 81.1177 KOps/s $\textbf{\color{#35bf28}+15.82\%}$
test_items 33.3200μs 2.8847μs 346.6582 KOps/s 343.9954 KOps/s $\color{#35bf28}+0.77\%$
test_items_nested 0.3971ms 0.3385ms 2.9546 KOps/s 3.1407 KOps/s $\textbf{\color{#d91a1a}-5.93\%}$
test_items_nested_locked 0.4108ms 0.3394ms 2.9466 KOps/s 3.1325 KOps/s $\textbf{\color{#d91a1a}-5.93\%}$
test_items_nested_leaf 82.6820μs 58.3515μs 17.1375 KOps/s 17.3398 KOps/s $\color{#d91a1a}-1.17\%$
test_items_stack_nested 0.3978ms 0.3438ms 2.9088 KOps/s 3.1456 KOps/s $\textbf{\color{#d91a1a}-7.53\%}$
test_items_stack_nested_leaf 84.3610μs 59.5254μs 16.7995 KOps/s 16.8988 KOps/s $\color{#d91a1a}-0.59\%$
test_items_stack_nested_locked 0.4369ms 0.3416ms 2.9271 KOps/s 3.1356 KOps/s $\textbf{\color{#d91a1a}-6.65\%}$
test_keys 32.2200μs 3.4818μs 287.2106 KOps/s 290.0261 KOps/s $\color{#d91a1a}-0.97\%$
test_keys_nested 0.1125ms 69.7090μs 14.3453 KOps/s 14.2885 KOps/s $\color{#35bf28}+0.40\%$
test_keys_nested_locked 0.7393ms 75.9470μs 13.1671 KOps/s 13.2695 KOps/s $\color{#d91a1a}-0.77\%$
test_keys_nested_leaf 93.8320μs 60.9734μs 16.4006 KOps/s 16.3688 KOps/s $\color{#35bf28}+0.19\%$
test_keys_stack_nested 0.1026ms 69.6248μs 14.3627 KOps/s 14.1621 KOps/s $\color{#35bf28}+1.42\%$
test_keys_stack_nested_leaf 89.7920μs 61.3078μs 16.3111 KOps/s 16.1697 KOps/s $\color{#35bf28}+0.87\%$
test_keys_stack_nested_locked 0.1109ms 75.3108μs 13.2783 KOps/s 13.0851 KOps/s $\color{#35bf28}+1.48\%$
test_values 6.1968μs 0.8512μs 1.1748 MOps/s 1.1556 MOps/s $\color{#35bf28}+1.66\%$
test_values_nested 62.5310μs 31.1711μs 32.0810 KOps/s 32.2585 KOps/s $\color{#d91a1a}-0.55\%$
test_values_nested_locked 62.2820μs 32.7233μs 30.5593 KOps/s 30.5661 KOps/s $\color{#d91a1a}-0.02\%$
test_values_nested_leaf 58.7710μs 33.5570μs 29.8000 KOps/s 29.7577 KOps/s $\color{#35bf28}+0.14\%$
test_values_stack_nested 60.2310μs 31.8940μs 31.3539 KOps/s 31.5201 KOps/s $\color{#d91a1a}-0.53\%$
test_values_stack_nested_leaf 62.7710μs 34.3679μs 29.0969 KOps/s 29.1928 KOps/s $\color{#d91a1a}-0.33\%$
test_values_stack_nested_locked 64.5010μs 32.9515μs 30.3476 KOps/s 30.0949 KOps/s $\color{#35bf28}+0.84\%$
test_membership 1.7766μs 0.5093μs 1.9634 MOps/s 1.9615 MOps/s $\color{#35bf28}+0.10\%$
test_membership_nested 20.0605μs 1.9295μs 518.2607 KOps/s 523.4373 KOps/s $\color{#d91a1a}-0.99\%$
test_membership_nested_leaf 16.6855μs 1.9163μs 521.8423 KOps/s 530.8309 KOps/s $\color{#d91a1a}-1.69\%$
test_membership_stacked_nested 24.0510μs 2.0349μs 491.4166 KOps/s 517.2863 KOps/s $\textbf{\color{#d91a1a}-5.00\%}$
test_membership_stacked_nested_leaf 28.4000μs 2.0241μs 494.0403 KOps/s 515.6916 KOps/s $\color{#d91a1a}-4.20\%$
test_membership_nested_last 38.3000μs 2.8415μs 351.9220 KOps/s 355.4737 KOps/s $\color{#d91a1a}-1.00\%$
test_membership_nested_leaf_last 42.4210μs 2.8315μs 353.1644 KOps/s 355.0868 KOps/s $\color{#d91a1a}-0.54\%$
test_membership_stacked_nested_last 39.3400μs 7.7896μs 128.3769 KOps/s 309.6010 KOps/s $\textbf{\color{#d91a1a}-58.53\%}$
test_membership_stacked_nested_leaf_last 48.3710μs 7.7396μs 129.2055 KOps/s 309.6174 KOps/s $\textbf{\color{#d91a1a}-58.27\%}$
test_nested_getleaf 29.5000μs 6.0146μs 166.2608 KOps/s 166.3793 KOps/s $\color{#d91a1a}-0.07\%$
test_nested_get 37.0800μs 5.7111μs 175.0989 KOps/s 176.0114 KOps/s $\color{#d91a1a}-0.52\%$
test_stacked_getleaf 44.5610μs 6.0005μs 166.6542 KOps/s 166.8295 KOps/s $\color{#d91a1a}-0.11\%$
test_stacked_get 32.9310μs 5.7101μs 175.1280 KOps/s 175.4113 KOps/s $\color{#d91a1a}-0.16\%$
test_nested_getitemleaf 43.7010μs 6.1083μs 163.7129 KOps/s 165.2918 KOps/s $\color{#d91a1a}-0.96\%$
test_nested_getitem 43.5900μs 5.8243μs 171.6935 KOps/s 173.0990 KOps/s $\color{#d91a1a}-0.81\%$
test_stacked_getitemleaf 31.1400μs 6.1298μs 163.1372 KOps/s 164.6045 KOps/s $\color{#d91a1a}-0.89\%$
test_stacked_getitem 52.6210μs 5.8132μs 172.0220 KOps/s 172.9959 KOps/s $\color{#d91a1a}-0.56\%$
test_lock_nested 9.1151ms 0.3711ms 2.6946 KOps/s 2.7359 KOps/s $\color{#d91a1a}-1.51\%$
test_lock_stack_nested 0.3841ms 0.3269ms 3.0589 KOps/s 2.9750 KOps/s $\color{#35bf28}+2.82\%$
test_unlock_nested 1.0435ms 0.3061ms 3.2671 KOps/s 3.2973 KOps/s $\color{#d91a1a}-0.92\%$
test_unlock_stack_nested 0.3324ms 0.2660ms 3.7599 KOps/s 3.6459 KOps/s $\color{#35bf28}+3.13\%$
test_flatten_speed 0.1045ms 72.8223μs 13.7321 KOps/s 13.8280 KOps/s $\color{#d91a1a}-0.69\%$
test_unflatten_speed 0.3921ms 0.2910ms 3.4359 KOps/s 3.4513 KOps/s $\color{#d91a1a}-0.44\%$
test_common_ops 1.6467ms 0.5463ms 1.8304 KOps/s 1.6572 KOps/s $\textbf{\color{#35bf28}+10.45\%}$
test_creation 0.1658ms 1.4803μs 675.5554 KOps/s 685.0684 KOps/s $\color{#d91a1a}-1.39\%$
test_creation_empty 31.4100μs 5.9303μs 168.6257 KOps/s 102.7541 KOps/s $\textbf{\color{#35bf28}+64.11\%}$
test_creation_nested_1 42.9610μs 7.4664μs 133.9325 KOps/s 88.7533 KOps/s $\textbf{\color{#35bf28}+50.90\%}$
test_creation_nested_2 37.6000μs 9.9463μs 100.5399 KOps/s 71.8657 KOps/s $\textbf{\color{#35bf28}+39.90\%}$
test_clone 0.1497ms 10.0383μs 99.6188 KOps/s 98.8062 KOps/s $\color{#35bf28}+0.82\%$
test_getitem[int] 1.7473ms 10.8342μs 92.3002 KOps/s 92.1270 KOps/s $\color{#35bf28}+0.19\%$
test_getitem[slice_int] 0.1133ms 21.5873μs 46.3235 KOps/s 46.6597 KOps/s $\color{#d91a1a}-0.72\%$
test_getitem[range] 0.1319ms 36.9158μs 27.0887 KOps/s 27.5450 KOps/s $\color{#d91a1a}-1.66\%$
test_getitem[tuple] 0.1085ms 17.8703μs 55.9588 KOps/s 55.3038 KOps/s $\color{#35bf28}+1.18\%$
test_getitem[list] 0.1262ms 32.4486μs 30.8180 KOps/s 31.1703 KOps/s $\color{#d91a1a}-1.13\%$
test_setitem_dim[int] 28.1310μs 18.2568μs 54.7741 KOps/s 55.4136 KOps/s $\color{#d91a1a}-1.15\%$
test_setitem_dim[slice_int] 57.0910μs 37.0250μs 27.0088 KOps/s 27.7916 KOps/s $\color{#d91a1a}-2.82\%$
test_setitem_dim[range] 74.7110μs 50.6317μs 19.7505 KOps/s 19.8383 KOps/s $\color{#d91a1a}-0.44\%$
test_setitem_dim[tuple] 53.2710μs 30.1988μs 33.1139 KOps/s 32.3665 KOps/s $\color{#35bf28}+2.31\%$
test_setitem 0.1343ms 13.3002μs 75.1870 KOps/s 65.5805 KOps/s $\textbf{\color{#35bf28}+14.65\%}$
test_set 0.1492ms 12.8373μs 77.8980 KOps/s 68.2627 KOps/s $\textbf{\color{#35bf28}+14.12\%}$
test_set_shared 1.4183ms 0.1447ms 6.9111 KOps/s 6.9121 KOps/s $\color{#d91a1a}-0.01\%$
test_update 0.4568ms 14.4590μs 69.1609 KOps/s 54.6497 KOps/s $\textbf{\color{#35bf28}+26.55\%}$
test_update_nested 0.1545ms 19.2730μs 51.8862 KOps/s 43.0435 KOps/s $\textbf{\color{#35bf28}+20.54\%}$
test_update__nested 0.4577ms 24.1374μs 41.4295 KOps/s 42.8246 KOps/s $\color{#d91a1a}-3.26\%$
test_set_nested 0.1462ms 14.0188μs 71.3328 KOps/s 62.6777 KOps/s $\textbf{\color{#35bf28}+13.81\%}$
test_set_nested_new 0.1490ms 16.4130μs 60.9273 KOps/s 55.2279 KOps/s $\textbf{\color{#35bf28}+10.32\%}$
test_select 0.1925ms 27.2958μs 36.6357 KOps/s 33.5231 KOps/s $\textbf{\color{#35bf28}+9.29\%}$
test_select_nested 72.8220μs 41.7803μs 23.9347 KOps/s 23.8231 KOps/s $\color{#35bf28}+0.47\%$
test_exclude_nested 0.1034ms 59.5058μs 16.8051 KOps/s 16.8868 KOps/s $\color{#d91a1a}-0.48\%$
test_empty[True] 0.3111ms 0.2587ms 3.8656 KOps/s 3.8992 KOps/s $\color{#d91a1a}-0.86\%$
test_empty[False] 3.3731μs 0.7400μs 1.3513 MOps/s 1.3409 MOps/s $\color{#35bf28}+0.78\%$
test_to 83.1110μs 54.4781μs 18.3560 KOps/s 18.4572 KOps/s $\color{#d91a1a}-0.55\%$
test_to_nonblocking 0.1001ms 45.1707μs 22.1382 KOps/s 22.3492 KOps/s $\color{#d91a1a}-0.94\%$
test_unbind_speed 0.2831ms 0.2317ms 4.3156 KOps/s 4.3114 KOps/s $\color{#35bf28}+0.10\%$
test_unbind_speed_stack0 0.2710ms 0.2225ms 4.4948 KOps/s 4.2998 KOps/s $\color{#35bf28}+4.54\%$
test_unbind_speed_stack1 92.9125ms 0.6285ms 1.5911 KOps/s 1.7069 KOps/s $\textbf{\color{#d91a1a}-6.78\%}$
test_split 93.6014ms 1.7026ms 587.3519 Ops/s 598.0320 Ops/s $\color{#d91a1a}-1.79\%$
test_chunk 96.8103ms 1.7120ms 584.0994 Ops/s 593.5975 Ops/s $\color{#d91a1a}-1.60\%$
test_consolidate[False-None] 96.2660ms 2.8874ms 346.3353 Ops/s 343.7670 Ops/s $\color{#35bf28}+0.75\%$
test_consolidate[default-None] 1.7690ms 1.6761ms 596.6065 Ops/s 593.7519 Ops/s $\color{#35bf28}+0.48\%$
test_consolidate[reduce-overhead-None] 1.8468ms 1.7165ms 582.5923 Ops/s 584.6316 Ops/s $\color{#d91a1a}-0.35\%$
test_consolidate_njt[False-None] 7.0141ms 6.6993ms 149.2703 Ops/s 149.9641 Ops/s $\color{#d91a1a}-0.46\%$
test_to[False-False-None] 1.7362ms 1.6501ms 606.0356 Ops/s 595.3829 Ops/s $\color{#35bf28}+1.79\%$
test_to[True-False-None] 1.6049ms 1.3421ms 745.1037 Ops/s 743.9402 Ops/s $\color{#35bf28}+0.16\%$
test_to[within-False-None] 4.2593ms 4.1257ms 242.3829 Ops/s 241.8760 Ops/s $\color{#35bf28}+0.21\%$
test_to[True-default-None] 5.5859ms 5.0888ms 196.5094 Ops/s 193.5804 Ops/s $\color{#35bf28}+1.51\%$
test_to_njt[False-False-None] 7.2205ms 7.0033ms 142.7889 Ops/s 142.2542 Ops/s $\color{#35bf28}+0.38\%$
test_to_njt[True-False-None] 5.8504ms 5.5616ms 179.8042 Ops/s 179.4113 Ops/s $\color{#35bf28}+0.22\%$
test_to_njt[within-False-None] 12.6552ms 12.2518ms 81.6205 Ops/s 81.6128 Ops/s $+0.01\%$
test_creation[device0] 0.4755ms 79.1200μs 12.6390 KOps/s 12.7433 KOps/s $\color{#d91a1a}-0.82\%$
test_creation_from_tensor 0.4724ms 82.7431μs 12.0856 KOps/s 12.0272 KOps/s $\color{#35bf28}+0.49\%$
test_add_one[memmap_tensor0] 0.4035ms 6.2108μs 161.0086 KOps/s 165.1135 KOps/s $\color{#d91a1a}-2.49\%$
test_contiguous[memmap_tensor0] 2.7596μs 0.4224μs 2.3672 MOps/s 2.4315 MOps/s $\color{#d91a1a}-2.64\%$
test_stack[memmap_tensor0] 44.0700μs 4.5422μs 220.1591 KOps/s 231.4742 KOps/s $\color{#d91a1a}-4.89\%$
test_memmaptd_index 1.9918ms 0.2592ms 3.8573 KOps/s 3.9005 KOps/s $\color{#d91a1a}-1.11\%$
test_memmaptd_index_astensor 1.0097ms 0.3145ms 3.1798 KOps/s 3.1670 KOps/s $\color{#35bf28}+0.40\%$
test_memmaptd_index_op 0.9520ms 0.5366ms 1.8636 KOps/s 1.6673 KOps/s $\textbf{\color{#35bf28}+11.77\%}$
test_serialize_model 0.1318s 0.1302s 7.6801 Ops/s 5.3450 Ops/s $\textbf{\color{#35bf28}+43.69\%}$
test_serialize_model_pickle 1.3511s 1.2157s 0.8226 Ops/s 0.8210 Ops/s $\color{#35bf28}+0.19\%$
test_serialize_weights 0.1297s 0.1291s 7.7438 Ops/s 7.6913 Ops/s $\color{#35bf28}+0.68\%$
test_serialize_weights_returnearly 0.5767s 63.2200ms 15.8178 Ops/s 23.6371 Ops/s $\textbf{\color{#d91a1a}-33.08\%}$
test_serialize_weights_pickle 1.3816s 1.2137s 0.8239 Ops/s 0.8203 Ops/s $\color{#35bf28}+0.44\%$
test_reshape_pytree 69.7910μs 22.8050μs 43.8500 KOps/s 43.5464 KOps/s $\color{#35bf28}+0.70\%$
test_reshape_td 63.0010μs 27.0412μs 36.9806 KOps/s 36.9417 KOps/s $\color{#35bf28}+0.11\%$
test_view_pytree 94.9620μs 22.3833μs 44.6762 KOps/s 44.2357 KOps/s $\color{#35bf28}+1.00\%$
test_view_td 0.1102ms 30.2149μs 33.0963 KOps/s 32.5999 KOps/s $\color{#35bf28}+1.52\%$
test_unbind_pytree 58.1510μs 27.8167μs 35.9496 KOps/s 36.0665 KOps/s $\color{#d91a1a}-0.32\%$
test_unbind_td 0.6268ms 34.9176μs 28.6389 KOps/s 28.1327 KOps/s $\color{#35bf28}+1.80\%$
test_split_pytree 73.4820μs 31.0132μs 32.2443 KOps/s 32.6702 KOps/s $\color{#d91a1a}-1.30\%$
test_split_td 0.8213ms 41.1713μs 24.2888 KOps/s 24.2993 KOps/s $\color{#d91a1a}-0.04\%$
test_add_pytree 70.2320μs 32.4339μs 30.8320 KOps/s 30.8550 KOps/s $\color{#d91a1a}-0.07\%$
test_add_td 69.5610μs 41.4166μs 24.1449 KOps/s 20.5833 KOps/s $\textbf{\color{#35bf28}+17.30\%}$
test_compile_add_one_nested[tensordict-compile] 0.1708ms 0.1205ms 8.3010 KOps/s 8.0799 KOps/s $\color{#35bf28}+2.74\%$
test_compile_add_one_nested[tensordict-eager] 0.2198ms 0.1280ms 7.8108 KOps/s 7.8995 KOps/s $\color{#d91a1a}-1.12\%$
test_compile_add_one_nested[pytree-compile] 0.1645ms 99.7939μs 10.0206 KOps/s 9.9656 KOps/s $\color{#35bf28}+0.55\%$
test_compile_add_one_nested[pytree-eager] 1.3701ms 0.1492ms 6.7009 KOps/s 6.7140 KOps/s $\color{#d91a1a}-0.20\%$
test_compile_copy_nested[tensordict-compile] 59.9110μs 22.9587μs 43.5565 KOps/s 44.4717 KOps/s $\color{#d91a1a}-2.06\%$
test_compile_copy_nested[tensordict-eager] 61.7910μs 27.3648μs 36.5433 KOps/s 35.9772 KOps/s $\color{#35bf28}+1.57\%$
test_compile_copy_nested[pytree-compile] 0.2139ms 62.4629μs 16.0095 KOps/s 15.3424 KOps/s $\color{#35bf28}+4.35\%$
test_compile_copy_nested[pytree-eager] 79.2110μs 49.5499μs 20.1817 KOps/s 19.8116 KOps/s $\color{#35bf28}+1.87\%$
test_compile_add_one_flat[tensordict-compile] 0.1799ms 0.1421ms 7.0377 KOps/s 6.7387 KOps/s $\color{#35bf28}+4.44\%$
test_compile_add_one_flat[tensordict-eager] 0.2995ms 0.2092ms 4.7802 KOps/s 4.8004 KOps/s $\color{#d91a1a}-0.42\%$
test_compile_add_one_flat[tensorclass-compile] 0.1428ms 97.0362μs 10.3054 KOps/s 10.0806 KOps/s $\color{#35bf28}+2.23\%$
test_compile_add_one_flat[tensorclass-eager] 0.1253ms 52.4985μs 19.0482 KOps/s 19.2289 KOps/s $\color{#d91a1a}-0.94\%$
test_compile_add_one_flat[pytree-compile] 0.1944ms 0.1440ms 6.9453 KOps/s 6.9131 KOps/s $\color{#35bf28}+0.47\%$
test_compile_add_one_flat[pytree-eager] 0.5389ms 0.4767ms 2.0978 KOps/s 2.1042 KOps/s $\color{#d91a1a}-0.31\%$
test_compile_add_self_flat[tensordict-eager] 0.4192ms 0.2474ms 4.0413 KOps/s 4.0186 KOps/s $\color{#35bf28}+0.56\%$
test_compile_add_self_flat[tensordict-compile] 0.1979ms 0.1432ms 6.9814 KOps/s 7.0356 KOps/s $\color{#d91a1a}-0.77\%$
test_compile_add_self_flat[tensorclass-eager] 0.1504ms 61.0646μs 16.3761 KOps/s 16.4636 KOps/s $\color{#d91a1a}-0.53\%$
test_compile_add_self_flat[tensorclass-compile] 0.1853ms 98.2928μs 10.1737 KOps/s 9.7966 KOps/s $\color{#35bf28}+3.85\%$
test_compile_add_self_flat[pytree-eager] 0.4964ms 0.4110ms 2.4328 KOps/s 2.4769 KOps/s $\color{#d91a1a}-1.78\%$
test_compile_add_self_flat[pytree-compile] 0.1970ms 0.1378ms 7.2544 KOps/s 7.0932 KOps/s $\color{#35bf28}+2.27\%$
test_compile_copy_flat[tensordict-compile] 77.7510μs 21.0562μs 47.4919 KOps/s 53.4894 KOps/s $\textbf{\color{#d91a1a}-11.21\%}$
test_compile_copy_flat[tensordict-eager] 69.9020μs 27.1547μs 36.8260 KOps/s 36.6556 KOps/s $\color{#35bf28}+0.46\%$
test_compile_copy_flat[pytree-compile] 0.1014ms 69.5274μs 14.3828 KOps/s 14.5127 KOps/s $\color{#d91a1a}-0.89\%$
test_compile_copy_flat[pytree-eager] 0.1049ms 52.0388μs 19.2164 KOps/s 19.1281 KOps/s $\color{#35bf28}+0.46\%$
test_compile_assign_and_add[tensordict-compile] 1.6144ms 0.4394ms 2.2757 KOps/s 2.2340 KOps/s $\color{#35bf28}+1.87\%$
test_compile_assign_and_add[tensordict-eager] 2.7022ms 2.5474ms 392.5596 Ops/s 390.3655 Ops/s $\color{#35bf28}+0.56\%$
test_compile_assign_and_add[pytree-compile] 1.6038ms 0.4320ms 2.3148 KOps/s 2.2757 KOps/s $\color{#35bf28}+1.72\%$
test_compile_assign_and_add[pytree-eager] 2.7546ms 2.6206ms 381.5979 Ops/s 384.2870 Ops/s $\color{#d91a1a}-0.70\%$
test_compile_indexing[tensor-tensordict-compile] 0.3295ms 0.1176ms 8.5038 KOps/s 8.6088 KOps/s $\color{#d91a1a}-1.22\%$
test_compile_indexing[tensor-tensordict-eager] 0.5652ms 77.8648μs 12.8428 KOps/s 12.7436 KOps/s $\color{#35bf28}+0.78\%$
test_compile_indexing[tensor-tensorclass-compile] 0.3757ms 0.1040ms 9.6140 KOps/s 9.7135 KOps/s $\color{#d91a1a}-1.03\%$
test_compile_indexing[tensor-tensorclass-eager] 0.1737ms 69.9674μs 14.2924 KOps/s 14.9021 KOps/s $\color{#d91a1a}-4.09\%$
test_compile_indexing[tensor-pytree-compile] 0.1596ms 0.1105ms 9.0515 KOps/s 9.6121 KOps/s $\textbf{\color{#d91a1a}-5.83\%}$
test_compile_indexing[tensor-pytree-eager] 0.1185ms 70.7568μs 14.1329 KOps/s 14.7536 KOps/s $\color{#d91a1a}-4.21\%$
test_compile_indexing[slice-tensordict-compile] 0.1520ms 0.1050ms 9.5249 KOps/s 9.7825 KOps/s $\color{#d91a1a}-2.63\%$
test_compile_indexing[slice-tensordict-eager] 0.1429ms 18.0109μs 55.5218 KOps/s 49.8896 KOps/s $\textbf{\color{#35bf28}+11.29\%}$
test_compile_indexing[slice-tensorclass-compile] 0.1381ms 95.9429μs 10.4229 KOps/s 9.8098 KOps/s $\textbf{\color{#35bf28}+6.25\%}$
test_compile_indexing[slice-tensorclass-eager] 59.0410μs 16.9700μs 58.9275 KOps/s 60.0554 KOps/s $\color{#d91a1a}-1.88\%$
test_compile_indexing[slice-pytree-compile] 0.1552ms 96.9845μs 10.3109 KOps/s 10.1018 KOps/s $\color{#35bf28}+2.07\%$
test_compile_indexing[slice-pytree-eager] 53.3510μs 17.2893μs 57.8391 KOps/s 59.9534 KOps/s $\color{#d91a1a}-3.53\%$
test_compile_indexing[int-tensordict-compile] 0.4835ms 0.1024ms 9.7638 KOps/s 9.6745 KOps/s $\color{#35bf28}+0.92\%$
test_compile_indexing[int-tensordict-eager] 0.6828ms 18.2083μs 54.9200 KOps/s 55.1789 KOps/s $\color{#d91a1a}-0.47\%$
test_compile_indexing[int-tensorclass-compile] 0.4871ms 97.3792μs 10.2691 KOps/s 9.7478 KOps/s $\textbf{\color{#35bf28}+5.35\%}$
test_compile_indexing[int-tensorclass-eager] 0.1581ms 16.8177μs 59.4612 KOps/s 60.2353 KOps/s $\color{#d91a1a}-1.29\%$
test_compile_indexing[int-pytree-compile] 0.4839ms 97.7378μs 10.2315 KOps/s 10.1588 KOps/s $\color{#35bf28}+0.72\%$
test_compile_indexing[int-pytree-eager] 0.3991ms 16.8415μs 59.3770 KOps/s 60.1274 KOps/s $\color{#d91a1a}-1.25\%$
test_mod_add[eager] 0.4316ms 31.5618μs 31.6839 KOps/s 30.8273 KOps/s $\color{#35bf28}+2.78\%$
test_mod_add[compile] 0.1467ms 80.9579μs 12.3521 KOps/s 12.9278 KOps/s $\color{#d91a1a}-4.45\%$
test_mod_add[compile-overhead] 0.3123ms 0.1633ms 6.1228 KOps/s 5.7350 KOps/s $\textbf{\color{#35bf28}+6.76\%}$
test_mod_wrap[eager] 0.6507ms 0.2479ms 4.0336 KOps/s 3.8830 KOps/s $\color{#35bf28}+3.88\%$
test_mod_wrap[compile] 1.5409ms 0.2800ms 3.5713 KOps/s 3.5071 KOps/s $\color{#35bf28}+1.83\%$
test_mod_wrap[compile-overhead] 8.1134ms 4.2049ms 237.8157 Ops/s 238.4703 Ops/s $\color{#d91a1a}-0.27\%$
test_mod_wrap_and_backward[eager] 1.4273ms 1.3079ms 764.5751 Ops/s 700.9781 Ops/s $\textbf{\color{#35bf28}+9.07\%}$
test_mod_wrap_and_backward[compile] 1.3522ms 1.2512ms 799.2106 Ops/s 731.7698 Ops/s $\textbf{\color{#35bf28}+9.22\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3349ms 0.8966ms 1.1153 KOps/s 976.4793 Ops/s $\textbf{\color{#35bf28}+14.21\%}$
test_seq_add[eager] 0.1990ms 95.7986μs 10.4386 KOps/s 9.6958 KOps/s $\textbf{\color{#35bf28}+7.66\%}$
test_seq_add[compile] 0.1388ms 85.7600μs 11.6605 KOps/s 11.5234 KOps/s $\color{#35bf28}+1.19\%$
test_seq_add[compile-overhead] 0.1678ms 0.1263ms 7.9189 KOps/s 7.8830 KOps/s $\color{#35bf28}+0.45\%$
test_seq_wrap[eager] 0.4826ms 0.3727ms 2.6833 KOps/s 2.5468 KOps/s $\textbf{\color{#35bf28}+5.36\%}$
test_seq_wrap[compile] 0.3795ms 0.2988ms 3.3469 KOps/s 3.3067 KOps/s $\color{#35bf28}+1.22\%$
test_seq_wrap[compile-overhead] 0.2726ms 0.2171ms 4.6065 KOps/s 4.5013 KOps/s $\color{#35bf28}+2.34\%$
test_func_call_runtime[False-eager] 0.8369ms 0.7203ms 1.3883 KOps/s 1.2844 KOps/s $\textbf{\color{#35bf28}+8.09\%}$
test_func_call_runtime[False-compile] 0.7896ms 0.7462ms 1.3401 KOps/s 1.3243 KOps/s $\color{#35bf28}+1.20\%$
test_func_call_runtime[False-compile-overhead] 0.4384ms 0.3527ms 2.8355 KOps/s 2.7918 KOps/s $\color{#35bf28}+1.56\%$
test_func_call_runtime[True-eager] 1.0333ms 0.8896ms 1.1242 KOps/s 1.0943 KOps/s $\color{#35bf28}+2.73\%$
test_func_call_runtime[True-compile] 0.8660ms 0.7656ms 1.3062 KOps/s 1.2883 KOps/s $\color{#35bf28}+1.39\%$
test_func_call_runtime[True-compile-overhead] 0.4274ms 0.3749ms 2.6673 KOps/s 2.6456 KOps/s $\color{#35bf28}+0.82\%$
test_func_call_cm_runtime[False-eager] 0.9636ms 0.7332ms 1.3639 KOps/s 1.3469 KOps/s $\color{#35bf28}+1.26\%$
test_func_call_cm_runtime[False-compile] 0.9237ms 0.7566ms 1.3217 KOps/s 1.3191 KOps/s $\color{#35bf28}+0.19\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4026ms 0.3541ms 2.8239 KOps/s 2.7855 KOps/s $\color{#35bf28}+1.38\%$
test_func_call_cm_runtime[True-eager] 1.0757ms 0.9851ms 1.0152 KOps/s 985.9505 Ops/s $\color{#35bf28}+2.96\%$
test_func_call_cm_runtime[True-compile] 0.8621ms 0.7928ms 1.2614 KOps/s 1.2431 KOps/s $\color{#35bf28}+1.47\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5598ms 0.4042ms 2.4739 KOps/s 2.4530 KOps/s $\color{#35bf28}+0.85\%$
test_vmap_func_call_cm_runtime[eager] 2.4339ms 1.9797ms 505.1333 Ops/s 499.2518 Ops/s $\color{#35bf28}+1.18\%$
test_vmap_func_call_cm_runtime[compile] 0.8713ms 0.8007ms 1.2490 KOps/s 1.2210 KOps/s $\color{#35bf28}+2.29\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4591ms 0.4028ms 2.4825 KOps/s 2.4392 KOps/s $\color{#35bf28}+1.78\%$
test_distributed 2.3180ms 0.2521ms 3.9670 KOps/s 8.7921 KOps/s $\textbf{\color{#d91a1a}-54.88\%}$
test_tdmodule 45.4610μs 12.2235μs 81.8097 KOps/s 62.6537 KOps/s $\textbf{\color{#35bf28}+30.57\%}$
test_tdmodule_dispatch 58.3210μs 24.1694μs 41.3746 KOps/s 34.1465 KOps/s $\textbf{\color{#35bf28}+21.17\%}$
test_tdseq 48.2010μs 13.5728μs 73.6768 KOps/s 60.0121 KOps/s $\textbf{\color{#35bf28}+22.77\%}$
test_tdseq_dispatch 37.2800μs 26.6447μs 37.5309 KOps/s 30.3289 KOps/s $\textbf{\color{#35bf28}+23.75\%}$
test_instantiation_functorch 1.6260ms 1.5288ms 654.0921 Ops/s 636.9413 Ops/s $\color{#35bf28}+2.69\%$
test_exec_functorch 0.2033ms 0.1410ms 7.0908 KOps/s 7.0177 KOps/s $\color{#35bf28}+1.04\%$
test_exec_functional_call 0.3124ms 0.1332ms 7.5062 KOps/s 7.5490 KOps/s $\color{#d91a1a}-0.57\%$
test_exec_td_decorator 0.3639ms 0.1780ms 5.6187 KOps/s 5.6499 KOps/s $\color{#d91a1a}-0.55\%$
test_vmap_mlp_speed_decorator[True-True] 0.7872ms 0.6380ms 1.5673 KOps/s 1.5226 KOps/s $\color{#35bf28}+2.94\%$
test_vmap_mlp_speed_decorator[True-False] 0.7423ms 0.6341ms 1.5771 KOps/s 1.5276 KOps/s $\color{#35bf28}+3.24\%$
test_vmap_mlp_speed_decorator[False-True] 0.6847ms 0.5611ms 1.7823 KOps/s 1.7572 KOps/s $\color{#35bf28}+1.43\%$
test_vmap_mlp_speed_decorator[False-False] 0.6717ms 0.5599ms 1.7860 KOps/s 1.7488 KOps/s $\color{#35bf28}+2.13\%$
test_vmap_transformer_speed_decorator[True-True] 18.6938ms 18.4837ms 54.1016 Ops/s 53.9280 Ops/s $\color{#35bf28}+0.32\%$
test_vmap_transformer_speed_decorator[True-False] 18.7404ms 18.4951ms 54.0685 Ops/s 53.7961 Ops/s $\color{#35bf28}+0.51\%$
test_vmap_transformer_speed_decorator[False-True] 19.4806ms 18.4830ms 54.1039 Ops/s 54.2938 Ops/s $\color{#d91a1a}-0.35\%$
test_vmap_transformer_speed_decorator[False-False] 18.6041ms 18.3481ms 54.5016 Ops/s 54.1154 Ops/s $\color{#35bf28}+0.71\%$
test_to_module_speed[True] 1.0587ms 0.9337ms 1.0710 KOps/s 1.0498 KOps/s $\color{#35bf28}+2.02\%$
test_to_module_speed[False] 1.4088ms 0.9119ms 1.0966 KOps/s 1.0804 KOps/s $\color{#35bf28}+1.49\%$
test_tc_init 79.3520μs 32.8914μs 30.4031 KOps/s 26.7201 KOps/s $\textbf{\color{#35bf28}+13.78\%}$
test_tc_init_nested 0.1169ms 66.9635μs 14.9335 KOps/s 13.5199 KOps/s $\textbf{\color{#35bf28}+10.46\%}$
test_tc_first_layer_tensor 5.1187μs 0.6957μs 1.4375 MOps/s 1.4246 MOps/s $\color{#35bf28}+0.91\%$
test_tc_first_layer_nontensor 42.6210μs 2.3115μs 432.6257 KOps/s 431.1321 KOps/s $\color{#35bf28}+0.35\%$
test_tc_second_layer_tensor 10.2127μs 1.4157μs 706.3678 KOps/s 703.6134 KOps/s $\color{#35bf28}+0.39\%$
test_tc_second_layer_nontensor 31.3400μs 3.0385μs 329.1100 KOps/s 324.0801 KOps/s $\color{#35bf28}+1.55\%$
test_unbind 0.2293s 9.7439ms 102.6279 Ops/s 151.7345 Ops/s $\textbf{\color{#d91a1a}-32.36\%}$
test_full_like 9.4096ms 9.0696ms 110.2587 Ops/s 109.4241 Ops/s $\color{#35bf28}+0.76\%$
test_zeros_like 4.9245ms 4.3115ms 231.9385 Ops/s 114.9307 Ops/s $\textbf{\color{#35bf28}+101.81\%}$
test_ones_like 4.9011ms 4.1785ms 239.3201 Ops/s 236.0018 Ops/s $\color{#35bf28}+1.41\%$
test_clone 11.2713ms 9.0542ms 110.4463 Ops/s 158.1907 Ops/s $\textbf{\color{#d91a1a}-30.18\%}$
test_squeeze 58.9110μs 8.9750μs 111.4209 KOps/s 107.9539 KOps/s $\color{#35bf28}+3.21\%$
test_unsqueeze 0.1193ms 70.3898μs 14.2066 KOps/s 14.0336 KOps/s $\color{#35bf28}+1.23\%$
test_split 0.3984ms 0.1624ms 6.1559 KOps/s 6.2101 KOps/s $\color{#d91a1a}-0.87\%$
test_permute 0.2285ms 0.1767ms 5.6601 KOps/s 5.2751 KOps/s $\textbf{\color{#35bf28}+7.30\%}$
test_stack 52.7711ms 50.6783ms 19.7323 Ops/s 19.6503 Ops/s $\color{#35bf28}+0.42\%$
test_cat 50.6061ms 50.1524ms 19.9392 Ops/s 19.8349 Ops/s $\color{#35bf28}+0.53\%$

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Nov 7, 2024
ghstack-source-id: 16cedee8c0d38da6f377a262d5d7478a66fce07f
Pull Request resolved: #1081
@vmoens vmoens merged commit c40f86e into gh/vmoens/37/base Nov 7, 2024
10 of 24 checks passed
@vmoens vmoens deleted the gh/vmoens/37/head branch November 7, 2024 13:06
@vmoens vmoens added the documentation Improvements or additions to documentation label Nov 7, 2024
vmoens added a commit that referenced this pull request Nov 14, 2024
ghstack-source-id: 16cedee8c0d38da6f377a262d5d7478a66fce07f
Pull Request resolved: #1081

(cherry picked from commit 9607cf0)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants