-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Doc] Better docstring for to_module #1081
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This was referenced Nov 7, 2024
vmoens
added a commit
that referenced
this pull request
Nov 7, 2024
ghstack-source-id: b371e60bc0512b9f5e10d463a10a1317a0e46aef Pull Request resolved: #1081
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 7, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 49.8130μs | 17.5506μs | 56.9781 KOps/s | 54.0663 KOps/s | |
test_plain_set_stack_nested | 60.8540μs | 17.9053μs | 55.8495 KOps/s | 54.2545 KOps/s | |
test_plain_set_nested_inplace | 52.8690μs | 19.4339μs | 51.4564 KOps/s | 48.8952 KOps/s | |
test_plain_set_stack_nested_inplace | 73.5480μs | 19.2654μs | 51.9064 KOps/s | 48.8275 KOps/s | |
test_items | 45.8560μs | 4.1816μs | 239.1444 KOps/s | 238.6413 KOps/s | |
test_items_nested | 0.4801ms | 0.3430ms | 2.9151 KOps/s | 2.8978 KOps/s | |
test_items_nested_locked | 0.5701ms | 0.3414ms | 2.9289 KOps/s | 2.8828 KOps/s | |
test_items_nested_leaf | 0.1492ms | 72.3802μs | 13.8159 KOps/s | 13.9092 KOps/s | |
test_items_stack_nested | 0.5499ms | 0.3465ms | 2.8858 KOps/s | 2.8610 KOps/s | |
test_items_stack_nested_leaf | 0.1941ms | 74.9450μs | 13.3431 KOps/s | 13.1854 KOps/s | |
test_items_stack_nested_locked | 0.5215ms | 0.3440ms | 2.9066 KOps/s | 2.8444 KOps/s | |
test_keys | 22.7420μs | 3.4953μs | 286.0991 KOps/s | 285.4294 KOps/s | |
test_keys_nested | 0.2072ms | 0.1344ms | 7.4420 KOps/s | 7.4360 KOps/s | |
test_keys_nested_locked | 0.7012ms | 0.1410ms | 7.0908 KOps/s | 7.1410 KOps/s | |
test_keys_nested_leaf | 0.2650ms | 0.1168ms | 8.5626 KOps/s | 8.6409 KOps/s | |
test_keys_stack_nested | 0.2309ms | 0.1362ms | 7.3430 KOps/s | 7.4095 KOps/s | |
test_keys_stack_nested_leaf | 0.2254ms | 0.1177ms | 8.4975 KOps/s | 8.6275 KOps/s | |
test_keys_stack_nested_locked | 0.2386ms | 0.1405ms | 7.1152 KOps/s | 7.1275 KOps/s | |
test_values | 17.8132μs | 1.0328μs | 968.2692 KOps/s | 955.5119 KOps/s | |
test_values_nested | 0.1088ms | 54.3828μs | 18.3882 KOps/s | 17.4656 KOps/s | |
test_values_nested_locked | 0.1059ms | 54.5629μs | 18.3275 KOps/s | 18.1024 KOps/s | |
test_values_nested_leaf | 0.1386ms | 59.5209μs | 16.8008 KOps/s | 16.7230 KOps/s | |
test_values_stack_nested | 0.1593ms | 55.8650μs | 17.9003 KOps/s | 17.8217 KOps/s | |
test_values_stack_nested_leaf | 0.1492ms | 60.8313μs | 16.4389 KOps/s | 16.4631 KOps/s | |
test_values_stack_nested_locked | 0.1225ms | 56.1802μs | 17.7999 KOps/s | 16.9179 KOps/s | |
test_membership | 11.7520μs | 0.9180μs | 1.0894 MOps/s | 1.1232 MOps/s | |
test_membership_nested | 46.8170μs | 2.7728μs | 360.6423 KOps/s | 363.7396 KOps/s | |
test_membership_nested_leaf | 22.9930μs | 2.7908μs | 358.3188 KOps/s | 354.7271 KOps/s | |
test_membership_stacked_nested | 36.9490μs | 2.8255μs | 353.9243 KOps/s | 362.8172 KOps/s | |
test_membership_stacked_nested_leaf | 31.2280μs | 2.8240μs | 354.1123 KOps/s | 357.3986 KOps/s | |
test_membership_nested_last | 59.9840μs | 4.2520μs | 235.1853 KOps/s | 252.2620 KOps/s | |
test_membership_nested_leaf_last | 39.4930μs | 4.3106μs | 231.9862 KOps/s | 245.6611 KOps/s | |
test_membership_stacked_nested_last | 26.7600μs | 4.9138μs | 203.5101 KOps/s | 245.1732 KOps/s | |
test_membership_stacked_nested_leaf_last | 68.8580μs | 4.9233μs | 203.1176 KOps/s | 245.0806 KOps/s | |
test_nested_getleaf | 49.0220μs | 10.9396μs | 91.4111 KOps/s | 92.5061 KOps/s | |
test_nested_get | 47.7780μs | 10.4453μs | 95.7369 KOps/s | 98.6142 KOps/s | |
test_stacked_getleaf | 48.6810μs | 10.9610μs | 91.2327 KOps/s | 93.6414 KOps/s | |
test_stacked_get | 44.3020μs | 10.1917μs | 98.1192 KOps/s | 98.6584 KOps/s | |
test_nested_getitemleaf | 52.9850μs | 11.5126μs | 86.8611 KOps/s | 89.7512 KOps/s | |
test_nested_getitem | 38.3910μs | 10.6031μs | 94.3117 KOps/s | 95.9230 KOps/s | |
test_stacked_getitemleaf | 57.5470μs | 11.4181μs | 87.5806 KOps/s | 90.2318 KOps/s | |
test_stacked_getitem | 0.1772ms | 10.8705μs | 91.9923 KOps/s | 96.3128 KOps/s | |
test_lock_nested | 0.8546ms | 0.4355ms | 2.2962 KOps/s | 1.7585 KOps/s | |
test_lock_stack_nested | 0.7610ms | 0.4082ms | 2.4500 KOps/s | 2.3909 KOps/s | |
test_unlock_nested | 0.7159ms | 0.3553ms | 2.8142 KOps/s | 2.7238 KOps/s | |
test_unlock_stack_nested | 0.5208ms | 0.3268ms | 3.0596 KOps/s | 2.9846 KOps/s | |
test_flatten_speed | 0.2184ms | 92.9880μs | 10.7541 KOps/s | 10.8859 KOps/s | |
test_unflatten_speed | 0.6188ms | 0.4864ms | 2.0560 KOps/s | 2.1402 KOps/s | |
test_common_ops | 3.6923ms | 0.7578ms | 1.3195 KOps/s | 1.2357 KOps/s | |
test_creation | 24.9160μs | 2.0600μs | 485.4318 KOps/s | 495.2630 KOps/s | |
test_creation_empty | 60.2920μs | 10.0719μs | 99.2857 KOps/s | 78.6042 KOps/s | |
test_creation_nested_1 | 47.3180μs | 12.7633μs | 78.3498 KOps/s | 64.8678 KOps/s | |
test_creation_nested_2 | 45.8860μs | 16.8606μs | 59.3099 KOps/s | 50.4119 KOps/s | |
test_clone | 0.2161ms | 13.2740μs | 75.3352 KOps/s | 73.8505 KOps/s | |
test_getitem[int] | 1.3984ms | 12.6370μs | 79.1330 KOps/s | 80.6695 KOps/s | |
test_getitem[slice_int] | 0.1609ms | 23.1045μs | 43.2817 KOps/s | 41.7705 KOps/s | |
test_getitem[range] | 0.1710ms | 49.3542μs | 20.2617 KOps/s | 20.2721 KOps/s | |
test_getitem[tuple] | 0.1569ms | 19.5053μs | 51.2682 KOps/s | 50.3715 KOps/s | |
test_getitem[list] | 0.1913ms | 45.1015μs | 22.1722 KOps/s | 22.4502 KOps/s | |
test_setitem_dim[int] | 50.3240μs | 25.9370μs | 38.5550 KOps/s | 38.9056 KOps/s | |
test_setitem_dim[slice_int] | 92.9030μs | 51.1989μs | 19.5317 KOps/s | 19.2921 KOps/s | |
test_setitem_dim[range] | 0.1163ms | 75.2757μs | 13.2845 KOps/s | 13.3292 KOps/s | |
test_setitem_dim[tuple] | 83.1150μs | 41.2217μs | 24.2591 KOps/s | 24.5561 KOps/s | |
test_setitem | 0.2833ms | 19.7399μs | 50.6588 KOps/s | 47.1863 KOps/s | |
test_set | 0.1019ms | 19.2269μs | 52.0105 KOps/s | 47.9265 KOps/s | |
test_set_shared | 1.1779ms | 0.1688ms | 5.9246 KOps/s | 5.7491 KOps/s | |
test_update | 0.3212ms | 21.5324μs | 46.4416 KOps/s | 41.0834 KOps/s | |
test_update_nested | 0.2995ms | 31.8080μs | 31.4387 KOps/s | 29.7482 KOps/s | |
test_update__nested | 0.4913ms | 33.9851μs | 29.4246 KOps/s | 30.1625 KOps/s | |
test_set_nested | 0.1384ms | 21.3289μs | 46.8847 KOps/s | 43.7974 KOps/s | |
test_set_nested_new | 0.2347ms | 26.0840μs | 38.3377 KOps/s | 36.7357 KOps/s | |
test_select | 0.2397ms | 41.9553μs | 23.8349 KOps/s | 22.5031 KOps/s | |
test_select_nested | 0.1278ms | 60.2183μs | 16.6062 KOps/s | 16.5401 KOps/s | |
test_exclude_nested | 0.1497ms | 75.7749μs | 13.1970 KOps/s | 13.2655 KOps/s | |
test_empty[True] | 0.5663ms | 0.3516ms | 2.8439 KOps/s | 2.8665 KOps/s | |
test_empty[False] | 10.8028μs | 1.2196μs | 819.9355 KOps/s | 825.3186 KOps/s | |
test_unbind_speed | 0.4868ms | 0.2598ms | 3.8498 KOps/s | 3.7749 KOps/s | |
test_unbind_speed_stack0 | 0.3686ms | 0.2573ms | 3.8863 KOps/s | 3.8601 KOps/s | |
test_unbind_speed_stack1 | 0.1084s | 0.7638ms | 1.3092 KOps/s | 1.4226 KOps/s | |
test_split | 0.1024s | 1.7128ms | 583.8244 Ops/s | 567.9099 Ops/s | |
test_chunk | 0.1074s | 1.7273ms | 578.9476 Ops/s | 572.6750 Ops/s | |
test_consolidate_njt[False-None] | 10.2027ms | 8.2223ms | 121.6203 Ops/s | 123.5135 Ops/s | |
test_creation[device0] | 0.2279ms | 90.1938μs | 11.0872 KOps/s | 10.7463 KOps/s | |
test_creation_from_tensor | 3.3960ms | 93.8928μs | 10.6504 KOps/s | 10.6348 KOps/s | |
test_add_one[memmap_tensor0] | 0.2140ms | 4.7937μs | 208.6065 KOps/s | 209.5380 KOps/s | |
test_contiguous[memmap_tensor0] | 22.8430μs | 0.5225μs | 1.9139 MOps/s | 2.0031 MOps/s | |
test_stack[memmap_tensor0] | 64.3890μs | 3.5179μs | 284.2635 KOps/s | 285.2734 KOps/s | |
test_memmaptd_index | 0.4573ms | 0.2380ms | 4.2012 KOps/s | 4.2302 KOps/s | |
test_memmaptd_index_astensor | 0.5968ms | 0.3166ms | 3.1583 KOps/s | 3.1855 KOps/s | |
test_memmaptd_index_op | 1.0086ms | 0.5730ms | 1.7452 KOps/s | 1.6314 KOps/s | |
test_serialize_model | 0.1264s | 0.1206s | 8.2931 Ops/s | 7.4973 Ops/s | |
test_serialize_model_pickle | 0.4581s | 0.3948s | 2.5330 Ops/s | 2.4497 Ops/s | |
test_serialize_weights | 0.1236s | 0.1154s | 8.6620 Ops/s | 8.5325 Ops/s | |
test_serialize_weights_returnearly | 0.1812s | 0.1627s | 6.1455 Ops/s | 6.3704 Ops/s | |
test_serialize_weights_pickle | 0.5389s | 0.4572s | 2.1873 Ops/s | 2.3326 Ops/s | |
test_serialize_weights_filesystem | 0.1465s | 0.1412s | 7.0806 Ops/s | 6.4058 Ops/s | |
test_serialize_model_filesystem | 0.1556s | 0.1481s | 6.7518 Ops/s | 6.5250 Ops/s | |
test_reshape_pytree | 60.8830μs | 27.6313μs | 36.1909 KOps/s | 37.3735 KOps/s | |
test_reshape_td | 75.6610μs | 32.7001μs | 30.5809 KOps/s | 31.3540 KOps/s | |
test_view_pytree | 61.8550μs | 27.6686μs | 36.1420 KOps/s | 36.9584 KOps/s | |
test_view_td | 93.1640μs | 37.6927μs | 26.5303 KOps/s | 27.3290 KOps/s | |
test_unbind_pytree | 88.5270μs | 29.8190μs | 33.5356 KOps/s | 33.1491 KOps/s | |
test_unbind_td | 0.3075ms | 38.0548μs | 26.2779 KOps/s | 26.1992 KOps/s | |
test_split_pytree | 97.9090μs | 30.1387μs | 33.1799 KOps/s | 33.9493 KOps/s | |
test_split_td | 0.1964ms | 43.0886μs | 23.2080 KOps/s | 23.0043 KOps/s | |
test_add_pytree | 81.0300μs | 35.7449μs | 27.9760 KOps/s | 27.6405 KOps/s | |
test_add_td | 0.1544ms | 54.3371μs | 18.4036 KOps/s | 17.8856 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1507ms | 65.4212μs | 15.2856 KOps/s | 16.0987 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 3.9819ms | 0.1627ms | 6.1448 KOps/s | 6.2951 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1024ms | 47.8144μs | 20.9142 KOps/s | 21.9625 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2729ms | 0.1182ms | 8.4627 KOps/s | 8.3981 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 75.0400μs | 26.1462μs | 38.2464 KOps/s | 39.4508 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1268ms | 55.3901μs | 18.0538 KOps/s | 18.5824 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1382ms | 79.4086μs | 12.5931 KOps/s | 12.4735 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1489ms | 69.6911μs | 14.3490 KOps/s | 14.6863 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2301ms | 0.1078ms | 9.2766 KOps/s | 9.5248 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4192ms | 0.2064ms | 4.8461 KOps/s | 5.0910 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1127ms | 46.6525μs | 21.4351 KOps/s | 22.8150 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4853ms | 62.7955μs | 15.9247 KOps/s | 16.3140 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2240ms | 0.1047ms | 9.5486 KOps/s | 9.7184 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.2965ms | 0.2050ms | 4.8769 KOps/s | 4.9536 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4777ms | 0.2139ms | 4.6755 KOps/s | 4.7806 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2144ms | 0.1095ms | 9.1298 KOps/s | 9.6198 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1831ms | 56.7361μs | 17.6255 KOps/s | 18.3096 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1049ms | 48.8583μs | 20.4674 KOps/s | 21.7528 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 1.5854ms | 0.1623ms | 6.1620 KOps/s | 6.2750 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2213ms | 0.1052ms | 9.5098 KOps/s | 9.6778 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 97.9900μs | 21.4633μs | 46.5912 KOps/s | 48.5734 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1422ms | 59.7777μs | 16.7287 KOps/s | 16.4445 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1309ms | 81.6634μs | 12.2454 KOps/s | 11.9483 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1234ms | 69.4373μs | 14.4015 KOps/s | 14.3524 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2876ms | 0.2086ms | 4.7943 KOps/s | 4.8502 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.5319ms | 1.2830ms | 779.4467 Ops/s | 785.8980 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3037ms | 0.2015ms | 4.9630 KOps/s | 4.9832 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 0.9633ms | 0.7818ms | 1.2792 KOps/s | 1.2915 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5707ms | 0.4638ms | 2.1563 KOps/s | 2.2093 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.8386ms | 2.5922ms | 385.7701 Ops/s | 361.2210 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 84.6170μs | 36.8170μs | 27.1614 KOps/s | 28.0999 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5706ms | 32.4191μs | 30.8460 KOps/s | 29.4447 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1159ms | 31.4960μs | 31.7501 KOps/s | 33.9692 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 63.9290μs | 23.0774μs | 43.3324 KOps/s | 42.8696 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 88.1950μs | 31.7550μs | 31.4911 KOps/s | 33.8619 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 65.7430μs | 23.3383μs | 42.8481 KOps/s | 43.3492 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1350ms | 52.5903μs | 19.0149 KOps/s | 19.3753 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5318ms | 18.9217μs | 52.8493 KOps/s | 50.6763 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1266ms | 46.7740μs | 21.3794 KOps/s | 22.1668 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 58.8390μs | 18.6439μs | 53.6369 KOps/s | 53.0871 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1279ms | 47.4838μs | 21.0598 KOps/s | 22.1778 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 55.0330μs | 18.6504μs | 53.6183 KOps/s | 52.6962 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1168ms | 54.3168μs | 18.4105 KOps/s | 19.1777 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0045ms | 19.1763μs | 52.1478 KOps/s | 50.8369 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1122ms | 47.8061μs | 20.9178 KOps/s | 22.0854 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 66.4540μs | 18.6043μs | 53.7510 KOps/s | 52.4952 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1020ms | 47.3336μs | 21.1266 KOps/s | 22.0413 KOps/s | |
test_compile_indexing[int-pytree-eager] | 56.3750μs | 18.7556μs | 53.3175 KOps/s | 53.7103 KOps/s | |
test_mod_add[eager] | 76.8530μs | 26.6111μs | 37.5783 KOps/s | 35.7512 KOps/s | |
test_mod_add[compile] | 0.1053ms | 45.1663μs | 22.1404 KOps/s | 22.2956 KOps/s | |
test_mod_add[compile-overhead] | 99.4650μs | 46.2105μs | 21.6401 KOps/s | 22.3412 KOps/s | |
test_mod_wrap[eager] | 0.4102ms | 0.2181ms | 4.5855 KOps/s | 4.5699 KOps/s | |
test_mod_wrap[compile] | 2.0324ms | 0.2061ms | 4.8510 KOps/s | 4.7756 KOps/s | |
test_mod_wrap[compile-overhead] | 2.2640ms | 0.2052ms | 4.8725 KOps/s | 4.7480 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.9992ms | 11.7977ms | 84.7623 Ops/s | 77.4870 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.1285ms | 12.7165ms | 78.6380 Ops/s | 80.2305 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 14.0872ms | 12.4155ms | 80.5443 Ops/s | 71.9551 Ops/s | |
test_seq_add[eager] | 0.2001ms | 91.8497μs | 10.8874 KOps/s | 10.6522 KOps/s | |
test_seq_add[compile] | 0.1375ms | 61.3962μs | 16.2877 KOps/s | 16.8422 KOps/s | |
test_seq_add[compile-overhead] | 0.1256ms | 59.3923μs | 16.8372 KOps/s | 16.7464 KOps/s | |
test_seq_wrap[eager] | 0.6791ms | 0.3898ms | 2.5652 KOps/s | 2.3792 KOps/s | |
test_seq_wrap[compile] | 1.3945ms | 0.2292ms | 4.3631 KOps/s | 4.4324 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3620ms | 0.2285ms | 4.3771 KOps/s | 4.3953 KOps/s | |
test_func_call_runtime[False-eager] | 1.3516ms | 0.5532ms | 1.8077 KOps/s | 1.8341 KOps/s | |
test_func_call_runtime[False-compile] | 0.7772ms | 0.4386ms | 2.2799 KOps/s | 2.3196 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5601ms | 0.4372ms | 2.2871 KOps/s | 2.3055 KOps/s | |
test_func_call_runtime[True-eager] | 1.0915ms | 0.7577ms | 1.3198 KOps/s | 1.3056 KOps/s | |
test_func_call_runtime[True-compile] | 0.6387ms | 0.4766ms | 2.0984 KOps/s | 2.1107 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.9779ms | 0.4806ms | 2.0809 KOps/s | 2.1162 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8306ms | 0.5486ms | 1.8229 KOps/s | 1.8112 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5149ms | 0.4371ms | 2.2879 KOps/s | 2.3017 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6089ms | 0.4364ms | 2.2917 KOps/s | 2.3101 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4501ms | 0.9058ms | 1.1040 KOps/s | 1.0986 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6628ms | 0.5027ms | 1.9891 KOps/s | 1.9937 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6839ms | 0.5049ms | 1.9807 KOps/s | 1.9793 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 3.2953ms | 1.9314ms | 517.7599 Ops/s | 520.5382 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.2525ms | 0.5346ms | 1.8705 KOps/s | 1.9035 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7396ms | 0.5291ms | 1.8899 KOps/s | 1.9288 KOps/s | |
test_distributed | 0.2731ms | 0.1278ms | 7.8253 KOps/s | 7.7412 KOps/s | |
test_tdmodule | 63.5580μs | 19.5331μs | 51.1953 KOps/s | 49.8686 KOps/s | |
test_tdmodule_dispatch | 55.4730μs | 36.0428μs | 27.7448 KOps/s | 25.5304 KOps/s | |
test_tdseq | 48.4000μs | 21.4407μs | 46.6402 KOps/s | 44.8196 KOps/s | |
test_tdseq_dispatch | 75.9810μs | 42.0492μs | 23.7816 KOps/s | 22.2966 KOps/s | |
test_instantiation_functorch | 1.9593ms | 1.5661ms | 638.5215 Ops/s | 642.0066 Ops/s | |
test_exec_functorch | 0.2834ms | 0.1835ms | 5.4491 KOps/s | 5.4208 KOps/s | |
test_exec_functional_call | 0.3521ms | 0.1802ms | 5.5480 KOps/s | 5.5590 KOps/s | |
test_exec_td_decorator | 0.5355ms | 0.2309ms | 4.3311 KOps/s | 4.2451 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8639ms | 0.6475ms | 1.5443 KOps/s | 1.5477 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0114ms | 0.6482ms | 1.5427 KOps/s | 1.5510 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 1.9909ms | 0.5313ms | 1.8823 KOps/s | 1.9045 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8404ms | 0.5306ms | 1.8846 KOps/s | 1.8994 KOps/s | |
test_to_module_speed[True] | 1.5563ms | 1.2830ms | 779.4063 Ops/s | 770.1618 Ops/s | |
test_to_module_speed[False] | 1.5249ms | 1.2543ms | 797.2430 Ops/s | 777.1090 Ops/s | |
test_tc_init | 92.9420μs | 46.3708μs | 21.5653 KOps/s | 21.1816 KOps/s | |
test_tc_init_nested | 0.1596ms | 92.9522μs | 10.7582 KOps/s | 10.8271 KOps/s | |
test_tc_first_layer_tensor | 17.6430μs | 1.5357μs | 651.1480 KOps/s | 662.7365 KOps/s | |
test_tc_first_layer_nontensor | 26.8700μs | 4.7939μs | 208.6003 KOps/s | 214.6707 KOps/s | |
test_tc_second_layer_tensor | 36.3880μs | 2.8826μs | 346.9061 KOps/s | 361.1499 KOps/s | |
test_tc_second_layer_nontensor | 30.0560μs | 6.0816μs | 164.4304 KOps/s | 168.6973 KOps/s | |
test_unbind | 0.2343s | 12.0860ms | 82.7403 Ops/s | 72.6631 Ops/s | |
test_full_like | 8.7902ms | 7.4238ms | 134.7023 Ops/s | 116.3857 Ops/s | |
test_zeros_like | 3.5237ms | 2.8892ms | 346.1205 Ops/s | 326.2871 Ops/s | |
test_ones_like | 4.4139ms | 3.3520ms | 298.3336 Ops/s | 294.8646 Ops/s | |
test_clone | 6.3590ms | 5.2548ms | 190.3025 Ops/s | 181.2576 Ops/s | |
test_squeeze | 62.9270μs | 12.3228μs | 81.1506 KOps/s | 80.1847 KOps/s | |
test_unsqueeze | 0.3278ms | 90.0997μs | 11.0988 KOps/s | 11.4872 KOps/s | |
test_split | 1.3443ms | 0.1913ms | 5.2262 KOps/s | 5.3388 KOps/s | |
test_permute | 0.2971ms | 0.2171ms | 4.6059 KOps/s | 4.6401 KOps/s | |
test_stack | 29.8336ms | 26.1786ms | 38.1991 Ops/s | 39.7827 Ops/s | |
test_cat | 32.1630ms | 26.1071ms | 38.3038 Ops/s | 40.5263 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.8910μs | 9.8350μs | 101.6780 KOps/s | 85.9992 KOps/s | |
test_plain_set_stack_nested | 27.4500μs | 9.8248μs | 101.7834 KOps/s | 85.5705 KOps/s | |
test_plain_set_nested_inplace | 34.7600μs | 10.6957μs | 93.4951 KOps/s | 79.9603 KOps/s | |
test_plain_set_stack_nested_inplace | 37.9200μs | 10.6443μs | 93.9469 KOps/s | 81.1177 KOps/s | |
test_items | 33.3200μs | 2.8847μs | 346.6582 KOps/s | 343.9954 KOps/s | |
test_items_nested | 0.3971ms | 0.3385ms | 2.9546 KOps/s | 3.1407 KOps/s | |
test_items_nested_locked | 0.4108ms | 0.3394ms | 2.9466 KOps/s | 3.1325 KOps/s | |
test_items_nested_leaf | 82.6820μs | 58.3515μs | 17.1375 KOps/s | 17.3398 KOps/s | |
test_items_stack_nested | 0.3978ms | 0.3438ms | 2.9088 KOps/s | 3.1456 KOps/s | |
test_items_stack_nested_leaf | 84.3610μs | 59.5254μs | 16.7995 KOps/s | 16.8988 KOps/s | |
test_items_stack_nested_locked | 0.4369ms | 0.3416ms | 2.9271 KOps/s | 3.1356 KOps/s | |
test_keys | 32.2200μs | 3.4818μs | 287.2106 KOps/s | 290.0261 KOps/s | |
test_keys_nested | 0.1125ms | 69.7090μs | 14.3453 KOps/s | 14.2885 KOps/s | |
test_keys_nested_locked | 0.7393ms | 75.9470μs | 13.1671 KOps/s | 13.2695 KOps/s | |
test_keys_nested_leaf | 93.8320μs | 60.9734μs | 16.4006 KOps/s | 16.3688 KOps/s | |
test_keys_stack_nested | 0.1026ms | 69.6248μs | 14.3627 KOps/s | 14.1621 KOps/s | |
test_keys_stack_nested_leaf | 89.7920μs | 61.3078μs | 16.3111 KOps/s | 16.1697 KOps/s | |
test_keys_stack_nested_locked | 0.1109ms | 75.3108μs | 13.2783 KOps/s | 13.0851 KOps/s | |
test_values | 6.1968μs | 0.8512μs | 1.1748 MOps/s | 1.1556 MOps/s | |
test_values_nested | 62.5310μs | 31.1711μs | 32.0810 KOps/s | 32.2585 KOps/s | |
test_values_nested_locked | 62.2820μs | 32.7233μs | 30.5593 KOps/s | 30.5661 KOps/s | |
test_values_nested_leaf | 58.7710μs | 33.5570μs | 29.8000 KOps/s | 29.7577 KOps/s | |
test_values_stack_nested | 60.2310μs | 31.8940μs | 31.3539 KOps/s | 31.5201 KOps/s | |
test_values_stack_nested_leaf | 62.7710μs | 34.3679μs | 29.0969 KOps/s | 29.1928 KOps/s | |
test_values_stack_nested_locked | 64.5010μs | 32.9515μs | 30.3476 KOps/s | 30.0949 KOps/s | |
test_membership | 1.7766μs | 0.5093μs | 1.9634 MOps/s | 1.9615 MOps/s | |
test_membership_nested | 20.0605μs | 1.9295μs | 518.2607 KOps/s | 523.4373 KOps/s | |
test_membership_nested_leaf | 16.6855μs | 1.9163μs | 521.8423 KOps/s | 530.8309 KOps/s | |
test_membership_stacked_nested | 24.0510μs | 2.0349μs | 491.4166 KOps/s | 517.2863 KOps/s | |
test_membership_stacked_nested_leaf | 28.4000μs | 2.0241μs | 494.0403 KOps/s | 515.6916 KOps/s | |
test_membership_nested_last | 38.3000μs | 2.8415μs | 351.9220 KOps/s | 355.4737 KOps/s | |
test_membership_nested_leaf_last | 42.4210μs | 2.8315μs | 353.1644 KOps/s | 355.0868 KOps/s | |
test_membership_stacked_nested_last | 39.3400μs | 7.7896μs | 128.3769 KOps/s | 309.6010 KOps/s | |
test_membership_stacked_nested_leaf_last | 48.3710μs | 7.7396μs | 129.2055 KOps/s | 309.6174 KOps/s | |
test_nested_getleaf | 29.5000μs | 6.0146μs | 166.2608 KOps/s | 166.3793 KOps/s | |
test_nested_get | 37.0800μs | 5.7111μs | 175.0989 KOps/s | 176.0114 KOps/s | |
test_stacked_getleaf | 44.5610μs | 6.0005μs | 166.6542 KOps/s | 166.8295 KOps/s | |
test_stacked_get | 32.9310μs | 5.7101μs | 175.1280 KOps/s | 175.4113 KOps/s | |
test_nested_getitemleaf | 43.7010μs | 6.1083μs | 163.7129 KOps/s | 165.2918 KOps/s | |
test_nested_getitem | 43.5900μs | 5.8243μs | 171.6935 KOps/s | 173.0990 KOps/s | |
test_stacked_getitemleaf | 31.1400μs | 6.1298μs | 163.1372 KOps/s | 164.6045 KOps/s | |
test_stacked_getitem | 52.6210μs | 5.8132μs | 172.0220 KOps/s | 172.9959 KOps/s | |
test_lock_nested | 9.1151ms | 0.3711ms | 2.6946 KOps/s | 2.7359 KOps/s | |
test_lock_stack_nested | 0.3841ms | 0.3269ms | 3.0589 KOps/s | 2.9750 KOps/s | |
test_unlock_nested | 1.0435ms | 0.3061ms | 3.2671 KOps/s | 3.2973 KOps/s | |
test_unlock_stack_nested | 0.3324ms | 0.2660ms | 3.7599 KOps/s | 3.6459 KOps/s | |
test_flatten_speed | 0.1045ms | 72.8223μs | 13.7321 KOps/s | 13.8280 KOps/s | |
test_unflatten_speed | 0.3921ms | 0.2910ms | 3.4359 KOps/s | 3.4513 KOps/s | |
test_common_ops | 1.6467ms | 0.5463ms | 1.8304 KOps/s | 1.6572 KOps/s | |
test_creation | 0.1658ms | 1.4803μs | 675.5554 KOps/s | 685.0684 KOps/s | |
test_creation_empty | 31.4100μs | 5.9303μs | 168.6257 KOps/s | 102.7541 KOps/s | |
test_creation_nested_1 | 42.9610μs | 7.4664μs | 133.9325 KOps/s | 88.7533 KOps/s | |
test_creation_nested_2 | 37.6000μs | 9.9463μs | 100.5399 KOps/s | 71.8657 KOps/s | |
test_clone | 0.1497ms | 10.0383μs | 99.6188 KOps/s | 98.8062 KOps/s | |
test_getitem[int] | 1.7473ms | 10.8342μs | 92.3002 KOps/s | 92.1270 KOps/s | |
test_getitem[slice_int] | 0.1133ms | 21.5873μs | 46.3235 KOps/s | 46.6597 KOps/s | |
test_getitem[range] | 0.1319ms | 36.9158μs | 27.0887 KOps/s | 27.5450 KOps/s | |
test_getitem[tuple] | 0.1085ms | 17.8703μs | 55.9588 KOps/s | 55.3038 KOps/s | |
test_getitem[list] | 0.1262ms | 32.4486μs | 30.8180 KOps/s | 31.1703 KOps/s | |
test_setitem_dim[int] | 28.1310μs | 18.2568μs | 54.7741 KOps/s | 55.4136 KOps/s | |
test_setitem_dim[slice_int] | 57.0910μs | 37.0250μs | 27.0088 KOps/s | 27.7916 KOps/s | |
test_setitem_dim[range] | 74.7110μs | 50.6317μs | 19.7505 KOps/s | 19.8383 KOps/s | |
test_setitem_dim[tuple] | 53.2710μs | 30.1988μs | 33.1139 KOps/s | 32.3665 KOps/s | |
test_setitem | 0.1343ms | 13.3002μs | 75.1870 KOps/s | 65.5805 KOps/s | |
test_set | 0.1492ms | 12.8373μs | 77.8980 KOps/s | 68.2627 KOps/s | |
test_set_shared | 1.4183ms | 0.1447ms | 6.9111 KOps/s | 6.9121 KOps/s | |
test_update | 0.4568ms | 14.4590μs | 69.1609 KOps/s | 54.6497 KOps/s | |
test_update_nested | 0.1545ms | 19.2730μs | 51.8862 KOps/s | 43.0435 KOps/s | |
test_update__nested | 0.4577ms | 24.1374μs | 41.4295 KOps/s | 42.8246 KOps/s | |
test_set_nested | 0.1462ms | 14.0188μs | 71.3328 KOps/s | 62.6777 KOps/s | |
test_set_nested_new | 0.1490ms | 16.4130μs | 60.9273 KOps/s | 55.2279 KOps/s | |
test_select | 0.1925ms | 27.2958μs | 36.6357 KOps/s | 33.5231 KOps/s | |
test_select_nested | 72.8220μs | 41.7803μs | 23.9347 KOps/s | 23.8231 KOps/s | |
test_exclude_nested | 0.1034ms | 59.5058μs | 16.8051 KOps/s | 16.8868 KOps/s | |
test_empty[True] | 0.3111ms | 0.2587ms | 3.8656 KOps/s | 3.8992 KOps/s | |
test_empty[False] | 3.3731μs | 0.7400μs | 1.3513 MOps/s | 1.3409 MOps/s | |
test_to | 83.1110μs | 54.4781μs | 18.3560 KOps/s | 18.4572 KOps/s | |
test_to_nonblocking | 0.1001ms | 45.1707μs | 22.1382 KOps/s | 22.3492 KOps/s | |
test_unbind_speed | 0.2831ms | 0.2317ms | 4.3156 KOps/s | 4.3114 KOps/s | |
test_unbind_speed_stack0 | 0.2710ms | 0.2225ms | 4.4948 KOps/s | 4.2998 KOps/s | |
test_unbind_speed_stack1 | 92.9125ms | 0.6285ms | 1.5911 KOps/s | 1.7069 KOps/s | |
test_split | 93.6014ms | 1.7026ms | 587.3519 Ops/s | 598.0320 Ops/s | |
test_chunk | 96.8103ms | 1.7120ms | 584.0994 Ops/s | 593.5975 Ops/s | |
test_consolidate[False-None] | 96.2660ms | 2.8874ms | 346.3353 Ops/s | 343.7670 Ops/s | |
test_consolidate[default-None] | 1.7690ms | 1.6761ms | 596.6065 Ops/s | 593.7519 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8468ms | 1.7165ms | 582.5923 Ops/s | 584.6316 Ops/s | |
test_consolidate_njt[False-None] | 7.0141ms | 6.6993ms | 149.2703 Ops/s | 149.9641 Ops/s | |
test_to[False-False-None] | 1.7362ms | 1.6501ms | 606.0356 Ops/s | 595.3829 Ops/s | |
test_to[True-False-None] | 1.6049ms | 1.3421ms | 745.1037 Ops/s | 743.9402 Ops/s | |
test_to[within-False-None] | 4.2593ms | 4.1257ms | 242.3829 Ops/s | 241.8760 Ops/s | |
test_to[True-default-None] | 5.5859ms | 5.0888ms | 196.5094 Ops/s | 193.5804 Ops/s | |
test_to_njt[False-False-None] | 7.2205ms | 7.0033ms | 142.7889 Ops/s | 142.2542 Ops/s | |
test_to_njt[True-False-None] | 5.8504ms | 5.5616ms | 179.8042 Ops/s | 179.4113 Ops/s | |
test_to_njt[within-False-None] | 12.6552ms | 12.2518ms | 81.6205 Ops/s | 81.6128 Ops/s | |
test_creation[device0] | 0.4755ms | 79.1200μs | 12.6390 KOps/s | 12.7433 KOps/s | |
test_creation_from_tensor | 0.4724ms | 82.7431μs | 12.0856 KOps/s | 12.0272 KOps/s | |
test_add_one[memmap_tensor0] | 0.4035ms | 6.2108μs | 161.0086 KOps/s | 165.1135 KOps/s | |
test_contiguous[memmap_tensor0] | 2.7596μs | 0.4224μs | 2.3672 MOps/s | 2.4315 MOps/s | |
test_stack[memmap_tensor0] | 44.0700μs | 4.5422μs | 220.1591 KOps/s | 231.4742 KOps/s | |
test_memmaptd_index | 1.9918ms | 0.2592ms | 3.8573 KOps/s | 3.9005 KOps/s | |
test_memmaptd_index_astensor | 1.0097ms | 0.3145ms | 3.1798 KOps/s | 3.1670 KOps/s | |
test_memmaptd_index_op | 0.9520ms | 0.5366ms | 1.8636 KOps/s | 1.6673 KOps/s | |
test_serialize_model | 0.1318s | 0.1302s | 7.6801 Ops/s | 5.3450 Ops/s | |
test_serialize_model_pickle | 1.3511s | 1.2157s | 0.8226 Ops/s | 0.8210 Ops/s | |
test_serialize_weights | 0.1297s | 0.1291s | 7.7438 Ops/s | 7.6913 Ops/s | |
test_serialize_weights_returnearly | 0.5767s | 63.2200ms | 15.8178 Ops/s | 23.6371 Ops/s | |
test_serialize_weights_pickle | 1.3816s | 1.2137s | 0.8239 Ops/s | 0.8203 Ops/s | |
test_reshape_pytree | 69.7910μs | 22.8050μs | 43.8500 KOps/s | 43.5464 KOps/s | |
test_reshape_td | 63.0010μs | 27.0412μs | 36.9806 KOps/s | 36.9417 KOps/s | |
test_view_pytree | 94.9620μs | 22.3833μs | 44.6762 KOps/s | 44.2357 KOps/s | |
test_view_td | 0.1102ms | 30.2149μs | 33.0963 KOps/s | 32.5999 KOps/s | |
test_unbind_pytree | 58.1510μs | 27.8167μs | 35.9496 KOps/s | 36.0665 KOps/s | |
test_unbind_td | 0.6268ms | 34.9176μs | 28.6389 KOps/s | 28.1327 KOps/s | |
test_split_pytree | 73.4820μs | 31.0132μs | 32.2443 KOps/s | 32.6702 KOps/s | |
test_split_td | 0.8213ms | 41.1713μs | 24.2888 KOps/s | 24.2993 KOps/s | |
test_add_pytree | 70.2320μs | 32.4339μs | 30.8320 KOps/s | 30.8550 KOps/s | |
test_add_td | 69.5610μs | 41.4166μs | 24.1449 KOps/s | 20.5833 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1708ms | 0.1205ms | 8.3010 KOps/s | 8.0799 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2198ms | 0.1280ms | 7.8108 KOps/s | 7.8995 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1645ms | 99.7939μs | 10.0206 KOps/s | 9.9656 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.3701ms | 0.1492ms | 6.7009 KOps/s | 6.7140 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 59.9110μs | 22.9587μs | 43.5565 KOps/s | 44.4717 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 61.7910μs | 27.3648μs | 36.5433 KOps/s | 35.9772 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2139ms | 62.4629μs | 16.0095 KOps/s | 15.3424 KOps/s | |
test_compile_copy_nested[pytree-eager] | 79.2110μs | 49.5499μs | 20.1817 KOps/s | 19.8116 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1799ms | 0.1421ms | 7.0377 KOps/s | 6.7387 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2995ms | 0.2092ms | 4.7802 KOps/s | 4.8004 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1428ms | 97.0362μs | 10.3054 KOps/s | 10.0806 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1253ms | 52.4985μs | 19.0482 KOps/s | 19.2289 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1944ms | 0.1440ms | 6.9453 KOps/s | 6.9131 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5389ms | 0.4767ms | 2.0978 KOps/s | 2.1042 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4192ms | 0.2474ms | 4.0413 KOps/s | 4.0186 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1979ms | 0.1432ms | 6.9814 KOps/s | 7.0356 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1504ms | 61.0646μs | 16.3761 KOps/s | 16.4636 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1853ms | 98.2928μs | 10.1737 KOps/s | 9.7966 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4964ms | 0.4110ms | 2.4328 KOps/s | 2.4769 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1970ms | 0.1378ms | 7.2544 KOps/s | 7.0932 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 77.7510μs | 21.0562μs | 47.4919 KOps/s | 53.4894 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 69.9020μs | 27.1547μs | 36.8260 KOps/s | 36.6556 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1014ms | 69.5274μs | 14.3828 KOps/s | 14.5127 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1049ms | 52.0388μs | 19.2164 KOps/s | 19.1281 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6144ms | 0.4394ms | 2.2757 KOps/s | 2.2340 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7022ms | 2.5474ms | 392.5596 Ops/s | 390.3655 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.6038ms | 0.4320ms | 2.3148 KOps/s | 2.2757 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7546ms | 2.6206ms | 381.5979 Ops/s | 384.2870 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.3295ms | 0.1176ms | 8.5038 KOps/s | 8.6088 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5652ms | 77.8648μs | 12.8428 KOps/s | 12.7436 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.3757ms | 0.1040ms | 9.6140 KOps/s | 9.7135 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1737ms | 69.9674μs | 14.2924 KOps/s | 14.9021 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1596ms | 0.1105ms | 9.0515 KOps/s | 9.6121 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1185ms | 70.7568μs | 14.1329 KOps/s | 14.7536 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1520ms | 0.1050ms | 9.5249 KOps/s | 9.7825 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1429ms | 18.0109μs | 55.5218 KOps/s | 49.8896 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1381ms | 95.9429μs | 10.4229 KOps/s | 9.8098 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 59.0410μs | 16.9700μs | 58.9275 KOps/s | 60.0554 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1552ms | 96.9845μs | 10.3109 KOps/s | 10.1018 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 53.3510μs | 17.2893μs | 57.8391 KOps/s | 59.9534 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.4835ms | 0.1024ms | 9.7638 KOps/s | 9.6745 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6828ms | 18.2083μs | 54.9200 KOps/s | 55.1789 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.4871ms | 97.3792μs | 10.2691 KOps/s | 9.7478 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1581ms | 16.8177μs | 59.4612 KOps/s | 60.2353 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.4839ms | 97.7378μs | 10.2315 KOps/s | 10.1588 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.3991ms | 16.8415μs | 59.3770 KOps/s | 60.1274 KOps/s | |
test_mod_add[eager] | 0.4316ms | 31.5618μs | 31.6839 KOps/s | 30.8273 KOps/s | |
test_mod_add[compile] | 0.1467ms | 80.9579μs | 12.3521 KOps/s | 12.9278 KOps/s | |
test_mod_add[compile-overhead] | 0.3123ms | 0.1633ms | 6.1228 KOps/s | 5.7350 KOps/s | |
test_mod_wrap[eager] | 0.6507ms | 0.2479ms | 4.0336 KOps/s | 3.8830 KOps/s | |
test_mod_wrap[compile] | 1.5409ms | 0.2800ms | 3.5713 KOps/s | 3.5071 KOps/s | |
test_mod_wrap[compile-overhead] | 8.1134ms | 4.2049ms | 237.8157 Ops/s | 238.4703 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4273ms | 1.3079ms | 764.5751 Ops/s | 700.9781 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3522ms | 1.2512ms | 799.2106 Ops/s | 731.7698 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3349ms | 0.8966ms | 1.1153 KOps/s | 976.4793 Ops/s | |
test_seq_add[eager] | 0.1990ms | 95.7986μs | 10.4386 KOps/s | 9.6958 KOps/s | |
test_seq_add[compile] | 0.1388ms | 85.7600μs | 11.6605 KOps/s | 11.5234 KOps/s | |
test_seq_add[compile-overhead] | 0.1678ms | 0.1263ms | 7.9189 KOps/s | 7.8830 KOps/s | |
test_seq_wrap[eager] | 0.4826ms | 0.3727ms | 2.6833 KOps/s | 2.5468 KOps/s | |
test_seq_wrap[compile] | 0.3795ms | 0.2988ms | 3.3469 KOps/s | 3.3067 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2726ms | 0.2171ms | 4.6065 KOps/s | 4.5013 KOps/s | |
test_func_call_runtime[False-eager] | 0.8369ms | 0.7203ms | 1.3883 KOps/s | 1.2844 KOps/s | |
test_func_call_runtime[False-compile] | 0.7896ms | 0.7462ms | 1.3401 KOps/s | 1.3243 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4384ms | 0.3527ms | 2.8355 KOps/s | 2.7918 KOps/s | |
test_func_call_runtime[True-eager] | 1.0333ms | 0.8896ms | 1.1242 KOps/s | 1.0943 KOps/s | |
test_func_call_runtime[True-compile] | 0.8660ms | 0.7656ms | 1.3062 KOps/s | 1.2883 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4274ms | 0.3749ms | 2.6673 KOps/s | 2.6456 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9636ms | 0.7332ms | 1.3639 KOps/s | 1.3469 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.9237ms | 0.7566ms | 1.3217 KOps/s | 1.3191 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4026ms | 0.3541ms | 2.8239 KOps/s | 2.7855 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0757ms | 0.9851ms | 1.0152 KOps/s | 985.9505 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.8621ms | 0.7928ms | 1.2614 KOps/s | 1.2431 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5598ms | 0.4042ms | 2.4739 KOps/s | 2.4530 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4339ms | 1.9797ms | 505.1333 Ops/s | 499.2518 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8713ms | 0.8007ms | 1.2490 KOps/s | 1.2210 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4591ms | 0.4028ms | 2.4825 KOps/s | 2.4392 KOps/s | |
test_distributed | 2.3180ms | 0.2521ms | 3.9670 KOps/s | 8.7921 KOps/s | |
test_tdmodule | 45.4610μs | 12.2235μs | 81.8097 KOps/s | 62.6537 KOps/s | |
test_tdmodule_dispatch | 58.3210μs | 24.1694μs | 41.3746 KOps/s | 34.1465 KOps/s | |
test_tdseq | 48.2010μs | 13.5728μs | 73.6768 KOps/s | 60.0121 KOps/s | |
test_tdseq_dispatch | 37.2800μs | 26.6447μs | 37.5309 KOps/s | 30.3289 KOps/s | |
test_instantiation_functorch | 1.6260ms | 1.5288ms | 654.0921 Ops/s | 636.9413 Ops/s | |
test_exec_functorch | 0.2033ms | 0.1410ms | 7.0908 KOps/s | 7.0177 KOps/s | |
test_exec_functional_call | 0.3124ms | 0.1332ms | 7.5062 KOps/s | 7.5490 KOps/s | |
test_exec_td_decorator | 0.3639ms | 0.1780ms | 5.6187 KOps/s | 5.6499 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7872ms | 0.6380ms | 1.5673 KOps/s | 1.5226 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.7423ms | 0.6341ms | 1.5771 KOps/s | 1.5276 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.6847ms | 0.5611ms | 1.7823 KOps/s | 1.7572 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6717ms | 0.5599ms | 1.7860 KOps/s | 1.7488 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 18.6938ms | 18.4837ms | 54.1016 Ops/s | 53.9280 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 18.7404ms | 18.4951ms | 54.0685 Ops/s | 53.7961 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.4806ms | 18.4830ms | 54.1039 Ops/s | 54.2938 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 18.6041ms | 18.3481ms | 54.5016 Ops/s | 54.1154 Ops/s | |
test_to_module_speed[True] | 1.0587ms | 0.9337ms | 1.0710 KOps/s | 1.0498 KOps/s | |
test_to_module_speed[False] | 1.4088ms | 0.9119ms | 1.0966 KOps/s | 1.0804 KOps/s | |
test_tc_init | 79.3520μs | 32.8914μs | 30.4031 KOps/s | 26.7201 KOps/s | |
test_tc_init_nested | 0.1169ms | 66.9635μs | 14.9335 KOps/s | 13.5199 KOps/s | |
test_tc_first_layer_tensor | 5.1187μs | 0.6957μs | 1.4375 MOps/s | 1.4246 MOps/s | |
test_tc_first_layer_nontensor | 42.6210μs | 2.3115μs | 432.6257 KOps/s | 431.1321 KOps/s | |
test_tc_second_layer_tensor | 10.2127μs | 1.4157μs | 706.3678 KOps/s | 703.6134 KOps/s | |
test_tc_second_layer_nontensor | 31.3400μs | 3.0385μs | 329.1100 KOps/s | 324.0801 KOps/s | |
test_unbind | 0.2293s | 9.7439ms | 102.6279 Ops/s | 151.7345 Ops/s | |
test_full_like | 9.4096ms | 9.0696ms | 110.2587 Ops/s | 109.4241 Ops/s | |
test_zeros_like | 4.9245ms | 4.3115ms | 231.9385 Ops/s | 114.9307 Ops/s | |
test_ones_like | 4.9011ms | 4.1785ms | 239.3201 Ops/s | 236.0018 Ops/s | |
test_clone | 11.2713ms | 9.0542ms | 110.4463 Ops/s | 158.1907 Ops/s | |
test_squeeze | 58.9110μs | 8.9750μs | 111.4209 KOps/s | 107.9539 KOps/s | |
test_unsqueeze | 0.1193ms | 70.3898μs | 14.2066 KOps/s | 14.0336 KOps/s | |
test_split | 0.3984ms | 0.1624ms | 6.1559 KOps/s | 6.2101 KOps/s | |
test_permute | 0.2285ms | 0.1767ms | 5.6601 KOps/s | 5.2751 KOps/s | |
test_stack | 52.7711ms | 50.6783ms | 19.7323 Ops/s | 19.6503 Ops/s | |
test_cat | 50.6061ms | 50.1524ms | 19.9392 Ops/s | 19.8349 Ops/s |
vmoens
added a commit
that referenced
this pull request
Nov 7, 2024
ghstack-source-id: 16cedee8c0d38da6f377a262d5d7478a66fce07f Pull Request resolved: #1081
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
documentation
Improvements or additions to documentation
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):