-
Notifications
You must be signed in to change notification settings - Fork 78
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Ensure all dists work with DETERMINSTIC type without warning #1182
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Jan 14, 2025
ghstack-source-id: ff8028873d866116a0ba94fb1d64d2faa314f59d Pull Request resolved: #1182
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Jan 14, 2025
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 49.7330μs | 20.8105μs | 48.0526 KOps/s | 47.7702 KOps/s | |
test_plain_set_stack_nested | 71.1830μs | 20.9467μs | 47.7402 KOps/s | 46.6584 KOps/s | |
test_plain_set_nested_inplace | 70.2510μs | 22.6839μs | 44.0841 KOps/s | 44.1236 KOps/s | |
test_plain_set_stack_nested_inplace | 82.0030μs | 22.5425μs | 44.3606 KOps/s | 44.4539 KOps/s | |
test_items | 45.7460μs | 4.1362μs | 241.7655 KOps/s | 240.6384 KOps/s | |
test_items_nested | 0.7109ms | 0.3932ms | 2.5431 KOps/s | 2.5158 KOps/s | |
test_items_nested_locked | 0.8653ms | 0.3932ms | 2.5431 KOps/s | 2.5268 KOps/s | |
test_items_nested_leaf | 0.1499ms | 77.2845μs | 12.9392 KOps/s | 12.9125 KOps/s | |
test_items_stack_nested | 0.6066ms | 0.3967ms | 2.5206 KOps/s | 2.5022 KOps/s | |
test_items_stack_nested_leaf | 0.1411ms | 79.7770μs | 12.5349 KOps/s | 12.6201 KOps/s | |
test_items_stack_nested_locked | 0.7476ms | 0.3987ms | 2.5085 KOps/s | 2.4907 KOps/s | |
test_keys | 44.3020μs | 3.4534μs | 289.5672 KOps/s | 254.3927 KOps/s | |
test_keys_nested | 0.3052ms | 0.1614ms | 6.1956 KOps/s | 6.1176 KOps/s | |
test_keys_nested_locked | 1.8955ms | 0.1657ms | 6.0348 KOps/s | 5.8316 KOps/s | |
test_keys_nested_leaf | 0.2631ms | 0.1474ms | 6.7838 KOps/s | 7.0566 KOps/s | |
test_keys_stack_nested | 0.2373ms | 0.1612ms | 6.2035 KOps/s | 6.2747 KOps/s | |
test_keys_stack_nested_leaf | 0.1966ms | 0.1403ms | 7.1273 KOps/s | 7.3188 KOps/s | |
test_keys_stack_nested_locked | 0.3107ms | 0.1675ms | 5.9684 KOps/s | 6.0028 KOps/s | |
test_values | 8.5860μs | 1.0902μs | 917.2520 KOps/s | 955.8635 KOps/s | |
test_values_nested | 0.1358ms | 61.8106μs | 16.1784 KOps/s | 16.1637 KOps/s | |
test_values_nested_locked | 0.1107ms | 62.0724μs | 16.1102 KOps/s | 16.2013 KOps/s | |
test_values_nested_leaf | 0.1228ms | 71.1818μs | 14.0485 KOps/s | 13.5401 KOps/s | |
test_values_stack_nested | 0.1188ms | 62.7242μs | 15.9428 KOps/s | 15.6199 KOps/s | |
test_values_stack_nested_leaf | 0.1237ms | 70.7422μs | 14.1358 KOps/s | 14.4247 KOps/s | |
test_values_stack_nested_locked | 0.1112ms | 62.8995μs | 15.8984 KOps/s | 15.7185 KOps/s | |
test_membership | 16.2900μs | 0.8691μs | 1.1506 MOps/s | 1.1363 MOps/s | |
test_membership_nested | 47.7870μs | 2.8351μs | 352.7153 KOps/s | 350.3034 KOps/s | |
test_membership_nested_leaf | 26.3990μs | 2.8584μs | 349.8504 KOps/s | 345.5403 KOps/s | |
test_membership_stacked_nested | 24.4760μs | 2.8515μs | 350.6984 KOps/s | 353.8042 KOps/s | |
test_membership_stacked_nested_leaf | 29.6350μs | 2.8621μs | 349.3926 KOps/s | 331.3208 KOps/s | |
test_membership_nested_last | 23.8650μs | 4.2876μs | 233.2315 KOps/s | 236.2676 KOps/s | |
test_membership_nested_leaf_last | 47.5110μs | 4.2809μs | 233.5972 KOps/s | 234.7507 KOps/s | |
test_membership_stacked_nested_last | 37.8300μs | 4.2625μs | 234.6033 KOps/s | 234.5997 KOps/s | |
test_membership_stacked_nested_leaf_last | 48.4110μs | 4.2710μs | 234.1383 KOps/s | 236.6069 KOps/s | |
test_nested_getleaf | 38.3110μs | 10.5026μs | 95.2144 KOps/s | 93.7921 KOps/s | |
test_nested_get | 55.4930μs | 10.1384μs | 98.6352 KOps/s | 100.3198 KOps/s | |
test_stacked_getleaf | 39.1730μs | 10.5409μs | 94.8687 KOps/s | 97.7277 KOps/s | |
test_stacked_get | 41.2970μs | 10.0944μs | 99.0645 KOps/s | 102.8043 KOps/s | |
test_nested_getitemleaf | 78.9970μs | 10.8763μs | 91.9432 KOps/s | 89.6313 KOps/s | |
test_nested_getitem | 54.4310μs | 10.4947μs | 95.2859 KOps/s | 94.7270 KOps/s | |
test_stacked_getitemleaf | 38.0000μs | 10.9614μs | 91.2289 KOps/s | 91.4821 KOps/s | |
test_stacked_getitem | 36.5580μs | 10.4265μs | 95.9091 KOps/s | 95.3907 KOps/s | |
test_lock_nested | 5.2853ms | 0.4557ms | 2.1946 KOps/s | 1.8074 KOps/s | |
test_lock_stack_nested | 0.7647ms | 0.4229ms | 2.3649 KOps/s | 2.4135 KOps/s | |
test_unlock_nested | 0.7927ms | 0.3733ms | 2.6790 KOps/s | 2.6920 KOps/s | |
test_unlock_stack_nested | 0.6560ms | 0.3453ms | 2.8961 KOps/s | 3.0198 KOps/s | |
test_flatten_speed | 0.1999ms | 0.1011ms | 9.8873 KOps/s | 9.9247 KOps/s | |
test_unflatten_speed | 0.6190ms | 0.5147ms | 1.9430 KOps/s | 1.9201 KOps/s | |
test_common_ops | 4.0381ms | 0.8141ms | 1.2284 KOps/s | 1.2432 KOps/s | |
test_creation | 33.1010μs | 2.4436μs | 409.2311 KOps/s | 399.7752 KOps/s | |
test_creation_empty | 32.9710μs | 12.3098μs | 81.2362 KOps/s | 80.8724 KOps/s | |
test_creation_nested_1 | 38.7120μs | 15.2735μs | 65.4728 KOps/s | 65.3361 KOps/s | |
test_creation_nested_2 | 52.9190μs | 19.5183μs | 51.2339 KOps/s | 50.3801 KOps/s | |
test_clone | 72.5360μs | 13.4130μs | 74.5548 KOps/s | 73.2991 KOps/s | |
test_getitem[int] | 1.3341ms | 13.1015μs | 76.3271 KOps/s | 78.3977 KOps/s | |
test_getitem[slice_int] | 0.1373ms | 24.5234μs | 40.7773 KOps/s | 39.5636 KOps/s | |
test_getitem[range] | 0.2531ms | 49.0990μs | 20.3670 KOps/s | 20.4811 KOps/s | |
test_getitem[tuple] | 0.1330ms | 20.4514μs | 48.8964 KOps/s | 47.9724 KOps/s | |
test_getitem[list] | 0.3056ms | 44.5911μs | 22.4260 KOps/s | 22.9823 KOps/s | |
test_setitem_dim[int] | 66.0230μs | 25.3014μs | 39.5235 KOps/s | 37.9423 KOps/s | |
test_setitem_dim[slice_int] | 0.1005ms | 51.4352μs | 19.4419 KOps/s | 19.4130 KOps/s | |
test_setitem_dim[range] | 0.1513ms | 74.1387μs | 13.4882 KOps/s | 13.5911 KOps/s | |
test_setitem_dim[tuple] | 71.3840μs | 39.9368μs | 25.0396 KOps/s | 24.2023 KOps/s | |
test_setitem | 94.5070μs | 20.8271μs | 48.0145 KOps/s | 46.9752 KOps/s | |
test_set | 92.3420μs | 20.4201μs | 48.9713 KOps/s | 47.9434 KOps/s | |
test_set_shared | 1.2758ms | 0.1707ms | 5.8582 KOps/s | 5.9022 KOps/s | |
test_update | 0.1903ms | 23.5628μs | 42.4398 KOps/s | 41.9982 KOps/s | |
test_update_nested | 0.1604ms | 33.6614μs | 29.7076 KOps/s | 28.9951 KOps/s | |
test_update__nested | 0.6060ms | 33.6728μs | 29.6976 KOps/s | 29.0418 KOps/s | |
test_set_nested | 78.6570μs | 22.0999μs | 45.2490 KOps/s | 43.9688 KOps/s | |
test_set_nested_new | 0.1042ms | 26.4559μs | 37.7987 KOps/s | 36.4058 KOps/s | |
test_select | 0.1552ms | 42.6662μs | 23.4378 KOps/s | 22.6941 KOps/s | |
test_select_nested | 0.1222ms | 62.5495μs | 15.9873 KOps/s | 15.7702 KOps/s | |
test_exclude_nested | 0.1585ms | 79.4030μs | 12.5940 KOps/s | 12.1799 KOps/s | |
test_empty[True] | 1.2836ms | 0.4130ms | 2.4215 KOps/s | 2.4792 KOps/s | |
test_empty[False] | 10.1843μs | 1.3465μs | 742.6653 KOps/s | 726.6446 KOps/s | |
test_unbind_speed | 0.3798ms | 0.2695ms | 3.7102 KOps/s | 3.6760 KOps/s | |
test_unbind_speed_stack0 | 0.4754ms | 0.2665ms | 3.7517 KOps/s | 3.8602 KOps/s | |
test_unbind_speed_stack1 | 0.1061s | 0.8064ms | 1.2401 KOps/s | 1.4173 KOps/s | |
test_split | 0.1118s | 1.8021ms | 554.9034 Ops/s | 559.2171 Ops/s | |
test_chunk | 0.1130s | 1.8273ms | 547.2566 Ops/s | 561.6273 Ops/s | |
test_consolidate_njt[False-None] | 8.6260ms | 8.1633ms | 122.4996 Ops/s | 123.0964 Ops/s | |
test_creation[device0] | 0.3080ms | 90.9918μs | 10.9900 KOps/s | 11.2219 KOps/s | |
test_creation_from_tensor | 3.7096ms | 96.1988μs | 10.3951 KOps/s | 10.5828 KOps/s | |
test_add_one[memmap_tensor0] | 0.1620ms | 5.1637μs | 193.6597 KOps/s | 201.5399 KOps/s | |
test_contiguous[memmap_tensor0] | 14.2570μs | 0.5416μs | 1.8464 MOps/s | 1.8908 MOps/s | |
test_stack[memmap_tensor0] | 48.4900μs | 3.6521μs | 273.8138 KOps/s | 297.0321 KOps/s | |
test_memmaptd_index | 0.9983ms | 0.2411ms | 4.1468 KOps/s | 4.2456 KOps/s | |
test_memmaptd_index_astensor | 0.6913ms | 0.3281ms | 3.0474 KOps/s | 3.0927 KOps/s | |
test_memmaptd_index_op | 0.9926ms | 0.6058ms | 1.6507 KOps/s | 1.6893 KOps/s | |
test_serialize_model | 0.1235s | 0.1154s | 8.6691 Ops/s | 8.6140 Ops/s | |
test_serialize_model_pickle | 0.4856s | 0.3964s | 2.5229 Ops/s | 2.5177 Ops/s | |
test_serialize_weights | 0.2187s | 0.1290s | 7.7503 Ops/s | 8.5794 Ops/s | |
test_serialize_weights_returnearly | 0.1792s | 0.1634s | 6.1209 Ops/s | 6.3368 Ops/s | |
test_serialize_weights_pickle | 0.5438s | 0.4420s | 2.2626 Ops/s | 2.4678 Ops/s | |
test_serialize_weights_filesystem | 0.1443s | 0.1389s | 7.2007 Ops/s | 7.0146 Ops/s | |
test_serialize_model_filesystem | 0.1657s | 0.1508s | 6.6322 Ops/s | 6.5081 Ops/s | |
test_reshape_pytree | 68.9690μs | 26.1336μs | 38.2650 KOps/s | 37.8850 KOps/s | |
test_reshape_td | 66.8750μs | 33.6120μs | 29.7513 KOps/s | 30.1513 KOps/s | |
test_view_pytree | 67.2350μs | 26.5728μs | 37.6325 KOps/s | 38.3063 KOps/s | |
test_view_td | 0.1069ms | 38.9059μs | 25.7031 KOps/s | 26.3102 KOps/s | |
test_unbind_pytree | 64.0290μs | 29.7995μs | 33.5576 KOps/s | 33.0035 KOps/s | |
test_unbind_td | 0.3369ms | 39.7641μs | 25.1483 KOps/s | 25.2542 KOps/s | |
test_split_pytree | 88.7760μs | 29.2467μs | 34.1919 KOps/s | 33.8873 KOps/s | |
test_split_td | 0.1020s | 57.2010μs | 17.4822 KOps/s | 22.0142 KOps/s | |
test_add_pytree | 74.3900μs | 34.9598μs | 28.6043 KOps/s | 28.2405 KOps/s | |
test_add_td | 0.3381ms | 58.1187μs | 17.2062 KOps/s | 17.4836 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1516ms | 63.2179μs | 15.8183 KOps/s | 15.7081 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3718ms | 0.1732ms | 5.7749 KOps/s | 5.7749 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1197ms | 45.8215μs | 21.8238 KOps/s | 22.1059 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.1968ms | 0.1186ms | 8.4322 KOps/s | 8.4018 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 64.4700μs | 27.5308μs | 36.3230 KOps/s | 38.5896 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1158ms | 58.0168μs | 17.2364 KOps/s | 16.6038 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1667ms | 77.5166μs | 12.9005 KOps/s | 13.0162 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1787ms | 65.9070μs | 15.1729 KOps/s | 15.1172 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1837ms | 0.1061ms | 9.4293 KOps/s | 9.4945 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4273ms | 0.2172ms | 4.6043 KOps/s | 4.6592 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 90.6390μs | 46.4299μs | 21.5378 KOps/s | 20.9366 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4856ms | 67.8004μs | 14.7492 KOps/s | 14.8002 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1943ms | 0.1026ms | 9.7424 KOps/s | 9.8830 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.4041ms | 0.2045ms | 4.8899 KOps/s | 4.9549 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3593ms | 0.2336ms | 4.2816 KOps/s | 4.2667 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1787ms | 0.1042ms | 9.5925 KOps/s | 9.5126 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1527ms | 64.4330μs | 15.5200 KOps/s | 15.7568 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1271ms | 50.5829μs | 19.7695 KOps/s | 21.3749 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.2915ms | 0.1586ms | 6.3062 KOps/s | 6.3989 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1842ms | 0.1038ms | 9.6346 KOps/s | 9.9352 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 98.0230μs | 21.9500μs | 45.5581 KOps/s | 45.8495 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1346ms | 66.8765μs | 14.9529 KOps/s | 14.6211 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1527ms | 78.1710μs | 12.7925 KOps/s | 12.8563 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1348ms | 66.0590μs | 15.1380 KOps/s | 15.1138 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.2756ms | 0.2044ms | 4.8915 KOps/s | 4.7981 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.2526ms | 1.3701ms | 729.8888 Ops/s | 748.6093 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2782ms | 0.2005ms | 4.9866 KOps/s | 4.9164 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3076ms | 0.8015ms | 1.2476 KOps/s | 1.2893 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.7976ms | 0.4495ms | 2.2247 KOps/s | 2.2323 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.7106ms | 2.7368ms | 365.3878 Ops/s | 276.8042 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 99.2660μs | 36.1140μs | 27.6901 KOps/s | 25.7592 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5094ms | 33.3250μs | 30.0075 KOps/s | 29.4818 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 75.2900μs | 29.0649μs | 34.4058 KOps/s | 33.6024 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 84.9980μs | 23.0078μs | 43.4636 KOps/s | 42.4209 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 83.2750μs | 30.1379μs | 33.1808 KOps/s | 33.2051 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 78.5470μs | 22.9339μs | 43.6036 KOps/s | 43.5311 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1028ms | 53.1193μs | 18.8255 KOps/s | 18.7972 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.6199ms | 20.3848μs | 49.0561 KOps/s | 47.7578 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 98.6240μs | 45.7455μs | 21.8601 KOps/s | 22.5628 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 65.8130μs | 18.7875μs | 53.2270 KOps/s | 53.7717 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1035ms | 46.4533μs | 21.5270 KOps/s | 22.2218 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 48.5900μs | 18.7837μs | 53.2377 KOps/s | 53.4278 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1419ms | 54.6688μs | 18.2920 KOps/s | 18.4038 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 1.0606ms | 20.4524μs | 48.8940 KOps/s | 48.8532 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1022ms | 45.6985μs | 21.8826 KOps/s | 22.3622 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 53.6300μs | 18.6364μs | 53.6585 KOps/s | 51.9678 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1054ms | 45.9891μs | 21.7443 KOps/s | 22.2421 KOps/s | |
test_compile_indexing[int-pytree-eager] | 61.4150μs | 18.8084μs | 53.1677 KOps/s | 54.0100 KOps/s | |
test_mod_add[eager] | 90.6400μs | 34.5178μs | 28.9705 KOps/s | 28.9488 KOps/s | |
test_mod_add[compile] | 0.1093ms | 48.6412μs | 20.5587 KOps/s | 20.6078 KOps/s | |
test_mod_add[compile-overhead] | 0.1026ms | 47.5864μs | 21.0144 KOps/s | 19.9927 KOps/s | |
test_mod_wrap[eager] | 0.3341ms | 0.2218ms | 4.5084 KOps/s | 4.3256 KOps/s | |
test_mod_wrap[compile] | 0.3141ms | 0.2070ms | 4.8316 KOps/s | 4.8409 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4440ms | 0.2060ms | 4.8535 KOps/s | 4.8151 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.4025ms | 10.8510ms | 92.1575 Ops/s | 85.7004 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.9639ms | 11.2601ms | 88.8093 Ops/s | 82.0373 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.3284ms | 10.8552ms | 92.1220 Ops/s | 90.8955 Ops/s | |
test_seq_add[eager] | 0.1993ms | 0.1164ms | 8.5914 KOps/s | 8.5513 KOps/s | |
test_seq_add[compile] | 0.1193ms | 65.9713μs | 15.1581 KOps/s | 15.4088 KOps/s | |
test_seq_add[compile-overhead] | 0.1414ms | 62.5103μs | 15.9974 KOps/s | 15.7535 KOps/s | |
test_seq_wrap[eager] | 0.8289ms | 0.4411ms | 2.2671 KOps/s | 2.2089 KOps/s | |
test_seq_wrap[compile] | 0.3905ms | 0.2290ms | 4.3672 KOps/s | 4.2836 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4227ms | 0.2293ms | 4.3620 KOps/s | 4.4044 KOps/s | |
test_func_call_runtime[False-eager] | 0.8244ms | 0.5374ms | 1.8606 KOps/s | 1.8802 KOps/s | |
test_func_call_runtime[False-compile] | 0.6264ms | 0.4346ms | 2.3007 KOps/s | 2.3560 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5799ms | 0.4316ms | 2.3172 KOps/s | 2.3429 KOps/s | |
test_func_call_runtime[True-eager] | 1.1125ms | 0.7490ms | 1.3351 KOps/s | 1.3218 KOps/s | |
test_func_call_runtime[True-compile] | 0.8194ms | 0.4759ms | 2.1011 KOps/s | 2.1456 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6185ms | 0.4771ms | 2.0960 KOps/s | 2.1380 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9392ms | 0.5367ms | 1.8631 KOps/s | 1.8515 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8217ms | 0.4318ms | 2.3158 KOps/s | 2.3602 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.9381ms | 0.4348ms | 2.2999 KOps/s | 2.3372 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4966ms | 0.9018ms | 1.1089 KOps/s | 1.1008 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.6603ms | 0.4988ms | 2.0048 KOps/s | 2.0245 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.9056ms | 0.5097ms | 1.9619 KOps/s | 2.0203 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.3287ms | 1.9065ms | 524.5246 Ops/s | 515.1495 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8781ms | 0.5337ms | 1.8736 KOps/s | 1.9142 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.7070ms | 0.5315ms | 1.8816 KOps/s | 1.9077 KOps/s | |
test_distributed | 0.2861ms | 0.1254ms | 7.9749 KOps/s | 7.7465 KOps/s | |
test_tdmodule | 67.3960μs | 26.2711μs | 38.0646 KOps/s | 37.8720 KOps/s | |
test_tdmodule_dispatch | 83.8870μs | 48.4150μs | 20.6548 KOps/s | 20.4409 KOps/s | |
test_tdseq | 58.9400μs | 29.3392μs | 34.0841 KOps/s | 33.9893 KOps/s | |
test_tdseq_dispatch | 0.1028ms | 53.5793μs | 18.6639 KOps/s | 18.6660 KOps/s | |
test_instantiation_functorch | 1.9362ms | 1.5263ms | 655.1930 Ops/s | 651.7040 Ops/s | |
test_exec_functorch | 0.3349ms | 0.1771ms | 5.6470 KOps/s | 5.6447 KOps/s | |
test_exec_functional_call | 0.4188ms | 0.1721ms | 5.8100 KOps/s | 5.8903 KOps/s | |
test_exec_td_decorator | 0.4669ms | 0.2250ms | 4.4442 KOps/s | 4.3231 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8840ms | 0.6560ms | 1.5243 KOps/s | 1.5307 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.1651ms | 0.6599ms | 1.5153 KOps/s | 1.5298 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8833ms | 0.5275ms | 1.8956 KOps/s | 1.8938 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9680ms | 0.5325ms | 1.8778 KOps/s | 1.8959 KOps/s | |
test_to_module_speed[True] | 2.0995ms | 1.3362ms | 748.4002 Ops/s | 748.2350 Ops/s | |
test_to_module_speed[False] | 1.8729ms | 1.2815ms | 780.3257 Ops/s | 764.5308 Ops/s | |
test_tc_init | 99.5460μs | 45.5385μs | 21.9594 KOps/s | 21.6985 KOps/s | |
test_tc_init_nested | 0.2011ms | 90.9191μs | 10.9988 KOps/s | 10.6704 KOps/s | |
test_tc_first_layer_tensor | 16.1700μs | 1.5122μs | 661.2980 KOps/s | 630.5702 KOps/s | |
test_tc_first_layer_nontensor | 25.7480μs | 4.6457μs | 215.2524 KOps/s | 208.2209 KOps/s | |
test_tc_second_layer_tensor | 38.4120μs | 2.7959μs | 357.6620 KOps/s | 339.1820 KOps/s | |
test_tc_second_layer_nontensor | 26.8600μs | 5.9594μs | 167.8025 KOps/s | 160.5988 KOps/s | |
test_unbind | 0.2377s | 13.4219ms | 74.5052 Ops/s | 77.4353 Ops/s | |
test_full_like | 8.8634ms | 7.5293ms | 132.8149 Ops/s | 80.6383 Ops/s | |
test_zeros_like | 3.3327ms | 2.8845ms | 346.6766 Ops/s | 141.0201 Ops/s | |
test_ones_like | 4.0809ms | 3.4856ms | 286.8965 Ops/s | 128.9975 Ops/s | |
test_clone | 6.4796ms | 5.4119ms | 184.7795 Ops/s | 105.6547 Ops/s | |
test_squeeze | 62.4960μs | 12.1489μs | 82.3119 KOps/s | 80.6941 KOps/s | |
test_unsqueeze | 0.3172ms | 92.4545μs | 10.8161 KOps/s | 10.3921 KOps/s | |
test_split | 0.3261ms | 0.1919ms | 5.2116 KOps/s | 5.0228 KOps/s | |
test_permute | 0.3127ms | 0.1993ms | 5.0178 KOps/s | 4.8397 KOps/s | |
test_stack | 31.3698ms | 25.8905ms | 38.6241 Ops/s | 38.9169 Ops/s | |
test_cat | 34.0779ms | 25.9056ms | 38.6017 Ops/s | 40.4320 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 25.3700μs | 11.2249μs | 89.0875 KOps/s | 75.9002 KOps/s | |
test_plain_set_stack_nested | 0.4077ms | 11.4478μs | 87.3530 KOps/s | 74.7761 KOps/s | |
test_plain_set_nested_inplace | 39.4200μs | 12.4228μs | 80.4970 KOps/s | 69.8819 KOps/s | |
test_plain_set_stack_nested_inplace | 0.4293ms | 12.3987μs | 80.6537 KOps/s | 69.1866 KOps/s | |
test_items | 0.4026ms | 2.9113μs | 343.4857 KOps/s | 341.3482 KOps/s | |
test_items_nested | 0.4291ms | 0.3618ms | 2.7643 KOps/s | 2.8068 KOps/s | |
test_items_nested_locked | 0.7649ms | 0.3600ms | 2.7775 KOps/s | 2.7914 KOps/s | |
test_items_nested_leaf | 0.4724ms | 57.9761μs | 17.2485 KOps/s | 17.2981 KOps/s | |
test_items_stack_nested | 0.7727ms | 0.3586ms | 2.7890 KOps/s | 2.7918 KOps/s | |
test_items_stack_nested_leaf | 0.4670ms | 58.1345μs | 17.2015 KOps/s | 17.0443 KOps/s | |
test_items_stack_nested_locked | 0.7787ms | 0.3575ms | 2.7970 KOps/s | 2.7669 KOps/s | |
test_keys | 34.5000μs | 3.4207μs | 292.3393 KOps/s | 290.6686 KOps/s | |
test_keys_nested | 0.4874ms | 89.0002μs | 11.2359 KOps/s | 11.3211 KOps/s | |
test_keys_nested_locked | 0.7659ms | 94.4268μs | 10.5902 KOps/s | 10.6713 KOps/s | |
test_keys_nested_leaf | 0.4779ms | 79.5207μs | 12.5753 KOps/s | 12.6183 KOps/s | |
test_keys_stack_nested | 0.4893ms | 88.9386μs | 11.2437 KOps/s | 11.2776 KOps/s | |
test_keys_stack_nested_leaf | 0.4730ms | 79.4379μs | 12.5884 KOps/s | 12.4550 KOps/s | |
test_keys_stack_nested_locked | 0.4969ms | 95.1667μs | 10.5079 KOps/s | 10.5530 KOps/s | |
test_values | 67.3343μs | 0.8720μs | 1.1468 MOps/s | 1.1749 MOps/s | |
test_values_nested | 66.3310μs | 38.0619μs | 26.2730 KOps/s | 26.4370 KOps/s | |
test_values_nested_locked | 0.4363ms | 39.2039μs | 25.5076 KOps/s | 25.4258 KOps/s | |
test_values_nested_leaf | 0.4405ms | 42.3604μs | 23.6070 KOps/s | 23.6302 KOps/s | |
test_values_stack_nested | 0.4403ms | 37.9768μs | 26.3319 KOps/s | 26.1810 KOps/s | |
test_values_stack_nested_leaf | 0.4463ms | 42.9337μs | 23.2917 KOps/s | 23.3687 KOps/s | |
test_values_stack_nested_locked | 0.4500ms | 39.7055μs | 25.1854 KOps/s | 25.0091 KOps/s | |
test_membership | 20.2288μs | 0.5139μs | 1.9460 MOps/s | 2.0115 MOps/s | |
test_membership_nested | 17.0200μs | 1.9601μs | 510.1715 KOps/s | 511.4579 KOps/s | |
test_membership_nested_leaf | 20.5000μs | 1.9660μs | 508.6566 KOps/s | 520.2370 KOps/s | |
test_membership_stacked_nested | 37.2510μs | 2.0612μs | 485.1549 KOps/s | 495.4854 KOps/s | |
test_membership_stacked_nested_leaf | 0.4166ms | 2.0696μs | 483.1761 KOps/s | 481.5274 KOps/s | |
test_membership_nested_last | 27.6710μs | 3.0686μs | 325.8848 KOps/s | 329.7943 KOps/s | |
test_membership_nested_leaf_last | 0.4048ms | 3.0583μs | 326.9740 KOps/s | 328.3270 KOps/s | |
test_membership_stacked_nested_last | 36.2600μs | 3.0512μs | 327.7430 KOps/s | 327.1713 KOps/s | |
test_membership_stacked_nested_leaf_last | 0.4015ms | 3.0254μs | 330.5322 KOps/s | 330.0406 KOps/s | |
test_nested_getleaf | 45.5710μs | 6.0011μs | 166.6364 KOps/s | 164.5054 KOps/s | |
test_nested_get | 51.7610μs | 5.8339μs | 171.4108 KOps/s | 172.9944 KOps/s | |
test_stacked_getleaf | 39.1610μs | 6.1183μs | 163.4440 KOps/s | 163.2870 KOps/s | |
test_stacked_get | 60.4610μs | 5.7121μs | 175.0665 KOps/s | 173.0155 KOps/s | |
test_nested_getitemleaf | 44.3410μs | 6.4497μs | 155.0469 KOps/s | 157.0839 KOps/s | |
test_nested_getitem | 0.3946ms | 6.0653μs | 164.8736 KOps/s | 164.3385 KOps/s | |
test_stacked_getitemleaf | 0.4044ms | 6.3855μs | 156.6038 KOps/s | 155.3550 KOps/s | |
test_stacked_getitem | 30.4910μs | 5.9488μs | 168.0999 KOps/s | 162.9069 KOps/s | |
test_lock_nested | 9.4806ms | 0.3780ms | 2.6455 KOps/s | 2.6323 KOps/s | |
test_lock_stack_nested | 0.4516ms | 0.3378ms | 2.9606 KOps/s | 2.9019 KOps/s | |
test_unlock_nested | 0.6230ms | 0.3095ms | 3.2307 KOps/s | 3.2101 KOps/s | |
test_unlock_stack_nested | 0.3156ms | 0.2772ms | 3.6078 KOps/s | 3.5538 KOps/s | |
test_flatten_speed | 0.1240ms | 74.7878μs | 13.3712 KOps/s | 13.3655 KOps/s | |
test_unflatten_speed | 0.4368ms | 0.3149ms | 3.1755 KOps/s | 3.1811 KOps/s | |
test_common_ops | 1.6468ms | 0.5639ms | 1.7734 KOps/s | 1.5602 KOps/s | |
test_creation | 0.1091ms | 1.7262μs | 579.3187 KOps/s | 574.7610 KOps/s | |
test_creation_empty | 29.3600μs | 6.3145μs | 158.3657 KOps/s | 98.6531 KOps/s | |
test_creation_nested_1 | 31.8310μs | 7.9753μs | 125.3871 KOps/s | 84.7186 KOps/s | |
test_creation_nested_2 | 39.4310μs | 10.5178μs | 95.0769 KOps/s | 69.1398 KOps/s | |
test_clone | 78.6320μs | 10.2966μs | 97.1191 KOps/s | 95.1704 KOps/s | |
test_getitem[int] | 1.7525ms | 10.4154μs | 96.0121 KOps/s | 94.5602 KOps/s | |
test_getitem[slice_int] | 0.1113ms | 20.1600μs | 49.6031 KOps/s | 49.8557 KOps/s | |
test_getitem[range] | 0.1294ms | 35.3945μs | 28.2530 KOps/s | 28.0091 KOps/s | |
test_getitem[tuple] | 0.1091ms | 17.7938μs | 56.1995 KOps/s | 56.3563 KOps/s | |
test_getitem[list] | 0.2356ms | 31.4393μs | 31.8073 KOps/s | 30.9168 KOps/s | |
test_setitem_dim[int] | 40.7800μs | 18.8518μs | 53.0454 KOps/s | 53.7195 KOps/s | |
test_setitem_dim[slice_int] | 0.1556ms | 37.6611μs | 26.5526 KOps/s | 26.9558 KOps/s | |
test_setitem_dim[range] | 74.4810μs | 51.2548μs | 19.5104 KOps/s | 19.4689 KOps/s | |
test_setitem_dim[tuple] | 52.7710μs | 30.7481μs | 32.5223 KOps/s | 32.9199 KOps/s | |
test_setitem | 88.1610μs | 13.6192μs | 73.4259 KOps/s | 62.3792 KOps/s | |
test_set | 95.4120μs | 13.1904μs | 75.8127 KOps/s | 64.4005 KOps/s | |
test_set_shared | 1.5200ms | 0.1501ms | 6.6627 KOps/s | 6.6530 KOps/s | |
test_update | 0.3640ms | 14.8810μs | 67.1998 KOps/s | 51.5770 KOps/s | |
test_update_nested | 91.8020μs | 20.0382μs | 49.9047 KOps/s | 39.8128 KOps/s | |
test_update__nested | 1.0424ms | 25.3175μs | 39.4984 KOps/s | 39.9973 KOps/s | |
test_set_nested | 95.5910μs | 14.3034μs | 69.9132 KOps/s | 59.8384 KOps/s | |
test_set_nested_new | 0.1031ms | 16.8510μs | 59.3437 KOps/s | 51.9967 KOps/s | |
test_select | 0.1031ms | 28.4268μs | 35.1781 KOps/s | 32.3066 KOps/s | |
test_select_nested | 93.8020μs | 42.5086μs | 23.5247 KOps/s | 22.6740 KOps/s | |
test_exclude_nested | 98.9610μs | 61.2569μs | 16.3247 KOps/s | 16.1569 KOps/s | |
test_empty[True] | 0.7182ms | 0.2942ms | 3.3995 KOps/s | 3.4173 KOps/s | |
test_empty[False] | 3.4410μs | 0.8252μs | 1.2118 MOps/s | 1.2129 MOps/s | |
test_to | 85.7510μs | 54.8466μs | 18.2327 KOps/s | 17.0214 KOps/s | |
test_to_nonblocking | 0.1952ms | 46.3932μs | 21.5549 KOps/s | 21.4370 KOps/s | |
test_unbind_speed | 0.8328ms | 0.2342ms | 4.2707 KOps/s | 4.2174 KOps/s | |
test_unbind_speed_stack0 | 0.2918ms | 0.2319ms | 4.3123 KOps/s | 4.2719 KOps/s | |
test_unbind_speed_stack1 | 94.3248ms | 0.6691ms | 1.4946 KOps/s | 1.4889 KOps/s | |
test_split | 96.6717ms | 1.6988ms | 588.6498 Ops/s | 635.2133 Ops/s | |
test_chunk | 1.5317ms | 1.4190ms | 704.7175 Ops/s | 582.6259 Ops/s | |
test_consolidate[False-None] | 99.7775ms | 2.8710ms | 348.3139 Ops/s | 380.1582 Ops/s | |
test_consolidate[default-None] | 1.7093ms | 1.6342ms | 611.9359 Ops/s | 610.1202 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.7681ms | 1.6760ms | 596.6525 Ops/s | 596.5023 Ops/s | |
test_consolidate_njt[False-None] | 6.8008ms | 6.2438ms | 160.1595 Ops/s | 157.5777 Ops/s | |
test_to[False-False-None] | 1.8732ms | 1.6962ms | 589.5660 Ops/s | 594.5990 Ops/s | |
test_to[True-False-None] | 1.7567ms | 1.2896ms | 775.4274 Ops/s | 789.5375 Ops/s | |
test_to[within-False-None] | 4.2970ms | 3.9771ms | 251.4415 Ops/s | 246.1598 Ops/s | |
test_to[True-default-None] | 5.4769ms | 5.1511ms | 194.1334 Ops/s | 194.0557 Ops/s | |
test_to_njt[False-False-None] | 6.8022ms | 6.7280ms | 148.6336 Ops/s | 146.8174 Ops/s | |
test_to_njt[True-False-None] | 5.5193ms | 5.2622ms | 190.0356 Ops/s | 188.6348 Ops/s | |
test_to_njt[within-False-None] | 12.1520ms | 11.6356ms | 85.9433 Ops/s | 83.3940 Ops/s | |
test_creation[device0] | 0.6167ms | 81.9100μs | 12.2085 KOps/s | 12.2935 KOps/s | |
test_creation_from_tensor | 0.5064ms | 83.7087μs | 11.9462 KOps/s | 12.0569 KOps/s | |
test_add_one[memmap_tensor0] | 0.4099ms | 6.5701μs | 152.2050 KOps/s | 154.4669 KOps/s | |
test_contiguous[memmap_tensor0] | 1.9305μs | 0.3973μs | 2.5172 MOps/s | 2.4705 MOps/s | |
test_stack[memmap_tensor0] | 22.7510μs | 4.1533μs | 240.7745 KOps/s | 239.1793 KOps/s | |
test_memmaptd_index | 1.7298ms | 0.2461ms | 4.0633 KOps/s | 4.1012 KOps/s | |
test_memmaptd_index_astensor | 0.8251ms | 0.3075ms | 3.2518 KOps/s | 3.2974 KOps/s | |
test_memmaptd_index_op | 0.9693ms | 0.5347ms | 1.8701 KOps/s | 1.6873 KOps/s | |
test_serialize_model | 0.1315s | 0.1308s | 7.6437 Ops/s | 7.6190 Ops/s | |
test_serialize_model_pickle | 1.3601s | 1.2179s | 0.8211 Ops/s | 0.8251 Ops/s | |
test_serialize_weights | 0.1314s | 0.1305s | 7.6604 Ops/s | 7.6895 Ops/s | |
test_serialize_weights_returnearly | 0.3324s | 53.3915ms | 18.7296 Ops/s | 15.7178 Ops/s | |
test_serialize_weights_pickle | 1.3731s | 1.2181s | 0.8210 Ops/s | 0.8231 Ops/s | |
test_reshape_pytree | 54.9910μs | 21.7697μs | 45.9354 KOps/s | 45.4610 KOps/s | |
test_reshape_td | 67.0910μs | 26.2266μs | 38.1292 KOps/s | 36.1645 KOps/s | |
test_view_pytree | 61.7610μs | 21.6052μs | 46.2852 KOps/s | 46.7027 KOps/s | |
test_view_td | 0.1405ms | 29.6005μs | 33.7832 KOps/s | 32.0473 KOps/s | |
test_unbind_pytree | 63.3910μs | 27.4238μs | 36.4647 KOps/s | 36.1925 KOps/s | |
test_unbind_td | 0.7646ms | 35.4841μs | 28.1817 KOps/s | 26.2170 KOps/s | |
test_split_pytree | 74.5610μs | 29.7783μs | 33.5815 KOps/s | 34.0436 KOps/s | |
test_split_td | 0.9668ms | 37.6509μs | 26.5598 KOps/s | 26.7961 KOps/s | |
test_add_pytree | 75.1110μs | 33.4730μs | 29.8748 KOps/s | 30.2324 KOps/s | |
test_add_td | 0.4435ms | 43.6074μs | 22.9319 KOps/s | 20.2545 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1800ms | 0.1247ms | 8.0216 KOps/s | 8.1753 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2262ms | 0.1315ms | 7.6030 KOps/s | 7.6478 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1391ms | 93.7941μs | 10.6616 KOps/s | 10.5652 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2733ms | 0.1485ms | 6.7362 KOps/s | 6.8535 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 97.5820μs | 21.8380μs | 45.7917 KOps/s | 45.4833 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 70.9510μs | 28.7526μs | 34.7795 KOps/s | 33.8247 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.4813ms | 63.1677μs | 15.8309 KOps/s | 15.6077 KOps/s | |
test_compile_copy_nested[pytree-eager] | 96.0910μs | 49.1460μs | 20.3475 KOps/s | 20.3020 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1928ms | 0.1388ms | 7.2029 KOps/s | 7.0113 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.6495ms | 0.2176ms | 4.5963 KOps/s | 4.6157 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1587ms | 0.1013ms | 9.8749 KOps/s | 10.3995 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1369ms | 56.6995μs | 17.6368 KOps/s | 18.3783 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2443ms | 0.1333ms | 7.4996 KOps/s | 7.4279 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5964ms | 0.4711ms | 2.1228 KOps/s | 2.1598 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4082ms | 0.2630ms | 3.8030 KOps/s | 3.8135 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2054ms | 0.1483ms | 6.7422 KOps/s | 7.0437 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2070ms | 68.0047μs | 14.7049 KOps/s | 14.7840 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1574ms | 98.2882μs | 10.1742 KOps/s | 10.2789 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5655ms | 0.4016ms | 2.4903 KOps/s | 2.5362 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1926ms | 0.1338ms | 7.4760 KOps/s | 7.5024 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 82.6820μs | 18.8044μs | 53.1791 KOps/s | 56.8736 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 74.2510μs | 30.9811μs | 32.2777 KOps/s | 31.5116 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.2033ms | 69.0521μs | 14.4818 KOps/s | 14.4642 KOps/s | |
test_compile_copy_flat[pytree-eager] | 94.4910μs | 50.9989μs | 19.6083 KOps/s | 19.6018 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.5752ms | 0.3797ms | 2.6337 KOps/s | 2.2670 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7215ms | 2.6048ms | 383.9085 Ops/s | 393.5432 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5483ms | 0.4219ms | 2.3705 KOps/s | 2.2875 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7288ms | 2.5707ms | 389.0025 Ops/s | 395.8501 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.5945ms | 0.1122ms | 8.9134 KOps/s | 8.9962 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5536ms | 76.1705μs | 13.1284 KOps/s | 12.1970 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.5052ms | 0.1080ms | 9.2557 KOps/s | 9.2997 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1257ms | 65.9402μs | 15.1653 KOps/s | 14.4496 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1662ms | 0.1078ms | 9.2790 KOps/s | 9.5750 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1848ms | 70.1893μs | 14.2472 KOps/s | 15.3147 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1637ms | 0.1030ms | 9.7065 KOps/s | 10.2110 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1418ms | 16.7187μs | 59.8134 KOps/s | 59.0632 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1376ms | 98.6446μs | 10.1374 KOps/s | 10.5915 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 99.6720μs | 15.4475μs | 64.7354 KOps/s | 59.4909 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.5134ms | 93.9089μs | 10.6486 KOps/s | 10.5371 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.4033ms | 15.5857μs | 64.1613 KOps/s | 64.8637 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.5042ms | 98.3138μs | 10.1715 KOps/s | 10.1401 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6567ms | 16.2725μs | 61.4532 KOps/s | 61.3075 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.5043ms | 93.9307μs | 10.6461 KOps/s | 10.5252 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 76.0110μs | 15.4304μs | 64.8070 KOps/s | 65.2716 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.5421ms | 93.6742μs | 10.6753 KOps/s | 10.4833 KOps/s | |
test_compile_indexing[int-pytree-eager] | 0.4186ms | 15.5464μs | 64.3235 KOps/s | 64.6702 KOps/s | |
test_mod_add[eager] | 0.4705ms | 35.4487μs | 28.2098 KOps/s | 26.0567 KOps/s | |
test_mod_add[compile] | 0.3533ms | 77.8096μs | 12.8519 KOps/s | 12.5972 KOps/s | |
test_mod_add[compile-overhead] | 0.3236ms | 0.1630ms | 6.1346 KOps/s | 5.7670 KOps/s | |
test_mod_wrap[eager] | 0.6679ms | 0.2429ms | 4.1176 KOps/s | 4.0331 KOps/s | |
test_mod_wrap[compile] | 0.3417ms | 0.2729ms | 3.6644 KOps/s | 3.5903 KOps/s | |
test_mod_wrap[compile-overhead] | 7.0426ms | 3.7349ms | 267.7482 Ops/s | 266.5366 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.7076ms | 1.3449ms | 743.5351 Ops/s | 691.7374 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3603ms | 1.2252ms | 816.1986 Ops/s | 745.0939 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3505ms | 0.9096ms | 1.0993 KOps/s | 960.1592 Ops/s | |
test_seq_add[eager] | 0.1800ms | 0.1073ms | 9.3231 KOps/s | 8.5519 KOps/s | |
test_seq_add[compile] | 0.2606ms | 88.2349μs | 11.3334 KOps/s | 11.7096 KOps/s | |
test_seq_add[compile-overhead] | 0.1728ms | 0.1320ms | 7.5785 KOps/s | 7.9616 KOps/s | |
test_seq_wrap[eager] | 0.4759ms | 0.4039ms | 2.4760 KOps/s | 2.3632 KOps/s | |
test_seq_wrap[compile] | 0.3614ms | 0.2978ms | 3.3584 KOps/s | 3.3858 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2717ms | 0.2200ms | 4.5448 KOps/s | 4.5077 KOps/s | |
test_func_call_runtime[False-eager] | 0.8144ms | 0.7421ms | 1.3474 KOps/s | 1.3752 KOps/s | |
test_func_call_runtime[False-compile] | 0.9147ms | 0.7365ms | 1.3577 KOps/s | 1.3876 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4258ms | 0.3514ms | 2.8458 KOps/s | 2.7835 KOps/s | |
test_func_call_runtime[True-eager] | 1.0102ms | 0.8980ms | 1.1136 KOps/s | 1.1144 KOps/s | |
test_func_call_runtime[True-compile] | 0.7955ms | 0.7360ms | 1.3587 KOps/s | 1.3338 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4227ms | 0.3726ms | 2.6840 KOps/s | 2.6311 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.1001ms | 0.7239ms | 1.3815 KOps/s | 1.3714 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8320ms | 0.7176ms | 1.3936 KOps/s | 1.3802 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4458ms | 0.3564ms | 2.8057 KOps/s | 2.7852 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0793ms | 0.9872ms | 1.0130 KOps/s | 1.0008 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8397ms | 0.7645ms | 1.3080 KOps/s | 1.3013 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4591ms | 0.3966ms | 2.5211 KOps/s | 2.4557 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5512ms | 2.0666ms | 483.8790 Ops/s | 475.4932 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8596ms | 0.7786ms | 1.2844 KOps/s | 1.2626 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4518ms | 0.4021ms | 2.4871 KOps/s | 2.4547 KOps/s | |
test_distributed | 6.1247ms | 0.4029ms | 2.4821 KOps/s | 8.6087 KOps/s | |
test_tdmodule | 0.1444ms | 18.6990μs | 53.4788 KOps/s | 47.1474 KOps/s | |
test_tdmodule_dispatch | 70.7510μs | 33.2551μs | 30.0706 KOps/s | 26.3942 KOps/s | |
test_tdseq | 46.4410μs | 19.7598μs | 50.6077 KOps/s | 46.0257 KOps/s | |
test_tdseq_dispatch | 62.7410μs | 36.0684μs | 27.7251 KOps/s | 24.4357 KOps/s | |
test_instantiation_functorch | 1.6120ms | 1.4961ms | 668.3873 Ops/s | 655.2125 Ops/s | |
test_exec_functorch | 0.1810ms | 0.1405ms | 7.1186 KOps/s | 7.1350 KOps/s | |
test_exec_functional_call | 0.1767ms | 0.1321ms | 7.5682 KOps/s | 7.4882 KOps/s | |
test_exec_td_decorator | 0.3651ms | 0.1835ms | 5.4505 KOps/s | 5.4582 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8533ms | 0.6804ms | 1.4697 KOps/s | 1.4540 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8306ms | 0.6682ms | 1.4965 KOps/s | 1.4594 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7501ms | 0.5874ms | 1.7023 KOps/s | 1.6931 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7094ms | 0.5868ms | 1.7041 KOps/s | 1.6911 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.2014ms | 19.0913ms | 52.3800 Ops/s | 52.0396 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.7912ms | 19.1267ms | 52.2831 Ops/s | 51.8711 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.1782ms | 19.0042ms | 52.6199 Ops/s | 52.3386 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.1487ms | 18.9615ms | 52.7384 Ops/s | 52.5008 Ops/s | |
test_to_module_speed[True] | 1.1131ms | 0.9515ms | 1.0510 KOps/s | 1.0445 KOps/s | |
test_to_module_speed[False] | 1.3313ms | 0.9401ms | 1.0637 KOps/s | 1.0489 KOps/s | |
test_tc_init | 64.3910μs | 34.7808μs | 28.7515 KOps/s | 27.0786 KOps/s | |
test_tc_init_nested | 0.1126ms | 68.0670μs | 14.6914 KOps/s | 13.6866 KOps/s | |
test_tc_first_layer_tensor | 22.8800μs | 0.8039μs | 1.2439 MOps/s | 1.2329 MOps/s | |
test_tc_first_layer_nontensor | 24.5900μs | 2.2307μs | 448.2838 KOps/s | 443.3242 KOps/s | |
test_tc_second_layer_tensor | 9.3270μs | 1.4082μs | 710.1064 KOps/s | 697.8133 KOps/s | |
test_tc_second_layer_nontensor | 23.2600μs | 2.9868μs | 334.8104 KOps/s | 335.2055 KOps/s | |
test_unbind | 0.2306s | 10.2659ms | 97.4099 Ops/s | 142.3727 Ops/s | |
test_full_like | 12.6108ms | 9.3678ms | 106.7484 Ops/s | 105.6935 Ops/s | |
test_zeros_like | 9.1582ms | 7.1583ms | 139.6985 Ops/s | 140.0183 Ops/s | |
test_ones_like | 4.9057ms | 4.3483ms | 229.9724 Ops/s | 229.5974 Ops/s | |
test_clone | 14.1105ms | 10.6405ms | 93.9804 Ops/s | 151.4699 Ops/s | |
test_squeeze | 57.9910μs | 9.4999μs | 105.2644 KOps/s | 105.9291 KOps/s | |
test_unsqueeze | 0.1203ms | 68.7825μs | 14.5386 KOps/s | 13.9543 KOps/s | |
test_split | 0.3952ms | 0.1563ms | 6.3989 KOps/s | 6.3612 KOps/s | |
test_permute | 0.3031ms | 0.1725ms | 5.7958 KOps/s | 5.6049 KOps/s | |
test_stack | 51.2113ms | 50.8631ms | 19.6606 Ops/s | 18.7135 Ops/s | |
test_cat | 51.5649ms | 50.9444ms | 19.6292 Ops/s | 18.7319 Ops/s |
vmoens
added a commit
that referenced
this pull request
Jan 14, 2025
ghstack-source-id: 63117f9b3ac4125a2be4e3e55719cc718051fc10 Pull Request resolved: #1182
vmoens
added a commit
that referenced
this pull request
Jan 14, 2025
ghstack-source-id: 63117f9b3ac4125a2be4e3e55719cc718051fc10 Pull Request resolved: #1182
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):