-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CI] Upgrade to linux_job_v2 #1097
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 20, 2024
vmoens
added a commit
that referenced
this pull request
Nov 20, 2024
ghstack-source-id: cf280238d4a241c02edb5d3e8ffb52c24fd228c9 Pull Request resolved: #1097
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 39.3630μs | 17.7139μs | 56.4529 KOps/s | 55.8191 KOps/s | |
test_plain_set_stack_nested | 45.2740μs | 17.7389μs | 56.3733 KOps/s | 55.3874 KOps/s | |
test_plain_set_nested_inplace | 69.2400μs | 19.6385μs | 50.9205 KOps/s | 50.7725 KOps/s | |
test_plain_set_stack_nested_inplace | 52.0870μs | 19.6910μs | 50.7846 KOps/s | 50.8966 KOps/s | |
test_items | 33.8630μs | 4.2188μs | 237.0356 KOps/s | 239.2109 KOps/s | |
test_items_nested | 0.5858ms | 0.3451ms | 2.8977 KOps/s | 2.9030 KOps/s | |
test_items_nested_locked | 0.7172ms | 0.3457ms | 2.8924 KOps/s | 2.8886 KOps/s | |
test_items_nested_leaf | 0.1439ms | 70.8852μs | 14.1073 KOps/s | 14.1307 KOps/s | |
test_items_stack_nested | 0.4429ms | 0.3464ms | 2.8865 KOps/s | 2.8942 KOps/s | |
test_items_stack_nested_leaf | 0.1415ms | 74.2351μs | 13.4707 KOps/s | 13.5418 KOps/s | |
test_items_stack_nested_locked | 0.6109ms | 0.3466ms | 2.8856 KOps/s | 2.9051 KOps/s | |
test_keys | 23.4840μs | 3.4778μs | 287.5393 KOps/s | 285.8889 KOps/s | |
test_keys_nested | 0.2556ms | 0.1347ms | 7.4261 KOps/s | 7.1987 KOps/s | |
test_keys_nested_locked | 1.8544ms | 0.1401ms | 7.1354 KOps/s | 6.9129 KOps/s | |
test_keys_nested_leaf | 0.2064ms | 0.1157ms | 8.6459 KOps/s | 8.3243 KOps/s | |
test_keys_stack_nested | 0.2578ms | 0.1369ms | 7.3037 KOps/s | 7.3004 KOps/s | |
test_keys_stack_nested_leaf | 0.2477ms | 0.1170ms | 8.5490 KOps/s | 8.5126 KOps/s | |
test_keys_stack_nested_locked | 0.2658ms | 0.1409ms | 7.0982 KOps/s | 7.0317 KOps/s | |
test_values | 4.9552μs | 1.0682μs | 936.1701 KOps/s | 943.1286 KOps/s | |
test_values_nested | 0.1070ms | 55.1082μs | 18.1461 KOps/s | 17.8881 KOps/s | |
test_values_nested_locked | 0.4075ms | 56.5431μs | 17.6856 KOps/s | 18.1496 KOps/s | |
test_values_nested_leaf | 0.1151ms | 60.0046μs | 16.6654 KOps/s | 16.5588 KOps/s | |
test_values_stack_nested | 0.1097ms | 56.5083μs | 17.6965 KOps/s | 17.6969 KOps/s | |
test_values_stack_nested_leaf | 0.1059ms | 60.0559μs | 16.6512 KOps/s | 16.5066 KOps/s | |
test_values_stack_nested_locked | 0.3495ms | 56.7319μs | 17.6268 KOps/s | 17.7439 KOps/s | |
test_membership | 23.1830μs | 0.8918μs | 1.1214 MOps/s | 1.1256 MOps/s | |
test_membership_nested | 26.4690μs | 2.7434μs | 364.5086 KOps/s | 358.9759 KOps/s | |
test_membership_nested_leaf | 28.0030μs | 2.7483μs | 363.8581 KOps/s | 358.9950 KOps/s | |
test_membership_stacked_nested | 26.5600μs | 2.7473μs | 363.9875 KOps/s | 362.3322 KOps/s | |
test_membership_stacked_nested_leaf | 27.7720μs | 2.7447μs | 364.3408 KOps/s | 365.1253 KOps/s | |
test_membership_nested_last | 23.8150μs | 4.0856μs | 244.7624 KOps/s | 247.4484 KOps/s | |
test_membership_nested_leaf_last | 18.1940μs | 4.1389μs | 241.6115 KOps/s | 247.5506 KOps/s | |
test_membership_stacked_nested_last | 27.5920μs | 4.1173μs | 242.8783 KOps/s | 248.4505 KOps/s | |
test_membership_stacked_nested_leaf_last | 28.9840μs | 4.1381μs | 241.6555 KOps/s | 235.8262 KOps/s | |
test_nested_getleaf | 36.1870μs | 10.5389μs | 94.8863 KOps/s | 94.8551 KOps/s | |
test_nested_get | 39.0930μs | 10.0730μs | 99.2753 KOps/s | 100.4554 KOps/s | |
test_stacked_getleaf | 33.0110μs | 10.4946μs | 95.2872 KOps/s | 95.1786 KOps/s | |
test_stacked_get | 43.0880μs | 9.9948μs | 100.0523 KOps/s | 100.3423 KOps/s | |
test_nested_getitemleaf | 35.3760μs | 10.9727μs | 91.1350 KOps/s | 91.1044 KOps/s | |
test_nested_getitem | 0.1289ms | 10.1258μs | 98.7572 KOps/s | 96.7187 KOps/s | |
test_stacked_getitemleaf | 0.1398ms | 11.3738μs | 87.9212 KOps/s | 90.2709 KOps/s | |
test_stacked_getitem | 34.2040μs | 10.2943μs | 97.1407 KOps/s | 97.4512 KOps/s | |
test_lock_nested | 2.9221ms | 0.4474ms | 2.2353 KOps/s | 1.8323 KOps/s | |
test_lock_stack_nested | 0.6254ms | 0.4125ms | 2.4243 KOps/s | 2.4519 KOps/s | |
test_unlock_nested | 0.7338ms | 0.3590ms | 2.7857 KOps/s | 2.7668 KOps/s | |
test_unlock_stack_nested | 0.6378ms | 0.3293ms | 3.0365 KOps/s | 3.0507 KOps/s | |
test_flatten_speed | 0.1763ms | 91.2862μs | 10.9546 KOps/s | 10.9314 KOps/s | |
test_unflatten_speed | 0.6647ms | 0.4694ms | 2.1302 KOps/s | 2.1501 KOps/s | |
test_common_ops | 4.3535ms | 0.7487ms | 1.3356 KOps/s | 1.3041 KOps/s | |
test_creation | 27.2510μs | 2.1469μs | 465.7891 KOps/s | 490.2582 KOps/s | |
test_creation_empty | 54.4520μs | 10.6048μs | 94.2971 KOps/s | 92.7564 KOps/s | |
test_creation_nested_1 | 40.1950μs | 13.5632μs | 73.7290 KOps/s | 72.1283 KOps/s | |
test_creation_nested_2 | 52.0570μs | 17.6093μs | 56.7881 KOps/s | 56.3412 KOps/s | |
test_clone | 60.4330μs | 13.1799μs | 75.8729 KOps/s | 76.8144 KOps/s | |
test_getitem[int] | 1.1474ms | 12.1881μs | 82.0473 KOps/s | 81.6155 KOps/s | |
test_getitem[slice_int] | 0.1370ms | 23.7054μs | 42.1844 KOps/s | 42.9802 KOps/s | |
test_getitem[range] | 0.1626ms | 47.4390μs | 21.0797 KOps/s | 20.7883 KOps/s | |
test_getitem[tuple] | 0.1288ms | 19.9102μs | 50.2256 KOps/s | 51.7925 KOps/s | |
test_getitem[list] | 0.1613ms | 43.6559μs | 22.9064 KOps/s | 22.5138 KOps/s | |
test_setitem_dim[int] | 68.3980μs | 24.5380μs | 40.7531 KOps/s | 40.6477 KOps/s | |
test_setitem_dim[slice_int] | 84.0880μs | 49.9292μs | 20.0283 KOps/s | 20.3390 KOps/s | |
test_setitem_dim[range] | 0.1032ms | 74.1237μs | 13.4910 KOps/s | 13.7420 KOps/s | |
test_setitem_dim[tuple] | 80.2300μs | 39.0555μs | 25.6046 KOps/s | 25.4462 KOps/s | |
test_setitem | 65.5720μs | 20.0625μs | 49.8442 KOps/s | 50.0035 KOps/s | |
test_set | 64.5410μs | 19.7689μs | 50.5846 KOps/s | 52.0282 KOps/s | |
test_set_shared | 3.5015ms | 0.1656ms | 6.0388 KOps/s | 6.0515 KOps/s | |
test_update | 0.1784ms | 22.0421μs | 45.3677 KOps/s | 45.7301 KOps/s | |
test_update_nested | 95.0170μs | 31.4947μs | 31.7514 KOps/s | 32.0982 KOps/s | |
test_update__nested | 0.6509ms | 32.3276μs | 30.9333 KOps/s | 31.7951 KOps/s | |
test_set_nested | 82.2940μs | 21.1085μs | 47.3744 KOps/s | 46.4166 KOps/s | |
test_set_nested_new | 67.5170μs | 25.7637μs | 38.8143 KOps/s | 38.7448 KOps/s | |
test_select | 0.1122ms | 42.1201μs | 23.7416 KOps/s | 24.1564 KOps/s | |
test_select_nested | 0.1194ms | 60.0002μs | 16.6666 KOps/s | 16.2588 KOps/s | |
test_exclude_nested | 0.1524ms | 75.4966μs | 13.2456 KOps/s | 13.7220 KOps/s | |
test_empty[True] | 0.6400ms | 0.3457ms | 2.8926 KOps/s | 2.8566 KOps/s | |
test_empty[False] | 6.3592μs | 1.2995μs | 769.5426 KOps/s | 818.4265 KOps/s | |
test_unbind_speed | 0.5350ms | 0.2605ms | 3.8395 KOps/s | 3.8954 KOps/s | |
test_unbind_speed_stack0 | 0.4444ms | 0.2578ms | 3.8792 KOps/s | 3.9467 KOps/s | |
test_unbind_speed_stack1 | 0.1070s | 0.7609ms | 1.3141 KOps/s | 1.4300 KOps/s | |
test_split | 93.1105ms | 1.7204ms | 581.2540 Ops/s | 576.5647 Ops/s | |
test_chunk | 0.1063s | 1.7404ms | 574.5683 Ops/s | 582.7583 Ops/s | |
test_consolidate_njt[False-None] | 8.2174ms | 7.9208ms | 126.2493 Ops/s | 121.9076 Ops/s | |
test_creation[device0] | 0.2495ms | 90.1558μs | 11.0919 KOps/s | 10.6477 KOps/s | |
test_creation_from_tensor | 4.1603ms | 94.6093μs | 10.5698 KOps/s | 10.5083 KOps/s | |
test_add_one[memmap_tensor0] | 0.1565ms | 4.7390μs | 211.0131 KOps/s | 209.4542 KOps/s | |
test_contiguous[memmap_tensor0] | 9.2470μs | 0.4957μs | 2.0174 MOps/s | 1.9544 MOps/s | |
test_stack[memmap_tensor0] | 36.3080μs | 3.5301μs | 283.2812 KOps/s | 295.5417 KOps/s | |
test_memmaptd_index | 1.0227ms | 0.2303ms | 4.3422 KOps/s | 4.3173 KOps/s | |
test_memmaptd_index_astensor | 0.8057ms | 0.3070ms | 3.2572 KOps/s | 3.2479 KOps/s | |
test_memmaptd_index_op | 0.9065ms | 0.5632ms | 1.7755 KOps/s | 1.7378 KOps/s | |
test_serialize_model | 0.1268s | 0.1152s | 8.6789 Ops/s | 7.5277 Ops/s | |
test_serialize_model_pickle | 0.4491s | 0.3906s | 2.5601 Ops/s | 2.6000 Ops/s | |
test_serialize_weights | 0.1255s | 0.1167s | 8.5655 Ops/s | 8.5723 Ops/s | |
test_serialize_weights_returnearly | 0.1847s | 0.1647s | 6.0711 Ops/s | 6.2711 Ops/s | |
test_serialize_weights_pickle | 1.1593s | 0.7037s | 1.4210 Ops/s | 1.2105 Ops/s | |
test_serialize_weights_filesystem | 0.1459s | 0.1406s | 7.1125 Ops/s | 7.0483 Ops/s | |
test_serialize_model_filesystem | 0.2401s | 0.1553s | 6.4389 Ops/s | 6.9395 Ops/s | |
test_reshape_pytree | 68.5480μs | 27.3234μs | 36.5987 KOps/s | 37.0436 KOps/s | |
test_reshape_td | 75.6610μs | 32.4343μs | 30.8316 KOps/s | 30.6931 KOps/s | |
test_view_pytree | 0.1009ms | 27.2704μs | 36.6698 KOps/s | 36.8488 KOps/s | |
test_view_td | 73.9780μs | 37.9341μs | 26.3615 KOps/s | 26.6914 KOps/s | |
test_unbind_pytree | 61.2740μs | 30.2995μs | 33.0038 KOps/s | 33.0670 KOps/s | |
test_unbind_td | 0.3476ms | 38.7259μs | 25.8225 KOps/s | 26.2387 KOps/s | |
test_split_pytree | 60.5630μs | 30.2020μs | 33.1104 KOps/s | 34.0311 KOps/s | |
test_split_td | 0.5989ms | 44.1431μs | 22.6536 KOps/s | 23.4849 KOps/s | |
test_add_pytree | 73.8790μs | 35.3197μs | 28.3128 KOps/s | 28.3887 KOps/s | |
test_add_td | 0.1170ms | 51.6883μs | 19.3467 KOps/s | 18.9721 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1150ms | 62.3221μs | 16.0457 KOps/s | 15.9912 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 1.4359ms | 0.1605ms | 6.2314 KOps/s | 6.2803 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1468ms | 45.4450μs | 22.0046 KOps/s | 21.3141 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2607ms | 0.1200ms | 8.3348 KOps/s | 8.5135 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 54.9020μs | 26.2415μs | 38.1075 KOps/s | 38.6480 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1213ms | 53.6257μs | 18.6478 KOps/s | 17.8373 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1540ms | 79.5534μs | 12.5702 KOps/s | 12.6980 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1425ms | 69.6425μs | 14.3590 KOps/s | 14.6540 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2304ms | 0.1052ms | 9.5038 KOps/s | 9.5459 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3695ms | 0.1987ms | 5.0326 KOps/s | 5.1239 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1091ms | 45.7508μs | 21.8576 KOps/s | 22.1240 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4631ms | 60.0908μs | 16.6415 KOps/s | 16.3853 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2331ms | 0.1036ms | 9.6551 KOps/s | 9.6405 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3398ms | 0.2035ms | 4.9139 KOps/s | 4.9801 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4218ms | 0.2083ms | 4.7997 KOps/s | 4.8571 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2449ms | 0.1094ms | 9.1370 KOps/s | 9.3776 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1879ms | 53.0094μs | 18.8646 KOps/s | 18.5409 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1033ms | 46.9594μs | 21.2950 KOps/s | 21.4206 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.5884ms | 0.1594ms | 6.2718 KOps/s | 6.3178 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1916ms | 0.1037ms | 9.6452 KOps/s | 9.2427 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 61.2240μs | 20.5577μs | 48.6437 KOps/s | 47.5603 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1224ms | 58.2846μs | 17.1572 KOps/s | 17.0945 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1341ms | 82.8313μs | 12.0727 KOps/s | 12.2773 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1313ms | 71.6458μs | 13.9575 KOps/s | 14.3061 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3935ms | 0.2174ms | 4.5990 KOps/s | 4.7619 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.5712ms | 1.2528ms | 798.2339 Ops/s | 802.0701 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3277ms | 0.2057ms | 4.8616 KOps/s | 4.8936 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.2048ms | 0.7794ms | 1.2831 KOps/s | 1.2964 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.6633ms | 0.4667ms | 2.1428 KOps/s | 2.1624 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.8390ms | 2.5813ms | 387.3986 Ops/s | 388.8275 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 74.8390μs | 36.9348μs | 27.0747 KOps/s | 27.2174 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.3911ms | 31.5483μs | 31.6974 KOps/s | 29.8598 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 88.1050μs | 28.8052μs | 34.7159 KOps/s | 32.3998 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 74.1890μs | 23.1716μs | 43.1562 KOps/s | 42.4818 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 68.0970μs | 29.8667μs | 33.4821 KOps/s | 32.1209 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 79.4480μs | 23.0383μs | 43.4061 KOps/s | 42.6366 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1155ms | 52.4319μs | 19.0724 KOps/s | 19.4105 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4909ms | 19.5160μs | 51.2400 KOps/s | 49.1014 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 99.8370μs | 43.7380μs | 22.8634 KOps/s | 22.9336 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 70.4310μs | 19.0785μs | 52.4152 KOps/s | 51.9329 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1068ms | 44.6610μs | 22.3909 KOps/s | 22.2172 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 71.1030μs | 19.1807μs | 52.1357 KOps/s | 52.5814 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1160ms | 53.8372μs | 18.5745 KOps/s | 19.0963 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8972ms | 19.3088μs | 51.7899 KOps/s | 50.1698 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1270ms | 44.7967μs | 22.3231 KOps/s | 22.0355 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 68.1370μs | 18.8472μs | 53.0584 KOps/s | 52.1818 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1098ms | 44.9483μs | 22.2478 KOps/s | 22.1352 KOps/s | |
test_compile_indexing[int-pytree-eager] | 57.6280μs | 18.9309μs | 52.8236 KOps/s | 52.2747 KOps/s | |
test_mod_add[eager] | 86.2910μs | 25.4875μs | 39.2350 KOps/s | 37.1423 KOps/s | |
test_mod_add[compile] | 0.1127ms | 45.2303μs | 22.1091 KOps/s | 21.6213 KOps/s | |
test_mod_add[compile-overhead] | 0.1223ms | 45.5237μs | 21.9666 KOps/s | 21.5243 KOps/s | |
test_mod_wrap[eager] | 0.3578ms | 0.2158ms | 4.6341 KOps/s | 4.6528 KOps/s | |
test_mod_wrap[compile] | 1.4511ms | 0.2065ms | 4.8427 KOps/s | 4.8674 KOps/s | |
test_mod_wrap[compile-overhead] | 1.4189ms | 0.2047ms | 4.8855 KOps/s | 4.9161 KOps/s | |
test_mod_wrap_and_backward[eager] | 18.2478ms | 12.0929ms | 82.6928 Ops/s | 89.4920 Ops/s | |
test_mod_wrap_and_backward[compile] | 16.2627ms | 11.6753ms | 85.6512 Ops/s | 77.2400 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 17.4201ms | 11.9293ms | 83.8271 Ops/s | 85.8731 Ops/s | |
test_seq_add[eager] | 0.1720ms | 88.2597μs | 11.3302 KOps/s | 10.4005 KOps/s | |
test_seq_add[compile] | 0.1156ms | 59.8635μs | 16.7047 KOps/s | 16.4225 KOps/s | |
test_seq_add[compile-overhead] | 0.1194ms | 59.4224μs | 16.8287 KOps/s | 16.8703 KOps/s | |
test_seq_wrap[eager] | 0.4980ms | 0.3937ms | 2.5398 KOps/s | 2.4776 KOps/s | |
test_seq_wrap[compile] | 0.4116ms | 0.2280ms | 4.3859 KOps/s | 4.3291 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4234ms | 0.2288ms | 4.3714 KOps/s | 4.4180 KOps/s | |
test_func_call_runtime[False-eager] | 0.7698ms | 0.5517ms | 1.8126 KOps/s | 1.8871 KOps/s | |
test_func_call_runtime[False-compile] | 0.5172ms | 0.4221ms | 2.3692 KOps/s | 2.3371 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.9242ms | 0.4256ms | 2.3498 KOps/s | 2.3393 KOps/s | |
test_func_call_runtime[True-eager] | 0.9412ms | 0.7654ms | 1.3065 KOps/s | 1.3358 KOps/s | |
test_func_call_runtime[True-compile] | 0.8404ms | 0.4662ms | 2.1452 KOps/s | 2.1346 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6072ms | 0.4648ms | 2.1517 KOps/s | 2.1368 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8571ms | 0.5538ms | 1.8056 KOps/s | 1.8819 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8692ms | 0.4283ms | 2.3349 KOps/s | 2.3471 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5443ms | 0.4249ms | 2.3534 KOps/s | 2.3138 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1019ms | 0.9014ms | 1.1094 KOps/s | 1.1235 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.5848ms | 0.4879ms | 2.0496 KOps/s | 2.0315 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.5781ms | 0.4903ms | 2.0395 KOps/s | 2.0330 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5276ms | 1.8694ms | 534.9395 Ops/s | 527.5591 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.8922ms | 0.5187ms | 1.9280 KOps/s | 1.9126 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8598ms | 0.5225ms | 1.9137 KOps/s | 1.9255 KOps/s | |
test_distributed | 0.2528ms | 0.1269ms | 7.8820 KOps/s | 7.7212 KOps/s | |
test_tdmodule | 0.1019ms | 18.8020μs | 53.1859 KOps/s | 51.7999 KOps/s | |
test_tdmodule_dispatch | 57.0770μs | 37.2302μs | 26.8599 KOps/s | 26.5370 KOps/s | |
test_tdseq | 39.7440μs | 21.4677μs | 46.5815 KOps/s | 45.9160 KOps/s | |
test_tdseq_dispatch | 69.0790μs | 42.1640μs | 23.7169 KOps/s | 23.1203 KOps/s | |
test_instantiation_functorch | 1.7506ms | 1.5123ms | 661.2433 Ops/s | 653.4522 Ops/s | |
test_exec_functorch | 0.4260ms | 0.1808ms | 5.5305 KOps/s | 5.5096 KOps/s | |
test_exec_functional_call | 0.3681ms | 0.1687ms | 5.9290 KOps/s | 5.7505 KOps/s | |
test_exec_td_decorator | 0.4599ms | 0.2269ms | 4.4076 KOps/s | 4.2785 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9397ms | 0.6359ms | 1.5725 KOps/s | 1.5694 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.2048ms | 0.6441ms | 1.5525 KOps/s | 1.5796 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.8505ms | 0.5284ms | 1.8925 KOps/s | 1.9171 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8363ms | 0.5293ms | 1.8892 KOps/s | 1.9390 KOps/s | |
test_to_module_speed[True] | 1.4913ms | 1.2979ms | 770.4677 Ops/s | 750.0554 Ops/s | |
test_to_module_speed[False] | 2.0907ms | 1.2756ms | 783.9265 Ops/s | 810.1362 Ops/s | |
test_tc_init | 84.4380μs | 45.8078μs | 21.8303 KOps/s | 22.4325 KOps/s | |
test_tc_init_nested | 0.1573ms | 91.2706μs | 10.9564 KOps/s | 11.1038 KOps/s | |
test_tc_first_layer_tensor | 23.0230μs | 1.5243μs | 656.0307 KOps/s | 642.7054 KOps/s | |
test_tc_first_layer_nontensor | 27.7420μs | 4.7110μs | 212.2674 KOps/s | 211.1031 KOps/s | |
test_tc_second_layer_tensor | 21.5610μs | 2.8910μs | 345.9065 KOps/s | 351.0283 KOps/s | |
test_tc_second_layer_nontensor | 26.6200μs | 6.0878μs | 164.2626 KOps/s | 168.4347 KOps/s | |
test_unbind | 0.2264s | 12.5044ms | 79.9718 Ops/s | 72.0628 Ops/s | |
test_full_like | 16.7930ms | 11.9819ms | 83.4593 Ops/s | 137.1118 Ops/s | |
test_zeros_like | 12.9782ms | 7.6101ms | 131.4045 Ops/s | 361.7556 Ops/s | |
test_ones_like | 13.6635ms | 7.7958ms | 128.2739 Ops/s | 306.2873 Ops/s | |
test_clone | 14.5869ms | 9.3818ms | 106.5898 Ops/s | 197.0765 Ops/s | |
test_squeeze | 61.0340μs | 11.6755μs | 85.6493 KOps/s | 86.1589 KOps/s | |
test_unsqueeze | 0.1546ms | 84.8318μs | 11.7880 KOps/s | 11.3282 KOps/s | |
test_split | 0.4928ms | 0.1884ms | 5.3078 KOps/s | 5.3210 KOps/s | |
test_permute | 0.3548ms | 0.2161ms | 4.6285 KOps/s | 4.5649 KOps/s | |
test_stack | 31.4839ms | 25.1326ms | 39.7890 Ops/s | 38.5598 Ops/s | |
test_cat | 26.7779ms | 24.8068ms | 40.3116 Ops/s | 39.6433 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CI
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):