[CI] Upgrade to linux_job_v2 #1097

Merged: vmoens merged 1 commit into gh/vmoens/35/base from gh/vmoens/35/head on Nov 20, 2024

vmoens (Contributor) commented Nov 20, 2024

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]
@facebook-github-bot added the CLA Signed label Nov 20, 2024
vmoens added a commit that referenced this pull request Nov 20, 2024
ghstack-source-id: cf280238d4a241c02edb5d3e8ffb52c24fd228c9
Pull Request resolved: #1097
@vmoens added the CI label Nov 20, 2024
@vmoens merged commit 24427a3 into gh/vmoens/35/base Nov 20, 2024
34 of 41 checks passed
@vmoens deleted the gh/vmoens/35/head branch November 20, 2024 10:43

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 217. Improved: $\large\color{#35bf28}9$. Worsened: $\large\color{#d91a1a}8$.

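Detailed results. Columns, in order: Name, Max (time), Mean (time), Ops (throughput on this PR), Ops on Repo HEAD (baseline throughput), Change (relative difference in Ops).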
test_plain_set_nested 39.3630μs 17.7139μs 56.4529 KOps/s 55.8191 KOps/s $\color{#35bf28}+1.14\%$
test_plain_set_stack_nested 45.2740μs 17.7389μs 56.3733 KOps/s 55.3874 KOps/s $\color{#35bf28}+1.78\%$
test_plain_set_nested_inplace 69.2400μs 19.6385μs 50.9205 KOps/s 50.7725 KOps/s $\color{#35bf28}+0.29\%$
test_plain_set_stack_nested_inplace 52.0870μs 19.6910μs 50.7846 KOps/s 50.8966 KOps/s $\color{#d91a1a}-0.22\%$
test_items 33.8630μs 4.2188μs 237.0356 KOps/s 239.2109 KOps/s $\color{#d91a1a}-0.91\%$
test_items_nested 0.5858ms 0.3451ms 2.8977 KOps/s 2.9030 KOps/s $\color{#d91a1a}-0.18\%$
test_items_nested_locked 0.7172ms 0.3457ms 2.8924 KOps/s 2.8886 KOps/s $\color{#35bf28}+0.13\%$
test_items_nested_leaf 0.1439ms 70.8852μs 14.1073 KOps/s 14.1307 KOps/s $\color{#d91a1a}-0.17\%$
test_items_stack_nested 0.4429ms 0.3464ms 2.8865 KOps/s 2.8942 KOps/s $\color{#d91a1a}-0.27\%$
test_items_stack_nested_leaf 0.1415ms 74.2351μs 13.4707 KOps/s 13.5418 KOps/s $\color{#d91a1a}-0.52\%$
test_items_stack_nested_locked 0.6109ms 0.3466ms 2.8856 KOps/s 2.9051 KOps/s $\color{#d91a1a}-0.67\%$
test_keys 23.4840μs 3.4778μs 287.5393 KOps/s 285.8889 KOps/s $\color{#35bf28}+0.58\%$
test_keys_nested 0.2556ms 0.1347ms 7.4261 KOps/s 7.1987 KOps/s $\color{#35bf28}+3.16\%$
test_keys_nested_locked 1.8544ms 0.1401ms 7.1354 KOps/s 6.9129 KOps/s $\color{#35bf28}+3.22\%$
test_keys_nested_leaf 0.2064ms 0.1157ms 8.6459 KOps/s 8.3243 KOps/s $\color{#35bf28}+3.86\%$
test_keys_stack_nested 0.2578ms 0.1369ms 7.3037 KOps/s 7.3004 KOps/s $\color{#35bf28}+0.04\%$
test_keys_stack_nested_leaf 0.2477ms 0.1170ms 8.5490 KOps/s 8.5126 KOps/s $\color{#35bf28}+0.43\%$
test_keys_stack_nested_locked 0.2658ms 0.1409ms 7.0982 KOps/s 7.0317 KOps/s $\color{#35bf28}+0.95\%$
test_values 4.9552μs 1.0682μs 936.1701 KOps/s 943.1286 KOps/s $\color{#d91a1a}-0.74\%$
test_values_nested 0.1070ms 55.1082μs 18.1461 KOps/s 17.8881 KOps/s $\color{#35bf28}+1.44\%$
test_values_nested_locked 0.4075ms 56.5431μs 17.6856 KOps/s 18.1496 KOps/s $\color{#d91a1a}-2.56\%$
test_values_nested_leaf 0.1151ms 60.0046μs 16.6654 KOps/s 16.5588 KOps/s $\color{#35bf28}+0.64\%$
test_values_stack_nested 0.1097ms 56.5083μs 17.6965 KOps/s 17.6969 KOps/s $-0.00\%$
test_values_stack_nested_leaf 0.1059ms 60.0559μs 16.6512 KOps/s 16.5066 KOps/s $\color{#35bf28}+0.88\%$
test_values_stack_nested_locked 0.3495ms 56.7319μs 17.6268 KOps/s 17.7439 KOps/s $\color{#d91a1a}-0.66\%$
test_membership 23.1830μs 0.8918μs 1.1214 MOps/s 1.1256 MOps/s $\color{#d91a1a}-0.37\%$
test_membership_nested 26.4690μs 2.7434μs 364.5086 KOps/s 358.9759 KOps/s $\color{#35bf28}+1.54\%$
test_membership_nested_leaf 28.0030μs 2.7483μs 363.8581 KOps/s 358.9950 KOps/s $\color{#35bf28}+1.35\%$
test_membership_stacked_nested 26.5600μs 2.7473μs 363.9875 KOps/s 362.3322 KOps/s $\color{#35bf28}+0.46\%$
test_membership_stacked_nested_leaf 27.7720μs 2.7447μs 364.3408 KOps/s 365.1253 KOps/s $\color{#d91a1a}-0.21\%$
test_membership_nested_last 23.8150μs 4.0856μs 244.7624 KOps/s 247.4484 KOps/s $\color{#d91a1a}-1.09\%$
test_membership_nested_leaf_last 18.1940μs 4.1389μs 241.6115 KOps/s 247.5506 KOps/s $\color{#d91a1a}-2.40\%$
test_membership_stacked_nested_last 27.5920μs 4.1173μs 242.8783 KOps/s 248.4505 KOps/s $\color{#d91a1a}-2.24\%$
test_membership_stacked_nested_leaf_last 28.9840μs 4.1381μs 241.6555 KOps/s 235.8262 KOps/s $\color{#35bf28}+2.47\%$
test_nested_getleaf 36.1870μs 10.5389μs 94.8863 KOps/s 94.8551 KOps/s $\color{#35bf28}+0.03\%$
test_nested_get 39.0930μs 10.0730μs 99.2753 KOps/s 100.4554 KOps/s $\color{#d91a1a}-1.17\%$
test_stacked_getleaf 33.0110μs 10.4946μs 95.2872 KOps/s 95.1786 KOps/s $\color{#35bf28}+0.11\%$
test_stacked_get 43.0880μs 9.9948μs 100.0523 KOps/s 100.3423 KOps/s $\color{#d91a1a}-0.29\%$
test_nested_getitemleaf 35.3760μs 10.9727μs 91.1350 KOps/s 91.1044 KOps/s $\color{#35bf28}+0.03\%$
test_nested_getitem 0.1289ms 10.1258μs 98.7572 KOps/s 96.7187 KOps/s $\color{#35bf28}+2.11\%$
test_stacked_getitemleaf 0.1398ms 11.3738μs 87.9212 KOps/s 90.2709 KOps/s $\color{#d91a1a}-2.60\%$
test_stacked_getitem 34.2040μs 10.2943μs 97.1407 KOps/s 97.4512 KOps/s $\color{#d91a1a}-0.32\%$
test_lock_nested 2.9221ms 0.4474ms 2.2353 KOps/s 1.8323 KOps/s $\textbf{\color{#35bf28}+21.99\%}$
test_lock_stack_nested 0.6254ms 0.4125ms 2.4243 KOps/s 2.4519 KOps/s $\color{#d91a1a}-1.13\%$
test_unlock_nested 0.7338ms 0.3590ms 2.7857 KOps/s 2.7668 KOps/s $\color{#35bf28}+0.68\%$
test_unlock_stack_nested 0.6378ms 0.3293ms 3.0365 KOps/s 3.0507 KOps/s $\color{#d91a1a}-0.47\%$
test_flatten_speed 0.1763ms 91.2862μs 10.9546 KOps/s 10.9314 KOps/s $\color{#35bf28}+0.21\%$
test_unflatten_speed 0.6647ms 0.4694ms 2.1302 KOps/s 2.1501 KOps/s $\color{#d91a1a}-0.92\%$
test_common_ops 4.3535ms 0.7487ms 1.3356 KOps/s 1.3041 KOps/s $\color{#35bf28}+2.42\%$
test_creation 27.2510μs 2.1469μs 465.7891 KOps/s 490.2582 KOps/s $\color{#d91a1a}-4.99\%$
test_creation_empty 54.4520μs 10.6048μs 94.2971 KOps/s 92.7564 KOps/s $\color{#35bf28}+1.66\%$
test_creation_nested_1 40.1950μs 13.5632μs 73.7290 KOps/s 72.1283 KOps/s $\color{#35bf28}+2.22\%$
test_creation_nested_2 52.0570μs 17.6093μs 56.7881 KOps/s 56.3412 KOps/s $\color{#35bf28}+0.79\%$
test_clone 60.4330μs 13.1799μs 75.8729 KOps/s 76.8144 KOps/s $\color{#d91a1a}-1.23\%$
test_getitem[int] 1.1474ms 12.1881μs 82.0473 KOps/s 81.6155 KOps/s $\color{#35bf28}+0.53\%$
test_getitem[slice_int] 0.1370ms 23.7054μs 42.1844 KOps/s 42.9802 KOps/s $\color{#d91a1a}-1.85\%$
test_getitem[range] 0.1626ms 47.4390μs 21.0797 KOps/s 20.7883 KOps/s $\color{#35bf28}+1.40\%$
test_getitem[tuple] 0.1288ms 19.9102μs 50.2256 KOps/s 51.7925 KOps/s $\color{#d91a1a}-3.03\%$
test_getitem[list] 0.1613ms 43.6559μs 22.9064 KOps/s 22.5138 KOps/s $\color{#35bf28}+1.74\%$
test_setitem_dim[int] 68.3980μs 24.5380μs 40.7531 KOps/s 40.6477 KOps/s $\color{#35bf28}+0.26\%$
test_setitem_dim[slice_int] 84.0880μs 49.9292μs 20.0283 KOps/s 20.3390 KOps/s $\color{#d91a1a}-1.53\%$
test_setitem_dim[range] 0.1032ms 74.1237μs 13.4910 KOps/s 13.7420 KOps/s $\color{#d91a1a}-1.83\%$
test_setitem_dim[tuple] 80.2300μs 39.0555μs 25.6046 KOps/s 25.4462 KOps/s $\color{#35bf28}+0.62\%$
test_setitem 65.5720μs 20.0625μs 49.8442 KOps/s 50.0035 KOps/s $\color{#d91a1a}-0.32\%$
test_set 64.5410μs 19.7689μs 50.5846 KOps/s 52.0282 KOps/s $\color{#d91a1a}-2.77\%$
test_set_shared 3.5015ms 0.1656ms 6.0388 KOps/s 6.0515 KOps/s $\color{#d91a1a}-0.21\%$
test_update 0.1784ms 22.0421μs 45.3677 KOps/s 45.7301 KOps/s $\color{#d91a1a}-0.79\%$
test_update_nested 95.0170μs 31.4947μs 31.7514 KOps/s 32.0982 KOps/s $\color{#d91a1a}-1.08\%$
test_update__nested 0.6509ms 32.3276μs 30.9333 KOps/s 31.7951 KOps/s $\color{#d91a1a}-2.71\%$
test_set_nested 82.2940μs 21.1085μs 47.3744 KOps/s 46.4166 KOps/s $\color{#35bf28}+2.06\%$
test_set_nested_new 67.5170μs 25.7637μs 38.8143 KOps/s 38.7448 KOps/s $\color{#35bf28}+0.18\%$
test_select 0.1122ms 42.1201μs 23.7416 KOps/s 24.1564 KOps/s $\color{#d91a1a}-1.72\%$
test_select_nested 0.1194ms 60.0002μs 16.6666 KOps/s 16.2588 KOps/s $\color{#35bf28}+2.51\%$
test_exclude_nested 0.1524ms 75.4966μs 13.2456 KOps/s 13.7220 KOps/s $\color{#d91a1a}-3.47\%$
test_empty[True] 0.6400ms 0.3457ms 2.8926 KOps/s 2.8566 KOps/s $\color{#35bf28}+1.26\%$
test_empty[False] 6.3592μs 1.2995μs 769.5426 KOps/s 818.4265 KOps/s $\textbf{\color{#d91a1a}-5.97\%}$
test_unbind_speed 0.5350ms 0.2605ms 3.8395 KOps/s 3.8954 KOps/s $\color{#d91a1a}-1.44\%$
test_unbind_speed_stack0 0.4444ms 0.2578ms 3.8792 KOps/s 3.9467 KOps/s $\color{#d91a1a}-1.71\%$
test_unbind_speed_stack1 0.1070s 0.7609ms 1.3141 KOps/s 1.4300 KOps/s $\textbf{\color{#d91a1a}-8.10\%}$
test_split 93.1105ms 1.7204ms 581.2540 Ops/s 576.5647 Ops/s $\color{#35bf28}+0.81\%$
test_chunk 0.1063s 1.7404ms 574.5683 Ops/s 582.7583 Ops/s $\color{#d91a1a}-1.41\%$
test_consolidate_njt[False-None] 8.2174ms 7.9208ms 126.2493 Ops/s 121.9076 Ops/s $\color{#35bf28}+3.56\%$
test_creation[device0] 0.2495ms 90.1558μs 11.0919 KOps/s 10.6477 KOps/s $\color{#35bf28}+4.17\%$
test_creation_from_tensor 4.1603ms 94.6093μs 10.5698 KOps/s 10.5083 KOps/s $\color{#35bf28}+0.59\%$
test_add_one[memmap_tensor0] 0.1565ms 4.7390μs 211.0131 KOps/s 209.4542 KOps/s $\color{#35bf28}+0.74\%$
test_contiguous[memmap_tensor0] 9.2470μs 0.4957μs 2.0174 MOps/s 1.9544 MOps/s $\color{#35bf28}+3.22\%$
test_stack[memmap_tensor0] 36.3080μs 3.5301μs 283.2812 KOps/s 295.5417 KOps/s $\color{#d91a1a}-4.15\%$
test_memmaptd_index 1.0227ms 0.2303ms 4.3422 KOps/s 4.3173 KOps/s $\color{#35bf28}+0.58\%$
test_memmaptd_index_astensor 0.8057ms 0.3070ms 3.2572 KOps/s 3.2479 KOps/s $\color{#35bf28}+0.29\%$
test_memmaptd_index_op 0.9065ms 0.5632ms 1.7755 KOps/s 1.7378 KOps/s $\color{#35bf28}+2.17\%$
test_serialize_model 0.1268s 0.1152s 8.6789 Ops/s 7.5277 Ops/s $\textbf{\color{#35bf28}+15.29\%}$
test_serialize_model_pickle 0.4491s 0.3906s 2.5601 Ops/s 2.6000 Ops/s $\color{#d91a1a}-1.53\%$
test_serialize_weights 0.1255s 0.1167s 8.5655 Ops/s 8.5723 Ops/s $\color{#d91a1a}-0.08\%$
test_serialize_weights_returnearly 0.1847s 0.1647s 6.0711 Ops/s 6.2711 Ops/s $\color{#d91a1a}-3.19\%$
test_serialize_weights_pickle 1.1593s 0.7037s 1.4210 Ops/s 1.2105 Ops/s $\textbf{\color{#35bf28}+17.39\%}$
test_serialize_weights_filesystem 0.1459s 0.1406s 7.1125 Ops/s 7.0483 Ops/s $\color{#35bf28}+0.91\%$
test_serialize_model_filesystem 0.2401s 0.1553s 6.4389 Ops/s 6.9395 Ops/s $\textbf{\color{#d91a1a}-7.21\%}$
test_reshape_pytree 68.5480μs 27.3234μs 36.5987 KOps/s 37.0436 KOps/s $\color{#d91a1a}-1.20\%$
test_reshape_td 75.6610μs 32.4343μs 30.8316 KOps/s 30.6931 KOps/s $\color{#35bf28}+0.45\%$
test_view_pytree 0.1009ms 27.2704μs 36.6698 KOps/s 36.8488 KOps/s $\color{#d91a1a}-0.49\%$
test_view_td 73.9780μs 37.9341μs 26.3615 KOps/s 26.6914 KOps/s $\color{#d91a1a}-1.24\%$
test_unbind_pytree 61.2740μs 30.2995μs 33.0038 KOps/s 33.0670 KOps/s $\color{#d91a1a}-0.19\%$
test_unbind_td 0.3476ms 38.7259μs 25.8225 KOps/s 26.2387 KOps/s $\color{#d91a1a}-1.59\%$
test_split_pytree 60.5630μs 30.2020μs 33.1104 KOps/s 34.0311 KOps/s $\color{#d91a1a}-2.71\%$
test_split_td 0.5989ms 44.1431μs 22.6536 KOps/s 23.4849 KOps/s $\color{#d91a1a}-3.54\%$
test_add_pytree 73.8790μs 35.3197μs 28.3128 KOps/s 28.3887 KOps/s $\color{#d91a1a}-0.27\%$
test_add_td 0.1170ms 51.6883μs 19.3467 KOps/s 18.9721 KOps/s $\color{#35bf28}+1.97\%$
test_compile_add_one_nested[tensordict-compile] 0.1150ms 62.3221μs 16.0457 KOps/s 15.9912 KOps/s $\color{#35bf28}+0.34\%$
test_compile_add_one_nested[tensordict-eager] 1.4359ms 0.1605ms 6.2314 KOps/s 6.2803 KOps/s $\color{#d91a1a}-0.78\%$
test_compile_add_one_nested[pytree-compile] 0.1468ms 45.4450μs 22.0046 KOps/s 21.3141 KOps/s $\color{#35bf28}+3.24\%$
test_compile_add_one_nested[pytree-eager] 0.2607ms 0.1200ms 8.3348 KOps/s 8.5135 KOps/s $\color{#d91a1a}-2.10\%$
test_compile_copy_nested[tensordict-compile] 54.9020μs 26.2415μs 38.1075 KOps/s 38.6480 KOps/s $\color{#d91a1a}-1.40\%$
test_compile_copy_nested[tensordict-eager] 0.1213ms 53.6257μs 18.6478 KOps/s 17.8373 KOps/s $\color{#35bf28}+4.54\%$
test_compile_copy_nested[pytree-compile] 0.1540ms 79.5534μs 12.5702 KOps/s 12.6980 KOps/s $\color{#d91a1a}-1.01\%$
test_compile_copy_nested[pytree-eager] 0.1425ms 69.6425μs 14.3590 KOps/s 14.6540 KOps/s $\color{#d91a1a}-2.01\%$
test_compile_add_one_flat[tensordict-compile] 0.2304ms 0.1052ms 9.5038 KOps/s 9.5459 KOps/s $\color{#d91a1a}-0.44\%$
test_compile_add_one_flat[tensordict-eager] 0.3695ms 0.1987ms 5.0326 KOps/s 5.1239 KOps/s $\color{#d91a1a}-1.78\%$
test_compile_add_one_flat[tensorclass-compile] 0.1091ms 45.7508μs 21.8576 KOps/s 22.1240 KOps/s $\color{#d91a1a}-1.20\%$
test_compile_add_one_flat[tensorclass-eager] 0.4631ms 60.0908μs 16.6415 KOps/s 16.3853 KOps/s $\color{#35bf28}+1.56\%$
test_compile_add_one_flat[pytree-compile] 0.2331ms 0.1036ms 9.6551 KOps/s 9.6405 KOps/s $\color{#35bf28}+0.15\%$
test_compile_add_one_flat[pytree-eager] 0.3398ms 0.2035ms 4.9139 KOps/s 4.9801 KOps/s $\color{#d91a1a}-1.33\%$
test_compile_add_self_flat[tensordict-eager] 0.4218ms 0.2083ms 4.7997 KOps/s 4.8571 KOps/s $\color{#d91a1a}-1.18\%$
test_compile_add_self_flat[tensordict-compile] 0.2449ms 0.1094ms 9.1370 KOps/s 9.3776 KOps/s $\color{#d91a1a}-2.57\%$
test_compile_add_self_flat[tensorclass-eager] 0.1879ms 53.0094μs 18.8646 KOps/s 18.5409 KOps/s $\color{#35bf28}+1.75\%$
test_compile_add_self_flat[tensorclass-compile] 0.1033ms 46.9594μs 21.2950 KOps/s 21.4206 KOps/s $\color{#d91a1a}-0.59\%$
test_compile_add_self_flat[pytree-eager] 0.5884ms 0.1594ms 6.2718 KOps/s 6.3178 KOps/s $\color{#d91a1a}-0.73\%$
test_compile_add_self_flat[pytree-compile] 0.1916ms 0.1037ms 9.6452 KOps/s 9.2427 KOps/s $\color{#35bf28}+4.36\%$
test_compile_copy_flat[tensordict-compile] 61.2240μs 20.5577μs 48.6437 KOps/s 47.5603 KOps/s $\color{#35bf28}+2.28\%$
test_compile_copy_flat[tensordict-eager] 0.1224ms 58.2846μs 17.1572 KOps/s 17.0945 KOps/s $\color{#35bf28}+0.37\%$
test_compile_copy_flat[pytree-compile] 0.1341ms 82.8313μs 12.0727 KOps/s 12.2773 KOps/s $\color{#d91a1a}-1.67\%$
test_compile_copy_flat[pytree-eager] 0.1313ms 71.6458μs 13.9575 KOps/s 14.3061 KOps/s $\color{#d91a1a}-2.44\%$
test_compile_assign_and_add[tensordict-compile] 0.3935ms 0.2174ms 4.5990 KOps/s 4.7619 KOps/s $\color{#d91a1a}-3.42\%$
test_compile_assign_and_add[tensordict-eager] 2.5712ms 1.2528ms 798.2339 Ops/s 802.0701 Ops/s $\color{#d91a1a}-0.48\%$
test_compile_assign_and_add[pytree-compile] 0.3277ms 0.2057ms 4.8616 KOps/s 4.8936 KOps/s $\color{#d91a1a}-0.65\%$
test_compile_assign_and_add[pytree-eager] 1.2048ms 0.7794ms 1.2831 KOps/s 1.2964 KOps/s $\color{#d91a1a}-1.02\%$
test_compile_assign_and_add_stack[compile] 0.6633ms 0.4667ms 2.1428 KOps/s 2.1624 KOps/s $\color{#d91a1a}-0.90\%$
test_compile_assign_and_add_stack[eager] 2.8390ms 2.5813ms 387.3986 Ops/s 388.8275 Ops/s $\color{#d91a1a}-0.37\%$
test_compile_indexing[tensor-tensordict-compile] 74.8390μs 36.9348μs 27.0747 KOps/s 27.2174 KOps/s $\color{#d91a1a}-0.52\%$
test_compile_indexing[tensor-tensordict-eager] 0.3911ms 31.5483μs 31.6974 KOps/s 29.8598 KOps/s $\textbf{\color{#35bf28}+6.15\%}$
test_compile_indexing[tensor-tensorclass-compile] 88.1050μs 28.8052μs 34.7159 KOps/s 32.3998 KOps/s $\textbf{\color{#35bf28}+7.15\%}$
test_compile_indexing[tensor-tensorclass-eager] 74.1890μs 23.1716μs 43.1562 KOps/s 42.4818 KOps/s $\color{#35bf28}+1.59\%$
test_compile_indexing[tensor-pytree-compile] 68.0970μs 29.8667μs 33.4821 KOps/s 32.1209 KOps/s $\color{#35bf28}+4.24\%$
test_compile_indexing[tensor-pytree-eager] 79.4480μs 23.0383μs 43.4061 KOps/s 42.6366 KOps/s $\color{#35bf28}+1.80\%$
test_compile_indexing[slice-tensordict-compile] 0.1155ms 52.4319μs 19.0724 KOps/s 19.4105 KOps/s $\color{#d91a1a}-1.74\%$
test_compile_indexing[slice-tensordict-eager] 0.4909ms 19.5160μs 51.2400 KOps/s 49.1014 KOps/s $\color{#35bf28}+4.36\%$
test_compile_indexing[slice-tensorclass-compile] 99.8370μs 43.7380μs 22.8634 KOps/s 22.9336 KOps/s $\color{#d91a1a}-0.31\%$
test_compile_indexing[slice-tensorclass-eager] 70.4310μs 19.0785μs 52.4152 KOps/s 51.9329 KOps/s $\color{#35bf28}+0.93\%$
test_compile_indexing[slice-pytree-compile] 0.1068ms 44.6610μs 22.3909 KOps/s 22.2172 KOps/s $\color{#35bf28}+0.78\%$
test_compile_indexing[slice-pytree-eager] 71.1030μs 19.1807μs 52.1357 KOps/s 52.5814 KOps/s $\color{#d91a1a}-0.85\%$
test_compile_indexing[int-tensordict-compile] 0.1160ms 53.8372μs 18.5745 KOps/s 19.0963 KOps/s $\color{#d91a1a}-2.73\%$
test_compile_indexing[int-tensordict-eager] 0.8972ms 19.3088μs 51.7899 KOps/s 50.1698 KOps/s $\color{#35bf28}+3.23\%$
test_compile_indexing[int-tensorclass-compile] 0.1270ms 44.7967μs 22.3231 KOps/s 22.0355 KOps/s $\color{#35bf28}+1.31\%$
test_compile_indexing[int-tensorclass-eager] 68.1370μs 18.8472μs 53.0584 KOps/s 52.1818 KOps/s $\color{#35bf28}+1.68\%$
test_compile_indexing[int-pytree-compile] 0.1098ms 44.9483μs 22.2478 KOps/s 22.1352 KOps/s $\color{#35bf28}+0.51\%$
test_compile_indexing[int-pytree-eager] 57.6280μs 18.9309μs 52.8236 KOps/s 52.2747 KOps/s $\color{#35bf28}+1.05\%$
test_mod_add[eager] 86.2910μs 25.4875μs 39.2350 KOps/s 37.1423 KOps/s $\textbf{\color{#35bf28}+5.63\%}$
test_mod_add[compile] 0.1127ms 45.2303μs 22.1091 KOps/s 21.6213 KOps/s $\color{#35bf28}+2.26\%$
test_mod_add[compile-overhead] 0.1223ms 45.5237μs 21.9666 KOps/s 21.5243 KOps/s $\color{#35bf28}+2.05\%$
test_mod_wrap[eager] 0.3578ms 0.2158ms 4.6341 KOps/s 4.6528 KOps/s $\color{#d91a1a}-0.40\%$
test_mod_wrap[compile] 1.4511ms 0.2065ms 4.8427 KOps/s 4.8674 KOps/s $\color{#d91a1a}-0.51\%$
test_mod_wrap[compile-overhead] 1.4189ms 0.2047ms 4.8855 KOps/s 4.9161 KOps/s $\color{#d91a1a}-0.62\%$
test_mod_wrap_and_backward[eager] 18.2478ms 12.0929ms 82.6928 Ops/s 89.4920 Ops/s $\textbf{\color{#d91a1a}-7.60\%}$
test_mod_wrap_and_backward[compile] 16.2627ms 11.6753ms 85.6512 Ops/s 77.2400 Ops/s $\textbf{\color{#35bf28}+10.89\%}$
test_mod_wrap_and_backward[compile-overhead] 17.4201ms 11.9293ms 83.8271 Ops/s 85.8731 Ops/s $\color{#d91a1a}-2.38\%$
test_seq_add[eager] 0.1720ms 88.2597μs 11.3302 KOps/s 10.4005 KOps/s $\textbf{\color{#35bf28}+8.94\%}$
test_seq_add[compile] 0.1156ms 59.8635μs 16.7047 KOps/s 16.4225 KOps/s $\color{#35bf28}+1.72\%$
test_seq_add[compile-overhead] 0.1194ms 59.4224μs 16.8287 KOps/s 16.8703 KOps/s $\color{#d91a1a}-0.25\%$
test_seq_wrap[eager] 0.4980ms 0.3937ms 2.5398 KOps/s 2.4776 KOps/s $\color{#35bf28}+2.51\%$
test_seq_wrap[compile] 0.4116ms 0.2280ms 4.3859 KOps/s 4.3291 KOps/s $\color{#35bf28}+1.31\%$
test_seq_wrap[compile-overhead] 0.4234ms 0.2288ms 4.3714 KOps/s 4.4180 KOps/s $\color{#d91a1a}-1.05\%$
test_func_call_runtime[False-eager] 0.7698ms 0.5517ms 1.8126 KOps/s 1.8871 KOps/s $\color{#d91a1a}-3.95\%$
test_func_call_runtime[False-compile] 0.5172ms 0.4221ms 2.3692 KOps/s 2.3371 KOps/s $\color{#35bf28}+1.37\%$
test_func_call_runtime[False-compile-overhead] 0.9242ms 0.4256ms 2.3498 KOps/s 2.3393 KOps/s $\color{#35bf28}+0.45\%$
test_func_call_runtime[True-eager] 0.9412ms 0.7654ms 1.3065 KOps/s 1.3358 KOps/s $\color{#d91a1a}-2.19\%$
test_func_call_runtime[True-compile] 0.8404ms 0.4662ms 2.1452 KOps/s 2.1346 KOps/s $\color{#35bf28}+0.50\%$
test_func_call_runtime[True-compile-overhead] 0.6072ms 0.4648ms 2.1517 KOps/s 2.1368 KOps/s $\color{#35bf28}+0.70\%$
test_func_call_cm_runtime[False-eager] 0.8571ms 0.5538ms 1.8056 KOps/s 1.8819 KOps/s $\color{#d91a1a}-4.05\%$
test_func_call_cm_runtime[False-compile] 0.8692ms 0.4283ms 2.3349 KOps/s 2.3471 KOps/s $\color{#d91a1a}-0.52\%$
test_func_call_cm_runtime[False-compile-overhead] 0.5443ms 0.4249ms 2.3534 KOps/s 2.3138 KOps/s $\color{#35bf28}+1.71\%$
test_func_call_cm_runtime[True-eager] 1.1019ms 0.9014ms 1.1094 KOps/s 1.1235 KOps/s $\color{#d91a1a}-1.26\%$
test_func_call_cm_runtime[True-compile] 0.5848ms 0.4879ms 2.0496 KOps/s 2.0315 KOps/s $\color{#35bf28}+0.89\%$
test_func_call_cm_runtime[True-compile-overhead] 0.5781ms 0.4903ms 2.0395 KOps/s 2.0330 KOps/s $\color{#35bf28}+0.32\%$
test_vmap_func_call_cm_runtime[eager] 2.5276ms 1.8694ms 534.9395 Ops/s 527.5591 Ops/s $\color{#35bf28}+1.40\%$
test_vmap_func_call_cm_runtime[compile] 0.8922ms 0.5187ms 1.9280 KOps/s 1.9126 KOps/s $\color{#35bf28}+0.80\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.8598ms 0.5225ms 1.9137 KOps/s 1.9255 KOps/s $\color{#d91a1a}-0.61\%$
test_distributed 0.2528ms 0.1269ms 7.8820 KOps/s 7.7212 KOps/s $\color{#35bf28}+2.08\%$
test_tdmodule 0.1019ms 18.8020μs 53.1859 KOps/s 51.7999 KOps/s $\color{#35bf28}+2.68\%$
test_tdmodule_dispatch 57.0770μs 37.2302μs 26.8599 KOps/s 26.5370 KOps/s $\color{#35bf28}+1.22\%$
test_tdseq 39.7440μs 21.4677μs 46.5815 KOps/s 45.9160 KOps/s $\color{#35bf28}+1.45\%$
test_tdseq_dispatch 69.0790μs 42.1640μs 23.7169 KOps/s 23.1203 KOps/s $\color{#35bf28}+2.58\%$
test_instantiation_functorch 1.7506ms 1.5123ms 661.2433 Ops/s 653.4522 Ops/s $\color{#35bf28}+1.19\%$
test_exec_functorch 0.4260ms 0.1808ms 5.5305 KOps/s 5.5096 KOps/s $\color{#35bf28}+0.38\%$
test_exec_functional_call 0.3681ms 0.1687ms 5.9290 KOps/s 5.7505 KOps/s $\color{#35bf28}+3.10\%$
test_exec_td_decorator 0.4599ms 0.2269ms 4.4076 KOps/s 4.2785 KOps/s $\color{#35bf28}+3.02\%$
test_vmap_mlp_speed_decorator[True-True] 0.9397ms 0.6359ms 1.5725 KOps/s 1.5694 KOps/s $\color{#35bf28}+0.20\%$
test_vmap_mlp_speed_decorator[True-False] 1.2048ms 0.6441ms 1.5525 KOps/s 1.5796 KOps/s $\color{#d91a1a}-1.72\%$
test_vmap_mlp_speed_decorator[False-True] 0.8505ms 0.5284ms 1.8925 KOps/s 1.9171 KOps/s $\color{#d91a1a}-1.28\%$
test_vmap_mlp_speed_decorator[False-False] 0.8363ms 0.5293ms 1.8892 KOps/s 1.9390 KOps/s $\color{#d91a1a}-2.57\%$
test_to_module_speed[True] 1.4913ms 1.2979ms 770.4677 Ops/s 750.0554 Ops/s $\color{#35bf28}+2.72\%$
test_to_module_speed[False] 2.0907ms 1.2756ms 783.9265 Ops/s 810.1362 Ops/s $\color{#d91a1a}-3.24\%$
test_tc_init 84.4380μs 45.8078μs 21.8303 KOps/s 22.4325 KOps/s $\color{#d91a1a}-2.68\%$
test_tc_init_nested 0.1573ms 91.2706μs 10.9564 KOps/s 11.1038 KOps/s $\color{#d91a1a}-1.33\%$
test_tc_first_layer_tensor 23.0230μs 1.5243μs 656.0307 KOps/s 642.7054 KOps/s $\color{#35bf28}+2.07\%$
test_tc_first_layer_nontensor 27.7420μs 4.7110μs 212.2674 KOps/s 211.1031 KOps/s $\color{#35bf28}+0.55\%$
test_tc_second_layer_tensor 21.5610μs 2.8910μs 345.9065 KOps/s 351.0283 KOps/s $\color{#d91a1a}-1.46\%$
test_tc_second_layer_nontensor 26.6200μs 6.0878μs 164.2626 KOps/s 168.4347 KOps/s $\color{#d91a1a}-2.48\%$
test_unbind 0.2264s 12.5044ms 79.9718 Ops/s 72.0628 Ops/s $\textbf{\color{#35bf28}+10.98\%}$
test_full_like 16.7930ms 11.9819ms 83.4593 Ops/s 137.1118 Ops/s $\textbf{\color{#d91a1a}-39.13\%}$
test_zeros_like 12.9782ms 7.6101ms 131.4045 Ops/s 361.7556 Ops/s $\textbf{\color{#d91a1a}-63.68\%}$
test_ones_like 13.6635ms 7.7958ms 128.2739 Ops/s 306.2873 Ops/s $\textbf{\color{#d91a1a}-58.12\%}$
test_clone 14.5869ms 9.3818ms 106.5898 Ops/s 197.0765 Ops/s $\textbf{\color{#d91a1a}-45.91\%}$
test_squeeze 61.0340μs 11.6755μs 85.6493 KOps/s 86.1589 KOps/s $\color{#d91a1a}-0.59\%$
test_unsqueeze 0.1546ms 84.8318μs 11.7880 KOps/s 11.3282 KOps/s $\color{#35bf28}+4.06\%$
test_split 0.4928ms 0.1884ms 5.3078 KOps/s 5.3210 KOps/s $\color{#d91a1a}-0.25\%$
test_permute 0.3548ms 0.2161ms 4.6285 KOps/s 4.5649 KOps/s $\color{#35bf28}+1.39\%$
test_stack 31.4839ms 25.1326ms 39.7890 Ops/s 38.5598 Ops/s $\color{#35bf28}+3.19\%$
test_cat 26.7779ms 24.8068ms 40.3116 Ops/s 39.6433 Ops/s $\color{#35bf28}+1.69\%$
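
The Change column appears to be the relative difference between Ops and Ops on Repo HEAD, and the bolded rows are consistent with a cutoff of about 5% in either direction, which would account for the 9 improved and 8 worsened benchmarks in the summary. The sketch below reproduces that arithmetic for three rows copied from the table. The function names and the 5% threshold are assumptions inferred from the numbers, not taken from the benchmark tooling itself.

```python
def relative_change(ops: float, ops_head: float) -> float:
    """Percent change in throughput relative to repo HEAD.

    Both values must be in the same unit (e.g. both in KOps/s).
    """
    return (ops - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    """Label a benchmark; the 5% threshold is an assumed cutoff."""
    if change_pct > threshold:
        return "improved"
    if change_pct < -threshold:
        return "worsened"
    return "unchanged"


# (Ops, Ops on Repo HEAD) pairs copied from the table above.
rows = {
    "test_lock_nested": (2.2353, 1.8323),    # KOps/s
    "test_creation": (465.7891, 490.2582),   # KOps/s
    "test_zeros_like": (131.4045, 361.7556), # Ops/s
}

report = {
    name: (round(relative_change(ops, head), 2),
           classify(relative_change(ops, head)))
    for name, (ops, head) in rows.items()
}

for name, (pct, label) in report.items():
    print(f"{name}: {pct:+.2f}% ({label})")
```

Under this assumed threshold, a row such as test_creation at -4.99% is not counted as worsened, which agrees with it being unbolded in the table.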

Labels
CI, CLA Signed

2 participants