Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Deprecation] Act warned deprecations for v0.6 #1001

Merged
merged 8 commits into from
Oct 8, 2024

Conversation

vmoens
Copy link
Contributor

@vmoens vmoens commented Sep 20, 2024

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 20, 2024
ghstack-source-id: 74fd1c82528ab2e0eaffa9d2c18821c7eee4e7fc
Pull Request resolved: #1001
@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 20, 2024
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 20, 2024
ghstack-source-id: f8d2312e8bd6a32e482864b86f8c1e3afcd9b888
Pull Request resolved: #1001
@vmoens vmoens added the Refactor Refactoring code - not a new feature label Sep 20, 2024
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 20, 2024
ghstack-source-id: d67b08c3f2929c59ab891c23ac9c371e96b6f336
Pull Request resolved: #1001
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Sep 20, 2024
ghstack-source-id: d76b7dc305c5a754c6567da6cc11e7793cb8ab52
Pull Request resolved: #1001
Copy link

github-actions bot commented Sep 20, 2024

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 228. Improved: $\large\color{#35bf28}24$. Worsened: $\large\color{#d91a1a}6$.

Expand to view detailed results
Name Max Mean Ops Ops on Repo HEAD Change
test_plain_set_nested 0.1184ms 16.2999μs 61.3499 KOps/s 55.9595 KOps/s $\textbf{\color{#35bf28}+9.63\%}$
test_plain_set_stack_nested 49.2910μs 16.3818μs 61.0435 KOps/s 55.2117 KOps/s $\textbf{\color{#35bf28}+10.56\%}$
test_plain_set_nested_inplace 44.5210μs 17.5079μs 57.1171 KOps/s 52.0220 KOps/s $\textbf{\color{#35bf28}+9.79\%}$
test_plain_set_stack_nested_inplace 45.2510μs 17.2881μs 57.8432 KOps/s 52.5629 KOps/s $\textbf{\color{#35bf28}+10.05\%}$
test_items 29.7900μs 2.8577μs 349.9359 KOps/s 344.6733 KOps/s $\color{#35bf28}+1.53\%$
test_items_nested 0.3679ms 0.3388ms 2.9519 KOps/s 2.9649 KOps/s $\color{#d91a1a}-0.44\%$
test_items_nested_locked 0.4032ms 0.3408ms 2.9342 KOps/s 2.9165 KOps/s $\color{#35bf28}+0.61\%$
test_items_nested_leaf 87.7310μs 62.4694μs 16.0078 KOps/s 16.0777 KOps/s $\color{#d91a1a}-0.43\%$
test_items_stack_nested 0.3792ms 0.3436ms 2.9104 KOps/s 2.8675 KOps/s $\color{#35bf28}+1.49\%$
test_items_stack_nested_leaf 92.4110μs 63.4655μs 15.7566 KOps/s 15.6693 KOps/s $\color{#35bf28}+0.56\%$
test_items_stack_nested_locked 0.3777ms 0.3489ms 2.8661 KOps/s 2.8866 KOps/s $\color{#d91a1a}-0.71\%$
test_keys 28.0400μs 3.4378μs 290.8855 KOps/s 291.4667 KOps/s $\color{#d91a1a}-0.20\%$
test_keys_nested 0.1048ms 71.0992μs 14.0649 KOps/s 14.1808 KOps/s $\color{#d91a1a}-0.82\%$
test_keys_nested_locked 0.8073ms 76.8793μs 13.0074 KOps/s 12.9490 KOps/s $\color{#35bf28}+0.45\%$
test_keys_nested_leaf 98.8210μs 60.3736μs 16.5635 KOps/s 16.1780 KOps/s $\color{#35bf28}+2.38\%$
test_keys_stack_nested 98.5020μs 70.2087μs 14.2433 KOps/s 13.8408 KOps/s $\color{#35bf28}+2.91\%$
test_keys_stack_nested_leaf 91.8620μs 62.4119μs 16.0226 KOps/s 15.5802 KOps/s $\color{#35bf28}+2.84\%$
test_keys_stack_nested_locked 0.1094ms 76.1771μs 13.1273 KOps/s 12.8825 KOps/s $\color{#35bf28}+1.90\%$
test_values 5.2483μs 0.8365μs 1.1955 MOps/s 1.1442 MOps/s $\color{#35bf28}+4.48\%$
test_values_nested 77.9310μs 49.3440μs 20.2659 KOps/s 20.5057 KOps/s $\color{#d91a1a}-1.17\%$
test_values_nested_locked 83.8220μs 51.0619μs 19.5841 KOps/s 19.7471 KOps/s $\color{#d91a1a}-0.83\%$
test_values_nested_leaf 70.3420μs 43.0964μs 23.2038 KOps/s 23.4039 KOps/s $\color{#d91a1a}-0.85\%$
test_values_stack_nested 83.4510μs 49.6737μs 20.1314 KOps/s 20.3392 KOps/s $\color{#d91a1a}-1.02\%$
test_values_stack_nested_leaf 71.3010μs 43.4520μs 23.0139 KOps/s 22.8910 KOps/s $\color{#35bf28}+0.54\%$
test_values_stack_nested_locked 79.2820μs 51.7263μs 19.3325 KOps/s 19.2118 KOps/s $\color{#35bf28}+0.63\%$
test_membership 2.0015μs 0.5004μs 1.9982 MOps/s 1.9897 MOps/s $\color{#35bf28}+0.43\%$
test_membership_nested 26.6255μs 1.8695μs 534.8893 KOps/s 535.3166 KOps/s $\color{#d91a1a}-0.08\%$
test_membership_nested_leaf 12.6250μs 1.8960μs 527.4234 KOps/s 529.2537 KOps/s $\color{#d91a1a}-0.35\%$
test_membership_stacked_nested 31.5910μs 1.9132μs 522.6875 KOps/s 519.8066 KOps/s $\color{#35bf28}+0.55\%$
test_membership_stacked_nested_leaf 23.4800μs 1.9269μs 518.9637 KOps/s 514.2188 KOps/s $\color{#35bf28}+0.92\%$
test_membership_nested_last 25.8110μs 3.0063μs 332.6360 KOps/s 332.6895 KOps/s $\color{#d91a1a}-0.02\%$
test_membership_nested_leaf_last 31.2010μs 3.0265μs 330.4146 KOps/s 339.8538 KOps/s $\color{#d91a1a}-2.78\%$
test_membership_stacked_nested_last 24.3500μs 8.2427μs 121.3189 KOps/s 329.1993 KOps/s $\textbf{\color{#d91a1a}-63.15\%}$
test_membership_stacked_nested_leaf_last 43.4510μs 8.1897μs 122.1045 KOps/s 329.0007 KOps/s $\textbf{\color{#d91a1a}-62.89\%}$
test_nested_getleaf 29.6110μs 6.0516μs 165.2464 KOps/s 166.0483 KOps/s $\color{#d91a1a}-0.48\%$
test_nested_get 31.6510μs 5.7315μs 174.4741 KOps/s 172.6332 KOps/s $\color{#35bf28}+1.07\%$
test_stacked_getleaf 42.3410μs 6.0230μs 166.0312 KOps/s 165.2533 KOps/s $\color{#35bf28}+0.47\%$
test_stacked_get 25.8510μs 5.6198μs 177.9422 KOps/s 175.8287 KOps/s $\color{#35bf28}+1.20\%$
test_nested_getitemleaf 30.5510μs 6.1060μs 163.7724 KOps/s 164.5748 KOps/s $\color{#d91a1a}-0.49\%$
test_nested_getitem 24.6000μs 5.6711μs 176.3318 KOps/s 174.8952 KOps/s $\color{#35bf28}+0.82\%$
test_stacked_getitemleaf 39.1600μs 6.0528μs 165.2126 KOps/s 162.4167 KOps/s $\color{#35bf28}+1.72\%$
test_stacked_getitem 34.5500μs 5.8066μs 172.2188 KOps/s 175.2374 KOps/s $\color{#d91a1a}-1.72\%$
test_lock_nested 7.1462ms 0.4329ms 2.3099 KOps/s 2.3223 KOps/s $\color{#d91a1a}-0.53\%$
test_lock_stack_nested 0.4123ms 0.3811ms 2.6242 KOps/s 2.5195 KOps/s $\color{#35bf28}+4.16\%$
test_unlock_nested 0.7822ms 0.3664ms 2.7293 KOps/s 2.7308 KOps/s $\color{#d91a1a}-0.05\%$
test_unlock_stack_nested 0.3675ms 0.3202ms 3.1233 KOps/s 2.9926 KOps/s $\color{#35bf28}+4.37\%$
test_flatten_speed 0.1548ms 76.9949μs 12.9879 KOps/s 13.0903 KOps/s $\color{#d91a1a}-0.78\%$
test_unflatten_speed 0.3680ms 0.3197ms 3.1281 KOps/s 3.1286 KOps/s $\color{#d91a1a}-0.01\%$
test_common_ops 1.6139ms 1.2629ms 791.8138 Ops/s 757.6501 Ops/s $\color{#35bf28}+4.51\%$
test_creation 22.1100μs 1.4897μs 671.2657 KOps/s 683.5305 KOps/s $\color{#d91a1a}-1.79\%$
test_creation_empty 45.1210μs 14.5421μs 68.7659 KOps/s 55.8315 KOps/s $\textbf{\color{#35bf28}+23.17\%}$
test_creation_nested_1 50.9210μs 16.4984μs 60.6119 KOps/s 49.8879 KOps/s $\textbf{\color{#35bf28}+21.50\%}$
test_creation_nested_2 53.9900μs 18.9024μs 52.9034 KOps/s 44.3848 KOps/s $\textbf{\color{#35bf28}+19.19\%}$
test_clone 66.4210μs 30.0467μs 33.2816 KOps/s 33.3083 KOps/s $\color{#d91a1a}-0.08\%$
test_getitem[int] 1.2605ms 16.3142μs 61.2963 KOps/s 61.3869 KOps/s $\color{#d91a1a}-0.15\%$
test_getitem[slice_int] 0.1213ms 28.5819μs 34.9871 KOps/s 35.1150 KOps/s $\color{#d91a1a}-0.36\%$
test_getitem[range] 0.1860ms 0.1108ms 9.0247 KOps/s 9.0323 KOps/s $\color{#d91a1a}-0.08\%$
test_getitem[tuple] 0.1234ms 23.9270μs 41.7938 KOps/s 40.7451 KOps/s $\color{#35bf28}+2.57\%$
test_getitem[list] 0.1865ms 0.1002ms 9.9796 KOps/s 9.9422 KOps/s $\color{#35bf28}+0.38\%$
test_setitem_dim[int] 89.2920μs 45.4683μs 21.9933 KOps/s 21.9840 KOps/s $\color{#35bf28}+0.04\%$
test_setitem_dim[slice_int] 0.1033ms 68.2648μs 14.6488 KOps/s 14.6061 KOps/s $\color{#35bf28}+0.29\%$
test_setitem_dim[range] 0.1602ms 0.1284ms 7.7859 KOps/s 7.7281 KOps/s $\color{#35bf28}+0.75\%$
test_setitem_dim[tuple] 93.3910μs 61.7338μs 16.1986 KOps/s 16.0713 KOps/s $\color{#35bf28}+0.79\%$
test_setitem 78.9610μs 42.4067μs 23.5812 KOps/s 22.7862 KOps/s $\color{#35bf28}+3.49\%$
test_set 89.9910μs 41.4032μs 24.1527 KOps/s 23.0918 KOps/s $\color{#35bf28}+4.59\%$
test_set_shared 0.4010ms 55.0592μs 18.1623 KOps/s 18.2268 KOps/s $\color{#d91a1a}-0.35\%$
test_update 91.1220μs 50.3353μs 19.8668 KOps/s 18.6284 KOps/s $\textbf{\color{#35bf28}+6.65\%}$
test_update_nested 0.1066ms 58.9231μs 16.9713 KOps/s 16.3831 KOps/s $\color{#35bf28}+3.59\%$
test_update__nested 0.1487ms 63.3925μs 15.7747 KOps/s 16.4289 KOps/s $\color{#d91a1a}-3.98\%$
test_set_nested 74.3220μs 44.2722μs 22.5875 KOps/s 21.5994 KOps/s $\color{#35bf28}+4.57\%$
test_set_nested_new 0.4075ms 47.6641μs 20.9802 KOps/s 20.0813 KOps/s $\color{#35bf28}+4.48\%$
test_select 0.1068ms 61.4115μs 16.2836 KOps/s 15.9014 KOps/s $\color{#35bf28}+2.40\%$
test_select_nested 71.2010μs 41.2023μs 24.2705 KOps/s 23.7613 KOps/s $\color{#35bf28}+2.14\%$
test_exclude_nested 90.5210μs 58.7287μs 17.0274 KOps/s 16.8953 KOps/s $\color{#35bf28}+0.78\%$
test_empty[True] 0.3191ms 0.2564ms 3.9003 KOps/s 3.8738 KOps/s $\color{#35bf28}+0.68\%$
test_empty[False] 2.8071μs 0.7355μs 1.3597 MOps/s 1.3595 MOps/s $\color{#35bf28}+0.01\%$
test_to 52.9210μs 26.6786μs 37.4832 KOps/s 36.1873 KOps/s $\color{#35bf28}+3.58\%$
test_to_nonblocking 58.9810μs 24.7596μs 40.3883 KOps/s 38.6640 KOps/s $\color{#35bf28}+4.46\%$
test_unbind_speed 0.3332ms 0.2787ms 3.5878 KOps/s 3.5193 KOps/s $\color{#35bf28}+1.95\%$
test_unbind_speed_stack0 0.3225ms 0.2692ms 3.7142 KOps/s 3.5516 KOps/s $\color{#35bf28}+4.58\%$
test_unbind_speed_stack1 92.0484ms 0.6932ms 1.4427 KOps/s 1.5352 KOps/s $\textbf{\color{#d91a1a}-6.03\%}$
test_split 93.5504ms 2.2078ms 452.9438 Ops/s 443.6564 Ops/s $\color{#35bf28}+2.09\%$
test_chunk 94.9659ms 2.2159ms 451.2744 Ops/s 446.1645 Ops/s $\color{#35bf28}+1.15\%$
test_creation[device0] 0.3358ms 0.1262ms 7.9258 KOps/s 7.7639 KOps/s $\color{#35bf28}+2.09\%$
test_creation_from_tensor 0.3984ms 0.1284ms 7.7862 KOps/s 7.6683 KOps/s $\color{#35bf28}+1.54\%$
test_add_one[memmap_tensor0] 0.2391ms 9.1886μs 108.8304 KOps/s 106.9946 KOps/s $\color{#35bf28}+1.72\%$
test_contiguous[memmap_tensor0] 21.5400μs 2.2190μs 450.6445 KOps/s 449.0825 KOps/s $\color{#35bf28}+0.35\%$
test_stack[memmap_tensor0] 32.3900μs 6.8823μs 145.3002 KOps/s 143.2144 KOps/s $\color{#35bf28}+1.46\%$
test_memmaptd_index 1.0533ms 0.4250ms 2.3530 KOps/s 2.2570 KOps/s $\color{#35bf28}+4.25\%$
test_memmaptd_index_astensor 0.7468ms 0.5008ms 1.9966 KOps/s 1.9495 KOps/s $\color{#35bf28}+2.42\%$
test_memmaptd_index_op 1.4089ms 1.0277ms 973.0462 Ops/s 905.3028 Ops/s $\textbf{\color{#35bf28}+7.48\%}$
test_serialize_model 0.1308s 0.1299s 7.6960 Ops/s 7.6821 Ops/s $\color{#35bf28}+0.18\%$
test_serialize_model_pickle 1.3463s 1.2171s 0.8216 Ops/s 0.8248 Ops/s $\color{#d91a1a}-0.39\%$
test_serialize_weights 0.1314s 0.1297s 7.7081 Ops/s 6.9946 Ops/s $\textbf{\color{#35bf28}+10.20\%}$
test_serialize_weights_returnearly 0.2311s 56.7324ms 17.6266 Ops/s 17.9469 Ops/s $\color{#d91a1a}-1.78\%$
test_serialize_weights_pickle 1.3668s 1.2167s 0.8219 Ops/s 0.8214 Ops/s $\color{#35bf28}+0.06\%$
test_reshape_pytree 72.6510μs 35.3189μs 28.3134 KOps/s 27.4564 KOps/s $\color{#35bf28}+3.12\%$
test_reshape_td 81.2310μs 45.8351μs 21.8174 KOps/s 23.9086 KOps/s $\textbf{\color{#d91a1a}-8.75\%}$
test_view_pytree 68.4310μs 34.8057μs 28.7309 KOps/s 27.5791 KOps/s $\color{#35bf28}+4.18\%$
test_view_td 90.6010μs 45.3286μs 22.0611 KOps/s 21.1635 KOps/s $\color{#35bf28}+4.24\%$
test_unbind_pytree 70.0410μs 34.5904μs 28.9098 KOps/s 28.4189 KOps/s $\color{#35bf28}+1.73\%$
test_unbind_td 0.5326ms 44.2047μs 22.6220 KOps/s 22.9638 KOps/s $\color{#d91a1a}-1.49\%$
test_split_pytree 0.5185ms 47.1370μs 21.2148 KOps/s 21.2514 KOps/s $\color{#d91a1a}-0.17\%$
test_split_td 0.1459ms 55.2402μs 18.1028 KOps/s 15.3319 KOps/s $\textbf{\color{#35bf28}+18.07\%}$
test_add_pytree 99.6520μs 58.2643μs 17.1632 KOps/s 17.0711 KOps/s $\color{#35bf28}+0.54\%$
test_add_td 0.1285ms 91.9137μs 10.8798 KOps/s 10.3932 KOps/s $\color{#35bf28}+4.68\%$
test_compile_add_one_nested[tensordict-compile] 0.2096ms 0.1609ms 6.2161 KOps/s 6.1047 KOps/s $\color{#35bf28}+1.82\%$
test_compile_add_one_nested[tensordict-eager] 0.2842ms 0.1620ms 6.1720 KOps/s 5.9918 KOps/s $\color{#35bf28}+3.01\%$
test_compile_add_one_nested[pytree-compile] 0.1816ms 0.1447ms 6.9127 KOps/s 6.9363 KOps/s $\color{#d91a1a}-0.34\%$
test_compile_add_one_nested[pytree-eager] 0.2817ms 0.1996ms 5.0102 KOps/s 5.3219 KOps/s $\textbf{\color{#d91a1a}-5.86\%}$
test_compile_copy_nested[tensordict-compile] 58.1710μs 21.3182μs 46.9083 KOps/s 45.4539 KOps/s $\color{#35bf28}+3.20\%$
test_compile_copy_nested[tensordict-eager] 92.9820μs 49.3580μs 20.2601 KOps/s 19.9101 KOps/s $\color{#35bf28}+1.76\%$
test_compile_copy_nested[pytree-compile] 0.1052ms 64.1454μs 15.5896 KOps/s 15.3941 KOps/s $\color{#35bf28}+1.27\%$
test_compile_copy_nested[pytree-eager] 97.4420μs 49.1061μs 20.3641 KOps/s 20.1536 KOps/s $\color{#35bf28}+1.04\%$
test_compile_add_one_flat[tensordict-compile] 0.4185ms 0.3191ms 3.1337 KOps/s 3.0996 KOps/s $\color{#35bf28}+1.10\%$
test_compile_add_one_flat[tensordict-eager] 0.3461ms 0.2327ms 4.2977 KOps/s 4.2007 KOps/s $\color{#35bf28}+2.31\%$
test_compile_add_one_flat[tensorclass-compile] 0.2197ms 0.1333ms 7.5005 KOps/s 7.7493 KOps/s $\color{#d91a1a}-3.21\%$
test_compile_add_one_flat[tensorclass-eager] 0.4446ms 66.4580μs 15.0471 KOps/s 15.0286 KOps/s $\color{#35bf28}+0.12\%$
test_compile_add_one_flat[pytree-compile] 0.7128ms 0.3172ms 3.1521 KOps/s 3.1253 KOps/s $\color{#35bf28}+0.86\%$
test_compile_add_one_flat[pytree-eager] 1.0680ms 0.6582ms 1.5193 KOps/s 1.5359 KOps/s $\color{#d91a1a}-1.08\%$
test_compile_add_self_flat[tensordict-eager] 0.6667ms 0.2830ms 3.5335 KOps/s 3.4806 KOps/s $\color{#35bf28}+1.52\%$
test_compile_add_self_flat[tensordict-compile] 0.3692ms 0.3221ms 3.1050 KOps/s 3.1024 KOps/s $\color{#35bf28}+0.09\%$
test_compile_add_self_flat[tensorclass-eager] 0.4806ms 77.9840μs 12.8231 KOps/s 12.8206 KOps/s $\color{#35bf28}+0.02\%$
test_compile_add_self_flat[tensorclass-compile] 0.1736ms 0.1288ms 7.7661 KOps/s 7.7400 KOps/s $\color{#35bf28}+0.34\%$
test_compile_add_self_flat[pytree-eager] 0.7105ms 0.5352ms 1.8685 KOps/s 1.8617 KOps/s $\color{#35bf28}+0.37\%$
test_compile_add_self_flat[pytree-compile] 0.3724ms 0.3169ms 3.1554 KOps/s 3.1100 KOps/s $\color{#35bf28}+1.46\%$
test_compile_copy_flat[tensordict-compile] 0.3922ms 19.7785μs 50.5600 KOps/s 51.0156 KOps/s $\color{#d91a1a}-0.89\%$
test_compile_copy_flat[tensordict-eager] 0.4457ms 38.9860μs 25.6503 KOps/s 26.0731 KOps/s $\color{#d91a1a}-1.62\%$
test_compile_copy_flat[pytree-compile] 0.1062ms 69.9544μs 14.2950 KOps/s 13.9909 KOps/s $\color{#35bf28}+2.17\%$
test_compile_copy_flat[pytree-eager] 0.4345ms 50.4908μs 19.8056 KOps/s 19.1006 KOps/s $\color{#35bf28}+3.69\%$
test_compile_assign_and_add[tensordict-compile] 2.3741ms 0.7884ms 1.2684 KOps/s 1.1093 KOps/s $\textbf{\color{#35bf28}+14.34\%}$
test_compile_assign_and_add[tensordict-eager] 3.4337ms 3.3082ms 302.2807 Ops/s 303.4035 Ops/s $\color{#d91a1a}-0.37\%$
test_compile_assign_and_add[pytree-compile] 2.3172ms 0.8156ms 1.2260 KOps/s 1.1134 KOps/s $\textbf{\color{#35bf28}+10.11\%}$
test_compile_assign_and_add[pytree-eager] 3.4107ms 3.3296ms 300.3337 Ops/s 290.8390 Ops/s $\color{#35bf28}+3.26\%$
test_compile_indexing[tensor-tensordict-compile] 0.1491ms 0.1096ms 9.1269 KOps/s 9.2338 KOps/s $\color{#d91a1a}-1.16\%$
test_compile_indexing[tensor-tensordict-eager] 0.1919ms 62.4636μs 16.0093 KOps/s 15.3639 KOps/s $\color{#35bf28}+4.20\%$
test_compile_indexing[tensor-tensorclass-compile] 0.2031ms 0.1058ms 9.4517 KOps/s 9.6700 KOps/s $\color{#d91a1a}-2.26\%$
test_compile_indexing[tensor-tensorclass-eager] 92.1010μs 45.2999μs 22.0751 KOps/s 22.0808 KOps/s $\color{#d91a1a}-0.03\%$
test_compile_indexing[tensor-pytree-compile] 0.1464ms 0.1046ms 9.5597 KOps/s 9.5154 KOps/s $\color{#35bf28}+0.47\%$
test_compile_indexing[tensor-pytree-eager] 83.8110μs 44.1332μs 22.6587 KOps/s 22.4503 KOps/s $\color{#35bf28}+0.93\%$
test_compile_indexing[slice-tensordict-compile] 0.2047ms 0.1382ms 7.2345 KOps/s 7.2289 KOps/s $\color{#35bf28}+0.08\%$
test_compile_indexing[slice-tensordict-eager] 0.1638ms 25.5013μs 39.2137 KOps/s 38.3334 KOps/s $\color{#35bf28}+2.30\%$
test_compile_indexing[slice-tensorclass-compile] 0.1770ms 0.1311ms 7.6264 KOps/s 7.6109 KOps/s $\color{#35bf28}+0.20\%$
test_compile_indexing[slice-tensorclass-eager] 65.4610μs 20.7549μs 48.1814 KOps/s 46.6226 KOps/s $\color{#35bf28}+3.34\%$
test_compile_indexing[slice-pytree-compile] 0.1908ms 0.1322ms 7.5651 KOps/s 7.5014 KOps/s $\color{#35bf28}+0.85\%$
test_compile_indexing[slice-pytree-eager] 48.7310μs 20.6003μs 48.5430 KOps/s 46.7076 KOps/s $\color{#35bf28}+3.93\%$
test_compile_indexing[int-tensordict-compile] 0.1821ms 0.1382ms 7.2335 KOps/s 7.2432 KOps/s $\color{#d91a1a}-0.13\%$
test_compile_indexing[int-tensordict-eager] 0.4749ms 24.6803μs 40.5181 KOps/s 38.6141 KOps/s $\color{#35bf28}+4.93\%$
test_compile_indexing[int-tensorclass-compile] 0.2439ms 0.1315ms 7.6041 KOps/s 7.5284 KOps/s $\color{#35bf28}+1.01\%$
test_compile_indexing[int-tensorclass-eager] 64.2410μs 20.5030μs 48.7733 KOps/s 47.1040 KOps/s $\color{#35bf28}+3.54\%$
test_compile_indexing[int-pytree-compile] 0.2173ms 0.1318ms 7.5860 KOps/s 7.5510 KOps/s $\color{#35bf28}+0.46\%$
test_compile_indexing[int-pytree-eager] 52.7510μs 20.7991μs 48.0791 KOps/s 46.7910 KOps/s $\color{#35bf28}+2.75\%$
test_mod_add[eager] 81.1820μs 33.3429μs 29.9914 KOps/s 28.0748 KOps/s $\textbf{\color{#35bf28}+6.83\%}$
test_mod_add[compile] 0.3578ms 72.9553μs 13.7070 KOps/s 14.2567 KOps/s $\color{#d91a1a}-3.86\%$
test_mod_add[compile-overhead] 0.2598ms 0.1347ms 7.4261 KOps/s 6.9647 KOps/s $\textbf{\color{#35bf28}+6.63\%}$
test_mod_wrap[eager] 0.3321ms 0.2568ms 3.8939 KOps/s 4.0329 KOps/s $\color{#d91a1a}-3.45\%$
test_mod_wrap[compile] 1.3971ms 0.2965ms 3.3724 KOps/s 3.3106 KOps/s $\color{#35bf28}+1.87\%$
test_mod_wrap[compile-overhead] 7.7252ms 4.1065ms 243.5143 Ops/s 244.0560 Ops/s $\color{#d91a1a}-0.22\%$
test_mod_wrap_and_backward[eager] 1.4948ms 1.3548ms 738.1356 Ops/s 733.5405 Ops/s $\color{#35bf28}+0.63\%$
test_mod_wrap_and_backward[compile] 1.5772ms 1.3343ms 749.4376 Ops/s 683.0865 Ops/s $\textbf{\color{#35bf28}+9.71\%}$
test_mod_wrap_and_backward[compile-overhead] 1.3353ms 0.9022ms 1.1085 KOps/s 1.0115 KOps/s $\textbf{\color{#35bf28}+9.58\%}$
test_seq_add[eager] 0.1653ms 99.0958μs 10.0912 KOps/s 9.1322 KOps/s $\textbf{\color{#35bf28}+10.50\%}$
test_seq_add[compile] 0.1342ms 83.2116μs 12.0176 KOps/s 12.5198 KOps/s $\color{#d91a1a}-4.01\%$
test_seq_add[compile-overhead] 0.1686ms 0.1141ms 8.7673 KOps/s 8.4875 KOps/s $\color{#35bf28}+3.30\%$
test_seq_wrap[eager] 0.5163ms 0.3791ms 2.6375 KOps/s 2.3773 KOps/s $\textbf{\color{#35bf28}+10.94\%}$
test_seq_wrap[compile] 0.3820ms 0.3152ms 3.1725 KOps/s 3.0146 KOps/s $\textbf{\color{#35bf28}+5.24\%}$
test_seq_wrap[compile-overhead] 0.2642ms 0.2182ms 4.5823 KOps/s 4.4950 KOps/s $\color{#35bf28}+1.94\%$
test_func_call_runtime[False-eager] 0.8659ms 0.7458ms 1.3408 KOps/s 1.3164 KOps/s $\color{#35bf28}+1.85\%$
test_func_call_runtime[False-compile] 0.8578ms 0.7920ms 1.2626 KOps/s 1.2479 KOps/s $\color{#35bf28}+1.18\%$
test_func_call_runtime[False-compile-overhead] 0.4133ms 0.3593ms 2.7833 KOps/s 2.7539 KOps/s $\color{#35bf28}+1.07\%$
test_func_call_runtime[True-eager] 1.0993ms 0.9092ms 1.0999 KOps/s 1.0866 KOps/s $\color{#35bf28}+1.22\%$
test_func_call_runtime[True-compile] 0.9489ms 0.8178ms 1.2228 KOps/s 1.2159 KOps/s $\color{#35bf28}+0.56\%$
test_func_call_runtime[True-compile-overhead] 0.4838ms 0.3850ms 2.5976 KOps/s 2.6045 KOps/s $\color{#d91a1a}-0.27\%$
test_func_call_cm_runtime[False-eager] 0.8133ms 0.7429ms 1.3460 KOps/s 1.2971 KOps/s $\color{#35bf28}+3.78\%$
test_func_call_cm_runtime[False-compile] 0.8555ms 0.7976ms 1.2538 KOps/s 1.2431 KOps/s $\color{#35bf28}+0.86\%$
test_func_call_cm_runtime[False-compile-overhead] 0.4230ms 0.3631ms 2.7538 KOps/s 2.7484 KOps/s $\color{#35bf28}+0.20\%$
test_func_call_cm_runtime[True-eager] 1.1239ms 1.0207ms 979.7085 Ops/s 980.0486 Ops/s $\color{#d91a1a}-0.03\%$
test_func_call_cm_runtime[True-compile] 0.9657ms 0.8464ms 1.1815 KOps/s 1.1758 KOps/s $\color{#35bf28}+0.49\%$
test_func_call_cm_runtime[True-compile-overhead] 0.4872ms 0.4056ms 2.4655 KOps/s 2.4263 KOps/s $\color{#35bf28}+1.62\%$
test_vmap_func_call_cm_runtime[eager] 2.5454ms 2.0973ms 476.8108 Ops/s 475.6292 Ops/s $\color{#35bf28}+0.25\%$
test_vmap_func_call_cm_runtime[compile] 0.9585ms 0.8593ms 1.1638 KOps/s 1.1474 KOps/s $\color{#35bf28}+1.42\%$
test_vmap_func_call_cm_runtime[compile-overhead] 0.4615ms 0.4082ms 2.4496 KOps/s 2.4100 KOps/s $\color{#35bf28}+1.64\%$
test_distributed 0.4904ms 0.1189ms 8.4124 KOps/s 8.8426 KOps/s $\color{#d91a1a}-4.86\%$
test_tdmodule 0.3131ms 15.0145μs 66.6023 KOps/s 61.0156 KOps/s $\textbf{\color{#35bf28}+9.16\%}$
test_tdmodule_dispatch 57.7110μs 28.1982μs 35.4632 KOps/s 31.7539 KOps/s $\textbf{\color{#35bf28}+11.68\%}$
test_tdseq 27.2900μs 15.5951μs 64.1227 KOps/s 58.4824 KOps/s $\textbf{\color{#35bf28}+9.64\%}$
test_tdseq_dispatch 51.2010μs 30.3309μs 32.9697 KOps/s 28.9538 KOps/s $\textbf{\color{#35bf28}+13.87\%}$
test_instantiation_functorch 2.1198ms 1.8635ms 536.6153 Ops/s 534.3706 Ops/s $\color{#35bf28}+0.42\%$
test_instantiation_td 1.8217ms 1.2047ms 830.0811 Ops/s 829.2152 Ops/s $\color{#35bf28}+0.10\%$
test_exec_functorch 0.2443ms 0.2089ms 4.7881 KOps/s 4.7572 KOps/s $\color{#35bf28}+0.65\%$
test_exec_functional_call 0.2560ms 0.2109ms 4.7426 KOps/s 4.6891 KOps/s $\color{#35bf28}+1.14\%$
test_exec_td 0.2736ms 0.2335ms 4.2826 KOps/s 4.3521 KOps/s $\color{#d91a1a}-1.60\%$
test_exec_td_decorator 1.0959ms 0.2644ms 3.7824 KOps/s 3.7418 KOps/s $\color{#35bf28}+1.09\%$
test_vmap_mlp_speed[True-True] 0.7988ms 0.7105ms 1.4075 KOps/s 1.4110 KOps/s $\color{#d91a1a}-0.25\%$
test_vmap_mlp_speed[True-False] 0.7763ms 0.7012ms 1.4262 KOps/s 1.4022 KOps/s $\color{#35bf28}+1.71\%$
test_vmap_mlp_speed[False-True] 0.7470ms 0.5980ms 1.6723 KOps/s 1.6278 KOps/s $\color{#35bf28}+2.74\%$
test_vmap_mlp_speed[False-False] 0.6470ms 0.5972ms 1.6744 KOps/s 1.6647 KOps/s $\color{#35bf28}+0.58\%$
test_vmap_mlp_speed_decorator[True-True] 1.3678ms 0.6961ms 1.4366 KOps/s 1.4620 KOps/s $\color{#d91a1a}-1.74\%$
test_vmap_mlp_speed_decorator[True-False] 0.8177ms 0.6773ms 1.4764 KOps/s 1.4572 KOps/s $\color{#35bf28}+1.32\%$
test_vmap_mlp_speed_decorator[False-True] 0.7163ms 0.5986ms 1.6706 KOps/s 1.6650 KOps/s $\color{#35bf28}+0.34\%$
test_vmap_mlp_speed_decorator[False-False] 0.6887ms 0.5999ms 1.6670 KOps/s 1.6619 KOps/s $\color{#35bf28}+0.31\%$
test_vmap_transformer_speed[True-True] 8.5366ms 8.4730ms 118.0221 Ops/s 116.5190 Ops/s $\color{#35bf28}+1.29\%$
test_vmap_transformer_speed[True-False] 8.5256ms 8.4863ms 117.8371 Ops/s 116.9479 Ops/s $\color{#35bf28}+0.76\%$
test_vmap_transformer_speed[False-True] 8.3430ms 8.3123ms 120.3039 Ops/s 119.7364 Ops/s $\color{#35bf28}+0.47\%$
test_vmap_transformer_speed[False-False] 8.6201ms 8.2561ms 121.1229 Ops/s 119.3864 Ops/s $\color{#35bf28}+1.45\%$
test_vmap_transformer_speed_decorator[True-True] 19.6268ms 19.5520ms 51.1456 Ops/s 50.6310 Ops/s $\color{#35bf28}+1.02\%$
test_vmap_transformer_speed_decorator[True-False] 19.6526ms 19.5758ms 51.0835 Ops/s 51.0581 Ops/s $\color{#35bf28}+0.05\%$
test_vmap_transformer_speed_decorator[False-True] 19.4851ms 19.3891ms 51.5753 Ops/s 51.0620 Ops/s $\color{#35bf28}+1.01\%$
test_vmap_transformer_speed_decorator[False-False] 19.4639ms 19.4059ms 51.5307 Ops/s 51.1536 Ops/s $\color{#35bf28}+0.74\%$
test_to_module_speed[True] 1.2904ms 1.0038ms 996.2262 Ops/s 994.2483 Ops/s $\color{#35bf28}+0.20\%$
test_to_module_speed[False] 1.3606ms 0.9647ms 1.0366 KOps/s 1.0325 KOps/s $\color{#35bf28}+0.39\%$
test_tc_init 63.3610μs 34.4958μs 28.9890 KOps/s 27.7084 KOps/s $\color{#35bf28}+4.62\%$
test_tc_init_nested 0.1061ms 72.6990μs 13.7554 KOps/s 13.6803 KOps/s $\color{#35bf28}+0.55\%$
test_tc_first_layer_tensor 5.3557μs 0.6731μs 1.4857 MOps/s 1.4782 MOps/s $\color{#35bf28}+0.51\%$
test_tc_first_layer_nontensor 40.2810μs 2.2567μs 443.1256 KOps/s 442.0080 KOps/s $\color{#35bf28}+0.25\%$
test_tc_second_layer_tensor 29.9280μs 1.3649μs 732.6379 KOps/s 727.4051 KOps/s $\color{#35bf28}+0.72\%$
test_tc_second_layer_nontensor 38.2600μs 2.9412μs 339.9958 KOps/s 341.2498 KOps/s $\color{#d91a1a}-0.37\%$
test_unbind 0.1861s 12.0224ms 83.1784 Ops/s 94.9778 Ops/s $\textbf{\color{#d91a1a}-12.42\%}$
test_full_like 0.6562ms 0.5751ms 1.7389 KOps/s 1.7428 KOps/s $\color{#d91a1a}-0.22\%$
test_zeros_like 0.2769ms 0.1978ms 5.0543 KOps/s 5.0524 KOps/s $\color{#35bf28}+0.04\%$
test_ones_like 0.2387ms 0.1978ms 5.0567 KOps/s 5.0573 KOps/s $\color{#d91a1a}-0.01\%$
test_clone 0.4423ms 0.4144ms 2.4129 KOps/s 2.4184 KOps/s $\color{#d91a1a}-0.23\%$
test_squeeze 43.2910μs 9.9189μs 100.8180 KOps/s 100.4573 KOps/s $\color{#35bf28}+0.36\%$
test_unsqueeze 0.2228ms 73.6114μs 13.5848 KOps/s 13.0669 KOps/s $\color{#35bf28}+3.96\%$
test_split 0.4128ms 0.1599ms 6.2548 KOps/s 6.2272 KOps/s $\color{#35bf28}+0.44\%$
test_permute 0.2231ms 0.1770ms 5.6492 KOps/s 5.4949 KOps/s $\color{#35bf28}+2.81\%$
test_stack 1.2565ms 0.8573ms 1.1665 KOps/s 1.1675 KOps/s $\color{#d91a1a}-0.08\%$
test_cat 1.2867ms 1.2312ms 812.1911 Ops/s 811.5912 Ops/s $\color{#35bf28}+0.07\%$

[ghstack-poisoned]
vmoens added a commit that referenced this pull request Oct 4, 2024
ghstack-source-id: fa65ff98d46363f84e4fac1cf306b59cd8f7e555
Pull Request resolved: #1001
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Oct 8, 2024
ghstack-source-id: db7893b09e6838506d1b798a07c767743651925c
Pull Request resolved: #1001
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Oct 8, 2024
ghstack-source-id: db7e19e0fc06b7137a769f03f6337836179883d5
Pull Request resolved: #1001
[ghstack-poisoned]
vmoens added a commit that referenced this pull request Oct 8, 2024
ghstack-source-id: 9f9ce070d8726c74fcf5a22e0edd05b8c9fd7e19
Pull Request resolved: #1001
@vmoens vmoens merged commit 48c2d74 into gh/vmoens/17/base Oct 8, 2024
18 of 38 checks passed
@vmoens vmoens deleted the gh/vmoens/17/head branch October 8, 2024 10:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Refactor Refactoring code - not a new feature
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants