-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Deprecation] Act warned deprecations for v0.6 #1001
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Sep 20, 2024
ghstack-source-id: 74fd1c82528ab2e0eaffa9d2c18821c7eee4e7fc Pull Request resolved: #1001
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Sep 20, 2024
vmoens
added a commit
that referenced
this pull request
Sep 20, 2024
ghstack-source-id: f8d2312e8bd6a32e482864b86f8c1e3afcd9b888 Pull Request resolved: #1001
vmoens
added a commit
that referenced
this pull request
Sep 20, 2024
ghstack-source-id: d67b08c3f2929c59ab891c23ac9c371e96b6f336 Pull Request resolved: #1001
vmoens
added a commit
that referenced
this pull request
Sep 20, 2024
ghstack-source-id: d76b7dc305c5a754c6567da6cc11e7793cb8ab52 Pull Request resolved: #1001
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 0.1184ms | 16.2999μs | 61.3499 KOps/s | 55.9595 KOps/s | |
test_plain_set_stack_nested | 49.2910μs | 16.3818μs | 61.0435 KOps/s | 55.2117 KOps/s | |
test_plain_set_nested_inplace | 44.5210μs | 17.5079μs | 57.1171 KOps/s | 52.0220 KOps/s | |
test_plain_set_stack_nested_inplace | 45.2510μs | 17.2881μs | 57.8432 KOps/s | 52.5629 KOps/s | |
test_items | 29.7900μs | 2.8577μs | 349.9359 KOps/s | 344.6733 KOps/s | |
test_items_nested | 0.3679ms | 0.3388ms | 2.9519 KOps/s | 2.9649 KOps/s | |
test_items_nested_locked | 0.4032ms | 0.3408ms | 2.9342 KOps/s | 2.9165 KOps/s | |
test_items_nested_leaf | 87.7310μs | 62.4694μs | 16.0078 KOps/s | 16.0777 KOps/s | |
test_items_stack_nested | 0.3792ms | 0.3436ms | 2.9104 KOps/s | 2.8675 KOps/s | |
test_items_stack_nested_leaf | 92.4110μs | 63.4655μs | 15.7566 KOps/s | 15.6693 KOps/s | |
test_items_stack_nested_locked | 0.3777ms | 0.3489ms | 2.8661 KOps/s | 2.8866 KOps/s | |
test_keys | 28.0400μs | 3.4378μs | 290.8855 KOps/s | 291.4667 KOps/s | |
test_keys_nested | 0.1048ms | 71.0992μs | 14.0649 KOps/s | 14.1808 KOps/s | |
test_keys_nested_locked | 0.8073ms | 76.8793μs | 13.0074 KOps/s | 12.9490 KOps/s | |
test_keys_nested_leaf | 98.8210μs | 60.3736μs | 16.5635 KOps/s | 16.1780 KOps/s | |
test_keys_stack_nested | 98.5020μs | 70.2087μs | 14.2433 KOps/s | 13.8408 KOps/s | |
test_keys_stack_nested_leaf | 91.8620μs | 62.4119μs | 16.0226 KOps/s | 15.5802 KOps/s | |
test_keys_stack_nested_locked | 0.1094ms | 76.1771μs | 13.1273 KOps/s | 12.8825 KOps/s | |
test_values | 5.2483μs | 0.8365μs | 1.1955 MOps/s | 1.1442 MOps/s | |
test_values_nested | 77.9310μs | 49.3440μs | 20.2659 KOps/s | 20.5057 KOps/s | |
test_values_nested_locked | 83.8220μs | 51.0619μs | 19.5841 KOps/s | 19.7471 KOps/s | |
test_values_nested_leaf | 70.3420μs | 43.0964μs | 23.2038 KOps/s | 23.4039 KOps/s | |
test_values_stack_nested | 83.4510μs | 49.6737μs | 20.1314 KOps/s | 20.3392 KOps/s | |
test_values_stack_nested_leaf | 71.3010μs | 43.4520μs | 23.0139 KOps/s | 22.8910 KOps/s | |
test_values_stack_nested_locked | 79.2820μs | 51.7263μs | 19.3325 KOps/s | 19.2118 KOps/s | |
test_membership | 2.0015μs | 0.5004μs | 1.9982 MOps/s | 1.9897 MOps/s | |
test_membership_nested | 26.6255μs | 1.8695μs | 534.8893 KOps/s | 535.3166 KOps/s | |
test_membership_nested_leaf | 12.6250μs | 1.8960μs | 527.4234 KOps/s | 529.2537 KOps/s | |
test_membership_stacked_nested | 31.5910μs | 1.9132μs | 522.6875 KOps/s | 519.8066 KOps/s | |
test_membership_stacked_nested_leaf | 23.4800μs | 1.9269μs | 518.9637 KOps/s | 514.2188 KOps/s | |
test_membership_nested_last | 25.8110μs | 3.0063μs | 332.6360 KOps/s | 332.6895 KOps/s | |
test_membership_nested_leaf_last | 31.2010μs | 3.0265μs | 330.4146 KOps/s | 339.8538 KOps/s | |
test_membership_stacked_nested_last | 24.3500μs | 8.2427μs | 121.3189 KOps/s | 329.1993 KOps/s | |
test_membership_stacked_nested_leaf_last | 43.4510μs | 8.1897μs | 122.1045 KOps/s | 329.0007 KOps/s | |
test_nested_getleaf | 29.6110μs | 6.0516μs | 165.2464 KOps/s | 166.0483 KOps/s | |
test_nested_get | 31.6510μs | 5.7315μs | 174.4741 KOps/s | 172.6332 KOps/s | |
test_stacked_getleaf | 42.3410μs | 6.0230μs | 166.0312 KOps/s | 165.2533 KOps/s | |
test_stacked_get | 25.8510μs | 5.6198μs | 177.9422 KOps/s | 175.8287 KOps/s | |
test_nested_getitemleaf | 30.5510μs | 6.1060μs | 163.7724 KOps/s | 164.5748 KOps/s | |
test_nested_getitem | 24.6000μs | 5.6711μs | 176.3318 KOps/s | 174.8952 KOps/s | |
test_stacked_getitemleaf | 39.1600μs | 6.0528μs | 165.2126 KOps/s | 162.4167 KOps/s | |
test_stacked_getitem | 34.5500μs | 5.8066μs | 172.2188 KOps/s | 175.2374 KOps/s | |
test_lock_nested | 7.1462ms | 0.4329ms | 2.3099 KOps/s | 2.3223 KOps/s | |
test_lock_stack_nested | 0.4123ms | 0.3811ms | 2.6242 KOps/s | 2.5195 KOps/s | |
test_unlock_nested | 0.7822ms | 0.3664ms | 2.7293 KOps/s | 2.7308 KOps/s | |
test_unlock_stack_nested | 0.3675ms | 0.3202ms | 3.1233 KOps/s | 2.9926 KOps/s | |
test_flatten_speed | 0.1548ms | 76.9949μs | 12.9879 KOps/s | 13.0903 KOps/s | |
test_unflatten_speed | 0.3680ms | 0.3197ms | 3.1281 KOps/s | 3.1286 KOps/s | |
test_common_ops | 1.6139ms | 1.2629ms | 791.8138 Ops/s | 757.6501 Ops/s | |
test_creation | 22.1100μs | 1.4897μs | 671.2657 KOps/s | 683.5305 KOps/s | |
test_creation_empty | 45.1210μs | 14.5421μs | 68.7659 KOps/s | 55.8315 KOps/s | |
test_creation_nested_1 | 50.9210μs | 16.4984μs | 60.6119 KOps/s | 49.8879 KOps/s | |
test_creation_nested_2 | 53.9900μs | 18.9024μs | 52.9034 KOps/s | 44.3848 KOps/s | |
test_clone | 66.4210μs | 30.0467μs | 33.2816 KOps/s | 33.3083 KOps/s | |
test_getitem[int] | 1.2605ms | 16.3142μs | 61.2963 KOps/s | 61.3869 KOps/s | |
test_getitem[slice_int] | 0.1213ms | 28.5819μs | 34.9871 KOps/s | 35.1150 KOps/s | |
test_getitem[range] | 0.1860ms | 0.1108ms | 9.0247 KOps/s | 9.0323 KOps/s | |
test_getitem[tuple] | 0.1234ms | 23.9270μs | 41.7938 KOps/s | 40.7451 KOps/s | |
test_getitem[list] | 0.1865ms | 0.1002ms | 9.9796 KOps/s | 9.9422 KOps/s | |
test_setitem_dim[int] | 89.2920μs | 45.4683μs | 21.9933 KOps/s | 21.9840 KOps/s | |
test_setitem_dim[slice_int] | 0.1033ms | 68.2648μs | 14.6488 KOps/s | 14.6061 KOps/s | |
test_setitem_dim[range] | 0.1602ms | 0.1284ms | 7.7859 KOps/s | 7.7281 KOps/s | |
test_setitem_dim[tuple] | 93.3910μs | 61.7338μs | 16.1986 KOps/s | 16.0713 KOps/s | |
test_setitem | 78.9610μs | 42.4067μs | 23.5812 KOps/s | 22.7862 KOps/s | |
test_set | 89.9910μs | 41.4032μs | 24.1527 KOps/s | 23.0918 KOps/s | |
test_set_shared | 0.4010ms | 55.0592μs | 18.1623 KOps/s | 18.2268 KOps/s | |
test_update | 91.1220μs | 50.3353μs | 19.8668 KOps/s | 18.6284 KOps/s | |
test_update_nested | 0.1066ms | 58.9231μs | 16.9713 KOps/s | 16.3831 KOps/s | |
test_update__nested | 0.1487ms | 63.3925μs | 15.7747 KOps/s | 16.4289 KOps/s | |
test_set_nested | 74.3220μs | 44.2722μs | 22.5875 KOps/s | 21.5994 KOps/s | |
test_set_nested_new | 0.4075ms | 47.6641μs | 20.9802 KOps/s | 20.0813 KOps/s | |
test_select | 0.1068ms | 61.4115μs | 16.2836 KOps/s | 15.9014 KOps/s | |
test_select_nested | 71.2010μs | 41.2023μs | 24.2705 KOps/s | 23.7613 KOps/s | |
test_exclude_nested | 90.5210μs | 58.7287μs | 17.0274 KOps/s | 16.8953 KOps/s | |
test_empty[True] | 0.3191ms | 0.2564ms | 3.9003 KOps/s | 3.8738 KOps/s | |
test_empty[False] | 2.8071μs | 0.7355μs | 1.3597 MOps/s | 1.3595 MOps/s | |
test_to | 52.9210μs | 26.6786μs | 37.4832 KOps/s | 36.1873 KOps/s | |
test_to_nonblocking | 58.9810μs | 24.7596μs | 40.3883 KOps/s | 38.6640 KOps/s | |
test_unbind_speed | 0.3332ms | 0.2787ms | 3.5878 KOps/s | 3.5193 KOps/s | |
test_unbind_speed_stack0 | 0.3225ms | 0.2692ms | 3.7142 KOps/s | 3.5516 KOps/s | |
test_unbind_speed_stack1 | 92.0484ms | 0.6932ms | 1.4427 KOps/s | 1.5352 KOps/s | |
test_split | 93.5504ms | 2.2078ms | 452.9438 Ops/s | 443.6564 Ops/s | |
test_chunk | 94.9659ms | 2.2159ms | 451.2744 Ops/s | 446.1645 Ops/s | |
test_creation[device0] | 0.3358ms | 0.1262ms | 7.9258 KOps/s | 7.7639 KOps/s | |
test_creation_from_tensor | 0.3984ms | 0.1284ms | 7.7862 KOps/s | 7.6683 KOps/s | |
test_add_one[memmap_tensor0] | 0.2391ms | 9.1886μs | 108.8304 KOps/s | 106.9946 KOps/s | |
test_contiguous[memmap_tensor0] | 21.5400μs | 2.2190μs | 450.6445 KOps/s | 449.0825 KOps/s | |
test_stack[memmap_tensor0] | 32.3900μs | 6.8823μs | 145.3002 KOps/s | 143.2144 KOps/s | |
test_memmaptd_index | 1.0533ms | 0.4250ms | 2.3530 KOps/s | 2.2570 KOps/s | |
test_memmaptd_index_astensor | 0.7468ms | 0.5008ms | 1.9966 KOps/s | 1.9495 KOps/s | |
test_memmaptd_index_op | 1.4089ms | 1.0277ms | 973.0462 Ops/s | 905.3028 Ops/s | |
test_serialize_model | 0.1308s | 0.1299s | 7.6960 Ops/s | 7.6821 Ops/s | |
test_serialize_model_pickle | 1.3463s | 1.2171s | 0.8216 Ops/s | 0.8248 Ops/s | |
test_serialize_weights | 0.1314s | 0.1297s | 7.7081 Ops/s | 6.9946 Ops/s | |
test_serialize_weights_returnearly | 0.2311s | 56.7324ms | 17.6266 Ops/s | 17.9469 Ops/s | |
test_serialize_weights_pickle | 1.3668s | 1.2167s | 0.8219 Ops/s | 0.8214 Ops/s | |
test_reshape_pytree | 72.6510μs | 35.3189μs | 28.3134 KOps/s | 27.4564 KOps/s | |
test_reshape_td | 81.2310μs | 45.8351μs | 21.8174 KOps/s | 23.9086 KOps/s | |
test_view_pytree | 68.4310μs | 34.8057μs | 28.7309 KOps/s | 27.5791 KOps/s | |
test_view_td | 90.6010μs | 45.3286μs | 22.0611 KOps/s | 21.1635 KOps/s | |
test_unbind_pytree | 70.0410μs | 34.5904μs | 28.9098 KOps/s | 28.4189 KOps/s | |
test_unbind_td | 0.5326ms | 44.2047μs | 22.6220 KOps/s | 22.9638 KOps/s | |
test_split_pytree | 0.5185ms | 47.1370μs | 21.2148 KOps/s | 21.2514 KOps/s | |
test_split_td | 0.1459ms | 55.2402μs | 18.1028 KOps/s | 15.3319 KOps/s | |
test_add_pytree | 99.6520μs | 58.2643μs | 17.1632 KOps/s | 17.0711 KOps/s | |
test_add_td | 0.1285ms | 91.9137μs | 10.8798 KOps/s | 10.3932 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2096ms | 0.1609ms | 6.2161 KOps/s | 6.1047 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2842ms | 0.1620ms | 6.1720 KOps/s | 5.9918 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1816ms | 0.1447ms | 6.9127 KOps/s | 6.9363 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2817ms | 0.1996ms | 5.0102 KOps/s | 5.3219 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 58.1710μs | 21.3182μs | 46.9083 KOps/s | 45.4539 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 92.9820μs | 49.3580μs | 20.2601 KOps/s | 19.9101 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1052ms | 64.1454μs | 15.5896 KOps/s | 15.3941 KOps/s | |
test_compile_copy_nested[pytree-eager] | 97.4420μs | 49.1061μs | 20.3641 KOps/s | 20.1536 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.4185ms | 0.3191ms | 3.1337 KOps/s | 3.0996 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3461ms | 0.2327ms | 4.2977 KOps/s | 4.2007 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2197ms | 0.1333ms | 7.5005 KOps/s | 7.7493 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.4446ms | 66.4580μs | 15.0471 KOps/s | 15.0286 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.7128ms | 0.3172ms | 3.1521 KOps/s | 3.1253 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 1.0680ms | 0.6582ms | 1.5193 KOps/s | 1.5359 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.6667ms | 0.2830ms | 3.5335 KOps/s | 3.4806 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3692ms | 0.3221ms | 3.1050 KOps/s | 3.1024 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.4806ms | 77.9840μs | 12.8231 KOps/s | 12.8206 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1736ms | 0.1288ms | 7.7661 KOps/s | 7.7400 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.7105ms | 0.5352ms | 1.8685 KOps/s | 1.8617 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3724ms | 0.3169ms | 3.1554 KOps/s | 3.1100 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.3922ms | 19.7785μs | 50.5600 KOps/s | 51.0156 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.4457ms | 38.9860μs | 25.6503 KOps/s | 26.0731 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1062ms | 69.9544μs | 14.2950 KOps/s | 13.9909 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.4345ms | 50.4908μs | 19.8056 KOps/s | 19.1006 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.3741ms | 0.7884ms | 1.2684 KOps/s | 1.1093 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.4337ms | 3.3082ms | 302.2807 Ops/s | 303.4035 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.3172ms | 0.8156ms | 1.2260 KOps/s | 1.1134 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.4107ms | 3.3296ms | 300.3337 Ops/s | 290.8390 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1491ms | 0.1096ms | 9.1269 KOps/s | 9.2338 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.1919ms | 62.4636μs | 16.0093 KOps/s | 15.3639 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.2031ms | 0.1058ms | 9.4517 KOps/s | 9.6700 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 92.1010μs | 45.2999μs | 22.0751 KOps/s | 22.0808 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1464ms | 0.1046ms | 9.5597 KOps/s | 9.5154 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 83.8110μs | 44.1332μs | 22.6587 KOps/s | 22.4503 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2047ms | 0.1382ms | 7.2345 KOps/s | 7.2289 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1638ms | 25.5013μs | 39.2137 KOps/s | 38.3334 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1770ms | 0.1311ms | 7.6264 KOps/s | 7.6109 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 65.4610μs | 20.7549μs | 48.1814 KOps/s | 46.6226 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1908ms | 0.1322ms | 7.5651 KOps/s | 7.5014 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 48.7310μs | 20.6003μs | 48.5430 KOps/s | 46.7076 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1821ms | 0.1382ms | 7.2335 KOps/s | 7.2432 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4749ms | 24.6803μs | 40.5181 KOps/s | 38.6141 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2439ms | 0.1315ms | 7.6041 KOps/s | 7.5284 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 64.2410μs | 20.5030μs | 48.7733 KOps/s | 47.1040 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2173ms | 0.1318ms | 7.5860 KOps/s | 7.5510 KOps/s | |
test_compile_indexing[int-pytree-eager] | 52.7510μs | 20.7991μs | 48.0791 KOps/s | 46.7910 KOps/s | |
test_mod_add[eager] | 81.1820μs | 33.3429μs | 29.9914 KOps/s | 28.0748 KOps/s | |
test_mod_add[compile] | 0.3578ms | 72.9553μs | 13.7070 KOps/s | 14.2567 KOps/s | |
test_mod_add[compile-overhead] | 0.2598ms | 0.1347ms | 7.4261 KOps/s | 6.9647 KOps/s | |
test_mod_wrap[eager] | 0.3321ms | 0.2568ms | 3.8939 KOps/s | 4.0329 KOps/s | |
test_mod_wrap[compile] | 1.3971ms | 0.2965ms | 3.3724 KOps/s | 3.3106 KOps/s | |
test_mod_wrap[compile-overhead] | 7.7252ms | 4.1065ms | 243.5143 Ops/s | 244.0560 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.4948ms | 1.3548ms | 738.1356 Ops/s | 733.5405 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5772ms | 1.3343ms | 749.4376 Ops/s | 683.0865 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3353ms | 0.9022ms | 1.1085 KOps/s | 1.0115 KOps/s | |
test_seq_add[eager] | 0.1653ms | 99.0958μs | 10.0912 KOps/s | 9.1322 KOps/s | |
test_seq_add[compile] | 0.1342ms | 83.2116μs | 12.0176 KOps/s | 12.5198 KOps/s | |
test_seq_add[compile-overhead] | 0.1686ms | 0.1141ms | 8.7673 KOps/s | 8.4875 KOps/s | |
test_seq_wrap[eager] | 0.5163ms | 0.3791ms | 2.6375 KOps/s | 2.3773 KOps/s | |
test_seq_wrap[compile] | 0.3820ms | 0.3152ms | 3.1725 KOps/s | 3.0146 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2642ms | 0.2182ms | 4.5823 KOps/s | 4.4950 KOps/s | |
test_func_call_runtime[False-eager] | 0.8659ms | 0.7458ms | 1.3408 KOps/s | 1.3164 KOps/s | |
test_func_call_runtime[False-compile] | 0.8578ms | 0.7920ms | 1.2626 KOps/s | 1.2479 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4133ms | 0.3593ms | 2.7833 KOps/s | 2.7539 KOps/s | |
test_func_call_runtime[True-eager] | 1.0993ms | 0.9092ms | 1.0999 KOps/s | 1.0866 KOps/s | |
test_func_call_runtime[True-compile] | 0.9489ms | 0.8178ms | 1.2228 KOps/s | 1.2159 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.4838ms | 0.3850ms | 2.5976 KOps/s | 2.6045 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8133ms | 0.7429ms | 1.3460 KOps/s | 1.2971 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8555ms | 0.7976ms | 1.2538 KOps/s | 1.2431 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4230ms | 0.3631ms | 2.7538 KOps/s | 2.7484 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1239ms | 1.0207ms | 979.7085 Ops/s | 980.0486 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9657ms | 0.8464ms | 1.1815 KOps/s | 1.1758 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4872ms | 0.4056ms | 2.4655 KOps/s | 2.4263 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5454ms | 2.0973ms | 476.8108 Ops/s | 475.6292 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9585ms | 0.8593ms | 1.1638 KOps/s | 1.1474 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4615ms | 0.4082ms | 2.4496 KOps/s | 2.4100 KOps/s | |
test_distributed | 0.4904ms | 0.1189ms | 8.4124 KOps/s | 8.8426 KOps/s | |
test_tdmodule | 0.3131ms | 15.0145μs | 66.6023 KOps/s | 61.0156 KOps/s | |
test_tdmodule_dispatch | 57.7110μs | 28.1982μs | 35.4632 KOps/s | 31.7539 KOps/s | |
test_tdseq | 27.2900μs | 15.5951μs | 64.1227 KOps/s | 58.4824 KOps/s | |
test_tdseq_dispatch | 51.2010μs | 30.3309μs | 32.9697 KOps/s | 28.9538 KOps/s | |
test_instantiation_functorch | 2.1198ms | 1.8635ms | 536.6153 Ops/s | 534.3706 Ops/s | |
test_instantiation_td | 1.8217ms | 1.2047ms | 830.0811 Ops/s | 829.2152 Ops/s | |
test_exec_functorch | 0.2443ms | 0.2089ms | 4.7881 KOps/s | 4.7572 KOps/s | |
test_exec_functional_call | 0.2560ms | 0.2109ms | 4.7426 KOps/s | 4.6891 KOps/s | |
test_exec_td | 0.2736ms | 0.2335ms | 4.2826 KOps/s | 4.3521 KOps/s | |
test_exec_td_decorator | 1.0959ms | 0.2644ms | 3.7824 KOps/s | 3.7418 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.7988ms | 0.7105ms | 1.4075 KOps/s | 1.4110 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7763ms | 0.7012ms | 1.4262 KOps/s | 1.4022 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7470ms | 0.5980ms | 1.6723 KOps/s | 1.6278 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6470ms | 0.5972ms | 1.6744 KOps/s | 1.6647 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.3678ms | 0.6961ms | 1.4366 KOps/s | 1.4620 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8177ms | 0.6773ms | 1.4764 KOps/s | 1.4572 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7163ms | 0.5986ms | 1.6706 KOps/s | 1.6650 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.6887ms | 0.5999ms | 1.6670 KOps/s | 1.6619 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.5366ms | 8.4730ms | 118.0221 Ops/s | 116.5190 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.5256ms | 8.4863ms | 117.8371 Ops/s | 116.9479 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.3430ms | 8.3123ms | 120.3039 Ops/s | 119.7364 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.6201ms | 8.2561ms | 121.1229 Ops/s | 119.3864 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.6268ms | 19.5520ms | 51.1456 Ops/s | 50.6310 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 19.6526ms | 19.5758ms | 51.0835 Ops/s | 51.0581 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.4851ms | 19.3891ms | 51.5753 Ops/s | 51.0620 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.4639ms | 19.4059ms | 51.5307 Ops/s | 51.1536 Ops/s | |
test_to_module_speed[True] | 1.2904ms | 1.0038ms | 996.2262 Ops/s | 994.2483 Ops/s | |
test_to_module_speed[False] | 1.3606ms | 0.9647ms | 1.0366 KOps/s | 1.0325 KOps/s | |
test_tc_init | 63.3610μs | 34.4958μs | 28.9890 KOps/s | 27.7084 KOps/s | |
test_tc_init_nested | 0.1061ms | 72.6990μs | 13.7554 KOps/s | 13.6803 KOps/s | |
test_tc_first_layer_tensor | 5.3557μs | 0.6731μs | 1.4857 MOps/s | 1.4782 MOps/s | |
test_tc_first_layer_nontensor | 40.2810μs | 2.2567μs | 443.1256 KOps/s | 442.0080 KOps/s | |
test_tc_second_layer_tensor | 29.9280μs | 1.3649μs | 732.6379 KOps/s | 727.4051 KOps/s | |
test_tc_second_layer_nontensor | 38.2600μs | 2.9412μs | 339.9958 KOps/s | 341.2498 KOps/s | |
test_unbind | 0.1861s | 12.0224ms | 83.1784 Ops/s | 94.9778 Ops/s | |
test_full_like | 0.6562ms | 0.5751ms | 1.7389 KOps/s | 1.7428 KOps/s | |
test_zeros_like | 0.2769ms | 0.1978ms | 5.0543 KOps/s | 5.0524 KOps/s | |
test_ones_like | 0.2387ms | 0.1978ms | 5.0567 KOps/s | 5.0573 KOps/s | |
test_clone | 0.4423ms | 0.4144ms | 2.4129 KOps/s | 2.4184 KOps/s | |
test_squeeze | 43.2910μs | 9.9189μs | 100.8180 KOps/s | 100.4573 KOps/s | |
test_unsqueeze | 0.2228ms | 73.6114μs | 13.5848 KOps/s | 13.0669 KOps/s | |
test_split | 0.4128ms | 0.1599ms | 6.2548 KOps/s | 6.2272 KOps/s | |
test_permute | 0.2231ms | 0.1770ms | 5.6492 KOps/s | 5.4949 KOps/s | |
test_stack | 1.2565ms | 0.8573ms | 1.1665 KOps/s | 1.1675 KOps/s | |
test_cat | 1.2867ms | 1.2312ms | 812.1911 Ops/s | 811.5912 Ops/s |
vmoens
added a commit
that referenced
this pull request
Oct 4, 2024
ghstack-source-id: fa65ff98d46363f84e4fac1cf306b59cd8f7e555 Pull Request resolved: #1001
vmoens
added a commit
that referenced
this pull request
Oct 8, 2024
ghstack-source-id: db7893b09e6838506d1b798a07c767743651925c Pull Request resolved: #1001
vmoens
added a commit
that referenced
this pull request
Oct 8, 2024
ghstack-source-id: db7e19e0fc06b7137a769f03f6337836179883d5 Pull Request resolved: #1001
vmoens
added a commit
that referenced
this pull request
Oct 8, 2024
ghstack-source-id: 9f9ce070d8726c74fcf5a22e0edd05b8c9fd7e19 Pull Request resolved: #1001
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Refactor
Refactoring code - not a new feature
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):