-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] allow tensorclass to be customized #1080
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Nov 7, 2024
ghstack-source-id: 04281e1299a62085d1d242bb63ec837fe001dfea Pull Request resolved: #1080
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 7, 2024
vmoens
added a commit
that referenced
this pull request
Nov 7, 2024
ghstack-source-id: c3460ff67c999c91000263bcfe00a04e88918b28 Pull Request resolved: #1080
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 50.9850μs | 17.5032μs | 57.1323 KOps/s | 55.2817 KOps/s | |
test_plain_set_stack_nested | 43.2900μs | 17.8190μs | 56.1200 KOps/s | 54.9033 KOps/s | |
test_plain_set_nested_inplace | 80.8100μs | 19.3263μs | 51.7430 KOps/s | 50.0216 KOps/s | |
test_plain_set_stack_nested_inplace | 0.1707ms | 19.4080μs | 51.5250 KOps/s | 50.4927 KOps/s | |
test_items | 48.5220μs | 4.1474μs | 241.1162 KOps/s | 240.2752 KOps/s | |
test_items_nested | 0.5822ms | 0.3445ms | 2.9031 KOps/s | 2.9316 KOps/s | |
test_items_nested_locked | 0.5242ms | 0.3448ms | 2.9004 KOps/s | 2.9514 KOps/s | |
test_items_nested_leaf | 0.1478ms | 71.2852μs | 14.0282 KOps/s | 14.1811 KOps/s | |
test_items_stack_nested | 0.5619ms | 0.3483ms | 2.8709 KOps/s | 2.9263 KOps/s | |
test_items_stack_nested_leaf | 0.1553ms | 74.7213μs | 13.3831 KOps/s | 13.8487 KOps/s | |
test_items_stack_nested_locked | 0.6392ms | 0.3478ms | 2.8753 KOps/s | 2.9598 KOps/s | |
test_keys | 30.3760μs | 3.5013μs | 285.6067 KOps/s | 285.6426 KOps/s | |
test_keys_nested | 0.2211ms | 0.1376ms | 7.2686 KOps/s | 7.4323 KOps/s | |
test_keys_nested_locked | 1.9424ms | 0.1419ms | 7.0472 KOps/s | 7.1715 KOps/s | |
test_keys_nested_leaf | 0.2494ms | 0.1150ms | 8.6964 KOps/s | 8.6117 KOps/s | |
test_keys_stack_nested | 0.2578ms | 0.1354ms | 7.3840 KOps/s | 7.3749 KOps/s | |
test_keys_stack_nested_leaf | 0.2066ms | 0.1158ms | 8.6332 KOps/s | 8.6987 KOps/s | |
test_keys_stack_nested_locked | 0.4424ms | 0.1452ms | 6.8881 KOps/s | 7.1164 KOps/s | |
test_values | 9.4696μs | 1.0387μs | 962.7773 KOps/s | 891.1586 KOps/s | |
test_values_nested | 0.1120ms | 55.3620μs | 18.0629 KOps/s | 18.2139 KOps/s | |
test_values_nested_locked | 0.1032ms | 55.6893μs | 17.9568 KOps/s | 18.2651 KOps/s | |
test_values_nested_leaf | 0.3163ms | 59.8717μs | 16.7024 KOps/s | 16.8452 KOps/s | |
test_values_stack_nested | 0.1179ms | 56.9341μs | 17.5642 KOps/s | 17.8085 KOps/s | |
test_values_stack_nested_leaf | 0.1204ms | 59.8490μs | 16.7087 KOps/s | 16.8668 KOps/s | |
test_values_stack_nested_locked | 0.1199ms | 56.4820μs | 17.7048 KOps/s | 17.8228 KOps/s | |
test_membership | 40.2260μs | 0.8708μs | 1.1483 MOps/s | 1.1667 MOps/s | |
test_membership_nested | 30.6770μs | 2.7258μs | 366.8632 KOps/s | 362.7692 KOps/s | |
test_membership_nested_leaf | 33.1810μs | 2.7525μs | 363.3115 KOps/s | 360.7172 KOps/s | |
test_membership_stacked_nested | 20.5680μs | 2.7095μs | 369.0761 KOps/s | 366.7860 KOps/s | |
test_membership_stacked_nested_leaf | 23.2130μs | 2.7352μs | 365.6014 KOps/s | 348.0464 KOps/s | |
test_membership_nested_last | 45.7450μs | 3.9333μs | 254.2418 KOps/s | 245.6591 KOps/s | |
test_membership_nested_leaf_last | 51.1450μs | 3.9730μs | 251.6962 KOps/s | 243.9172 KOps/s | |
test_membership_stacked_nested_last | 25.6970μs | 4.6493μs | 215.0872 KOps/s | 76.7336 KOps/s | |
test_membership_stacked_nested_leaf_last | 57.0460μs | 4.6078μs | 217.0228 KOps/s | 76.2671 KOps/s | |
test_nested_getleaf | 39.8940μs | 10.5476μs | 94.8079 KOps/s | 94.6177 KOps/s | |
test_nested_get | 57.6770μs | 9.9090μs | 100.9183 KOps/s | 99.9943 KOps/s | |
test_stacked_getleaf | 56.6550μs | 10.4121μs | 96.0423 KOps/s | 94.3861 KOps/s | |
test_stacked_get | 47.0070μs | 10.0419μs | 99.5825 KOps/s | 100.8635 KOps/s | |
test_nested_getitemleaf | 59.1500μs | 11.0907μs | 90.1657 KOps/s | 91.1051 KOps/s | |
test_nested_getitem | 62.7970μs | 10.3589μs | 96.5350 KOps/s | 97.9766 KOps/s | |
test_stacked_getitemleaf | 36.9780μs | 10.9846μs | 91.0369 KOps/s | 91.2714 KOps/s | |
test_stacked_getitem | 65.9830μs | 10.2225μs | 97.8230 KOps/s | 98.1420 KOps/s | |
test_lock_nested | 3.0901ms | 0.4374ms | 2.2862 KOps/s | 2.2355 KOps/s | |
test_lock_stack_nested | 0.4980ms | 0.4031ms | 2.4807 KOps/s | 2.4922 KOps/s | |
test_unlock_nested | 0.8967ms | 0.3540ms | 2.8248 KOps/s | 2.7498 KOps/s | |
test_unlock_stack_nested | 0.5978ms | 0.3221ms | 3.1048 KOps/s | 3.1140 KOps/s | |
test_flatten_speed | 0.1866ms | 90.9702μs | 10.9926 KOps/s | 10.9403 KOps/s | |
test_unflatten_speed | 0.5352ms | 0.4561ms | 2.1925 KOps/s | 2.0869 KOps/s | |
test_common_ops | 3.8484ms | 0.7705ms | 1.2979 KOps/s | 1.2099 KOps/s | |
test_creation | 57.7280μs | 2.0320μs | 492.1333 KOps/s | 490.2415 KOps/s | |
test_creation_empty | 42.7400μs | 10.9658μs | 91.1929 KOps/s | 84.2569 KOps/s | |
test_creation_nested_1 | 45.6540μs | 13.4874μs | 74.1430 KOps/s | 67.2356 KOps/s | |
test_creation_nested_2 | 51.5260μs | 18.0533μs | 55.3916 KOps/s | 53.1864 KOps/s | |
test_clone | 1.4621ms | 13.2966μs | 75.2074 KOps/s | 75.3176 KOps/s | |
test_getitem[int] | 0.8368ms | 12.6105μs | 79.2990 KOps/s | 77.6600 KOps/s | |
test_getitem[slice_int] | 0.1792ms | 24.1021μs | 41.4901 KOps/s | 39.4217 KOps/s | |
test_getitem[range] | 0.2604ms | 49.6157μs | 20.1549 KOps/s | 19.3264 KOps/s | |
test_getitem[tuple] | 0.1377ms | 19.7584μs | 50.6115 KOps/s | 48.3280 KOps/s | |
test_getitem[list] | 0.3663ms | 44.4689μs | 22.4876 KOps/s | 21.1484 KOps/s | |
test_setitem_dim[int] | 53.3390μs | 24.5543μs | 40.7261 KOps/s | 39.6389 KOps/s | |
test_setitem_dim[slice_int] | 81.8520μs | 50.3101μs | 19.8767 KOps/s | 18.7450 KOps/s | |
test_setitem_dim[range] | 0.1247ms | 72.6828μs | 13.7584 KOps/s | 13.0344 KOps/s | |
test_setitem_dim[tuple] | 85.9100μs | 39.6057μs | 25.2489 KOps/s | 23.8056 KOps/s | |
test_setitem | 0.2051ms | 20.4519μs | 48.8952 KOps/s | 47.6164 KOps/s | |
test_set | 0.1605ms | 19.9871μs | 50.0324 KOps/s | 48.4982 KOps/s | |
test_set_shared | 1.2490ms | 0.1685ms | 5.9337 KOps/s | 5.6490 KOps/s | |
test_update | 0.2024ms | 22.5405μs | 44.3646 KOps/s | 42.5637 KOps/s | |
test_update_nested | 0.2284ms | 32.2918μs | 30.9676 KOps/s | 28.8027 KOps/s | |
test_update__nested | 1.1385ms | 32.5211μs | 30.7493 KOps/s | 30.0194 KOps/s | |
test_set_nested | 0.1786ms | 21.7243μs | 46.0314 KOps/s | 43.8074 KOps/s | |
test_set_nested_new | 0.1645ms | 26.4662μs | 37.7841 KOps/s | 36.0417 KOps/s | |
test_select | 0.1612ms | 41.5459μs | 24.0698 KOps/s | 22.6621 KOps/s | |
test_select_nested | 0.1249ms | 58.7617μs | 17.0179 KOps/s | 16.2105 KOps/s | |
test_exclude_nested | 0.1460ms | 73.4331μs | 13.6178 KOps/s | 13.0072 KOps/s | |
test_empty[True] | 0.5817ms | 0.3460ms | 2.8899 KOps/s | 2.8502 KOps/s | |
test_empty[False] | 12.3505μs | 1.2037μs | 830.8051 KOps/s | 823.4668 KOps/s | |
test_unbind_speed | 0.4085ms | 0.2553ms | 3.9172 KOps/s | 3.8359 KOps/s | |
test_unbind_speed_stack0 | 0.4150ms | 0.2537ms | 3.9414 KOps/s | 3.9831 KOps/s | |
test_unbind_speed_stack1 | 0.1164s | 0.7763ms | 1.2881 KOps/s | 1.4442 KOps/s | |
test_split | 1.8042ms | 1.5475ms | 646.2188 Ops/s | 558.0143 Ops/s | |
test_chunk | 0.1127s | 1.8883ms | 529.5629 Ops/s | 560.3730 Ops/s | |
test_consolidate_njt[False-None] | 11.1333ms | 8.0382ms | 124.4060 Ops/s | 120.5901 Ops/s | |
test_creation[device0] | 0.2419ms | 89.9273μs | 11.1201 KOps/s | 10.3498 KOps/s | |
test_creation_from_tensor | 3.7894ms | 94.5355μs | 10.5780 KOps/s | 10.3983 KOps/s | |
test_add_one[memmap_tensor0] | 0.1929ms | 4.7544μs | 210.3318 KOps/s | 205.0665 KOps/s | |
test_contiguous[memmap_tensor0] | 39.4040μs | 0.5077μs | 1.9695 MOps/s | 1.9635 MOps/s | |
test_stack[memmap_tensor0] | 54.9120μs | 3.4160μs | 292.7430 KOps/s | 279.0151 KOps/s | |
test_memmaptd_index | 1.0297ms | 0.2318ms | 4.3134 KOps/s | 4.2300 KOps/s | |
test_memmaptd_index_astensor | 0.8424ms | 0.3077ms | 3.2500 KOps/s | 3.1607 KOps/s | |
test_memmaptd_index_op | 0.9346ms | 0.5647ms | 1.7708 KOps/s | 1.6186 KOps/s | |
test_serialize_model | 0.1361s | 0.1206s | 8.2917 Ops/s | 8.5808 Ops/s | |
test_serialize_model_pickle | 0.4489s | 0.3920s | 2.5512 Ops/s | 2.5594 Ops/s | |
test_serialize_weights | 0.2289s | 0.1346s | 7.4306 Ops/s | 8.6655 Ops/s | |
test_serialize_weights_returnearly | 0.1660s | 0.1588s | 6.2955 Ops/s | 6.2687 Ops/s | |
test_serialize_weights_pickle | 1.1110s | 0.6983s | 1.4321 Ops/s | 2.2816 Ops/s | |
test_serialize_weights_filesystem | 0.1532s | 0.1448s | 6.9053 Ops/s | 7.0174 Ops/s | |
test_serialize_model_filesystem | 0.2566s | 0.1619s | 6.1756 Ops/s | 5.8358 Ops/s | |
test_reshape_pytree | 58.9700μs | 26.9622μs | 37.0889 KOps/s | 37.7935 KOps/s | |
test_reshape_td | 0.2225ms | 34.6983μs | 28.8199 KOps/s | 30.9375 KOps/s | |
test_view_pytree | 0.1356ms | 27.4967μs | 36.3680 KOps/s | 37.6193 KOps/s | |
test_view_td | 76.7220μs | 37.6322μs | 26.5730 KOps/s | 25.9530 KOps/s | |
test_unbind_pytree | 98.6130μs | 29.5671μs | 33.8214 KOps/s | 33.5996 KOps/s | |
test_unbind_td | 0.3300ms | 38.3741μs | 26.0592 KOps/s | 26.1963 KOps/s | |
test_split_pytree | 65.8020μs | 29.8225μs | 33.5317 KOps/s | 34.0420 KOps/s | |
test_split_td | 0.4808ms | 44.3916μs | 22.5268 KOps/s | 21.9331 KOps/s | |
test_add_pytree | 90.4080μs | 36.4013μs | 27.4716 KOps/s | 28.2362 KOps/s | |
test_add_td | 0.1434ms | 54.5873μs | 18.3193 KOps/s | 16.9281 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1156ms | 63.2820μs | 15.8023 KOps/s | 16.0400 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3848ms | 0.1606ms | 6.2262 KOps/s | 6.2840 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1227ms | 45.8266μs | 21.8214 KOps/s | 22.1030 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2464ms | 0.1193ms | 8.3831 KOps/s | 8.4043 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.1044ms | 26.4928μs | 37.7461 KOps/s | 39.3624 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1562ms | 53.4490μs | 18.7094 KOps/s | 18.5705 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1793ms | 79.4138μs | 12.5923 KOps/s | 12.8273 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1251ms | 67.3308μs | 14.8520 KOps/s | 14.7881 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2100ms | 0.1035ms | 9.6577 KOps/s | 9.6788 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3339ms | 0.1977ms | 5.0592 KOps/s | 5.0972 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1158ms | 44.2935μs | 22.5767 KOps/s | 22.5124 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5218ms | 62.9309μs | 15.8905 KOps/s | 15.9429 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2136ms | 0.1029ms | 9.7143 KOps/s | 9.8194 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3077ms | 0.2005ms | 4.9880 KOps/s | 4.7789 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4466ms | 0.2097ms | 4.7686 KOps/s | 4.8023 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2271ms | 0.1072ms | 9.3272 KOps/s | 9.5034 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.2640ms | 56.3087μs | 17.7592 KOps/s | 18.2452 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1397ms | 45.6547μs | 21.9035 KOps/s | 22.4508 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6299ms | 0.1588ms | 6.2958 KOps/s | 6.1796 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2029ms | 0.1029ms | 9.7165 KOps/s | 9.7642 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.1033ms | 21.3799μs | 46.7729 KOps/s | 47.4131 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1263ms | 59.1238μs | 16.9137 KOps/s | 17.2318 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1635ms | 81.6158μs | 12.2525 KOps/s | 12.5182 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1449ms | 69.0438μs | 14.4835 KOps/s | 13.7806 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3054ms | 0.2105ms | 4.7511 KOps/s | 4.7951 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 1.3981ms | 1.2529ms | 798.1516 Ops/s | 775.6477 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3589ms | 0.2114ms | 4.7301 KOps/s | 4.9229 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.0614ms | 0.7688ms | 1.3007 KOps/s | 1.2837 KOps/s | |
test_compile_assign_and_add_stack[compile] | 0.5761ms | 0.4617ms | 2.1659 KOps/s | 2.2363 KOps/s | |
test_compile_assign_and_add_stack[eager] | 5.8609ms | 2.5897ms | 386.1409 Ops/s | 367.4728 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1022ms | 35.7853μs | 27.9445 KOps/s | 28.5357 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.7882ms | 34.4907μs | 28.9933 KOps/s | 29.9899 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1719ms | 29.2060μs | 34.2396 KOps/s | 34.9533 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 83.0750μs | 23.3096μs | 42.9007 KOps/s | 44.0134 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 77.1440μs | 29.8631μs | 33.4861 KOps/s | 32.8245 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 68.2470μs | 22.8469μs | 43.7696 KOps/s | 44.0266 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1240ms | 52.4014μs | 19.0835 KOps/s | 19.3633 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5811ms | 19.8605μs | 50.3513 KOps/s | 49.6803 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1089ms | 43.7320μs | 22.8666 KOps/s | 22.1298 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 0.1459ms | 19.0639μs | 52.4552 KOps/s | 53.6701 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1395ms | 44.9433μs | 22.2503 KOps/s | 21.9181 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 0.1189ms | 19.0755μs | 52.4232 KOps/s | 53.7187 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2112ms | 54.6795μs | 18.2884 KOps/s | 18.9059 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 5.1502ms | 20.1279μs | 49.6823 KOps/s | 51.4345 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 92.3620μs | 44.3050μs | 22.5708 KOps/s | 21.8333 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 75.5610μs | 18.9692μs | 52.7169 KOps/s | 53.3008 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1356ms | 44.4587μs | 22.4928 KOps/s | 21.9228 KOps/s | |
test_compile_indexing[int-pytree-eager] | 50.7950μs | 18.7645μs | 53.2921 KOps/s | 53.8953 KOps/s | |
test_mod_add[eager] | 69.0990μs | 25.7486μs | 38.8371 KOps/s | 36.1039 KOps/s | |
test_mod_add[compile] | 0.1121ms | 44.6154μs | 22.4138 KOps/s | 23.2907 KOps/s | |
test_mod_add[compile-overhead] | 0.1220ms | 44.7882μs | 22.3273 KOps/s | 22.8674 KOps/s | |
test_mod_wrap[eager] | 0.3430ms | 0.2148ms | 4.6554 KOps/s | 4.6247 KOps/s | |
test_mod_wrap[compile] | 2.1776ms | 0.2034ms | 4.9167 KOps/s | 4.8615 KOps/s | |
test_mod_wrap[compile-overhead] | 1.8340ms | 0.2030ms | 4.9262 KOps/s | 4.8688 KOps/s | |
test_mod_wrap_and_backward[eager] | 14.2195ms | 11.8695ms | 84.2498 Ops/s | 77.1237 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.1752ms | 12.9415ms | 77.2707 Ops/s | 77.1599 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 15.0021ms | 12.9863ms | 77.0043 Ops/s | 75.4662 Ops/s | |
test_seq_add[eager] | 0.1578ms | 89.9240μs | 11.1205 KOps/s | 10.4878 KOps/s | |
test_seq_add[compile] | 0.1301ms | 59.7788μs | 16.7283 KOps/s | 16.6746 KOps/s | |
test_seq_add[compile-overhead] | 0.1248ms | 58.3501μs | 17.1379 KOps/s | 17.1891 KOps/s | |
test_seq_wrap[eager] | 0.5909ms | 0.3829ms | 2.6116 KOps/s | 2.4273 KOps/s | |
test_seq_wrap[compile] | 0.4228ms | 0.2278ms | 4.3904 KOps/s | 4.3962 KOps/s | |
test_seq_wrap[compile-overhead] | 0.4337ms | 0.2254ms | 4.4371 KOps/s | 4.4283 KOps/s | |
test_func_call_runtime[False-eager] | 0.7717ms | 0.5534ms | 1.8071 KOps/s | 1.7582 KOps/s | |
test_func_call_runtime[False-compile] | 0.6395ms | 0.4209ms | 2.3759 KOps/s | 2.3466 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.5829ms | 0.4234ms | 2.3617 KOps/s | 2.3428 KOps/s | |
test_func_call_runtime[True-eager] | 1.1218ms | 0.7609ms | 1.3143 KOps/s | 1.3210 KOps/s | |
test_func_call_runtime[True-compile] | 0.8148ms | 0.4609ms | 2.1695 KOps/s | 2.1602 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.6190ms | 0.4630ms | 2.1600 KOps/s | 2.1530 KOps/s | |
test_func_call_cm_runtime[False-eager] | 1.2039ms | 0.5624ms | 1.7780 KOps/s | 1.8268 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.5780ms | 0.4244ms | 2.3565 KOps/s | 2.3550 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.9299ms | 0.4218ms | 2.3705 KOps/s | 2.3489 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0570ms | 0.9040ms | 1.1062 KOps/s | 1.1059 KOps/s | |
test_func_call_cm_runtime[True-compile] | 1.2313ms | 0.4875ms | 2.0512 KOps/s | 2.0306 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.7261ms | 0.4886ms | 2.0467 KOps/s | 2.0284 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.4282ms | 1.8823ms | 531.2646 Ops/s | 517.3762 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.6974ms | 0.5088ms | 1.9654 KOps/s | 1.9329 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.6138ms | 0.5056ms | 1.9777 KOps/s | 1.9429 KOps/s | |
test_distributed | 0.3392ms | 0.1268ms | 7.8861 KOps/s | 7.7617 KOps/s | |
test_tdmodule | 44.6740μs | 18.2764μs | 54.7153 KOps/s | 50.3696 KOps/s | |
test_tdmodule_dispatch | 66.5140μs | 35.7200μs | 27.9955 KOps/s | 26.3081 KOps/s | |
test_tdseq | 46.1960μs | 21.0099μs | 47.5965 KOps/s | 44.1430 KOps/s | |
test_tdseq_dispatch | 76.5120μs | 41.6109μs | 24.0321 KOps/s | 23.6039 KOps/s | |
test_instantiation_functorch | 2.1208ms | 1.5368ms | 650.6954 Ops/s | 657.6047 Ops/s | |
test_exec_functorch | 0.3456ms | 0.1835ms | 5.4505 KOps/s | 5.4662 KOps/s | |
test_exec_functional_call | 0.3357ms | 0.1765ms | 5.6663 KOps/s | 5.6692 KOps/s | |
test_exec_td_decorator | 0.5408ms | 0.2305ms | 4.3380 KOps/s | 4.3457 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.7989ms | 0.6307ms | 1.5855 KOps/s | 1.5431 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9712ms | 0.6317ms | 1.5830 KOps/s | 1.5380 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7638ms | 0.5172ms | 1.9337 KOps/s | 1.8927 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7328ms | 0.5175ms | 1.9322 KOps/s | 1.8466 KOps/s | |
test_to_module_speed[True] | 1.5510ms | 1.2911ms | 774.5616 Ops/s | 784.8408 Ops/s | |
test_to_module_speed[False] | 1.6625ms | 1.2625ms | 792.0681 Ops/s | 797.8408 Ops/s | |
test_tc_init | 89.1050μs | 45.0352μs | 22.2049 KOps/s | 21.3839 KOps/s | |
test_tc_init_nested | 0.1702ms | 91.0522μs | 10.9827 KOps/s | 10.7106 KOps/s | |
test_tc_first_layer_tensor | 40.6960μs | 1.5284μs | 654.2715 KOps/s | 661.2306 KOps/s | |
test_tc_first_layer_nontensor | 28.7130μs | 4.7284μs | 211.4890 KOps/s | 215.6634 KOps/s | |
test_tc_second_layer_tensor | 42.6590μs | 2.8206μs | 354.5295 KOps/s | 356.5423 KOps/s | |
test_tc_second_layer_nontensor | 31.2480μs | 6.0308μs | 165.8142 KOps/s | 169.6419 KOps/s | |
test_unbind | 0.2403s | 14.1888ms | 70.4782 Ops/s | 77.4163 Ops/s | |
test_full_like | 9.9245ms | 8.2730ms | 120.8746 Ops/s | 78.9645 Ops/s | |
test_zeros_like | 3.6937ms | 3.1546ms | 316.9993 Ops/s | 122.2463 Ops/s | |
test_ones_like | 4.1287ms | 3.5831ms | 279.0913 Ops/s | 123.3096 Ops/s | |
test_clone | 6.6814ms | 5.9049ms | 169.3519 Ops/s | 101.2904 Ops/s | |
test_squeeze | 63.3380μs | 11.9947μs | 83.3699 KOps/s | 82.3051 KOps/s | |
test_unsqueeze | 0.1656ms | 88.9454μs | 11.2429 KOps/s | 11.1270 KOps/s | |
test_split | 0.4837ms | 0.1846ms | 5.4158 KOps/s | 5.2027 KOps/s | |
test_permute | 0.3142ms | 0.2172ms | 4.6036 KOps/s | 4.5547 KOps/s | |
test_stack | 30.3634ms | 26.1641ms | 38.2202 Ops/s | 38.2684 Ops/s | |
test_cat | 33.6727ms | 26.0917ms | 38.3264 Ops/s | 38.4214 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 30.6400μs | 11.0298μs | 90.6637 KOps/s | 85.2034 KOps/s | |
test_plain_set_stack_nested | 39.4110μs | 11.1360μs | 89.7990 KOps/s | 86.1697 KOps/s | |
test_plain_set_nested_inplace | 0.4038ms | 12.0047μs | 83.3008 KOps/s | 80.0803 KOps/s | |
test_plain_set_stack_nested_inplace | 0.3903ms | 12.0142μs | 83.2347 KOps/s | 79.7488 KOps/s | |
test_items | 21.1010μs | 2.8571μs | 350.0048 KOps/s | 340.0693 KOps/s | |
test_items_nested | 0.6935ms | 0.3206ms | 3.1193 KOps/s | 3.1486 KOps/s | |
test_items_nested_locked | 0.6949ms | 0.3220ms | 3.1055 KOps/s | 3.1466 KOps/s | |
test_items_nested_leaf | 0.4340ms | 58.3381μs | 17.1415 KOps/s | 17.1229 KOps/s | |
test_items_stack_nested | 0.7060ms | 0.3201ms | 3.1237 KOps/s | 3.1197 KOps/s | |
test_items_stack_nested_leaf | 0.4302ms | 59.0322μs | 16.9399 KOps/s | 16.9439 KOps/s | |
test_items_stack_nested_locked | 0.7169ms | 0.3233ms | 3.0928 KOps/s | 3.1258 KOps/s | |
test_keys | 32.0200μs | 3.4614μs | 288.8991 KOps/s | 288.6467 KOps/s | |
test_keys_nested | 0.4583ms | 69.3884μs | 14.4116 KOps/s | 14.3150 KOps/s | |
test_keys_nested_locked | 0.7627ms | 74.9163μs | 13.3482 KOps/s | 13.2147 KOps/s | |
test_keys_nested_leaf | 0.4324ms | 62.0190μs | 16.1241 KOps/s | 16.3482 KOps/s | |
test_keys_stack_nested | 0.1257ms | 70.3367μs | 14.2173 KOps/s | 13.9700 KOps/s | |
test_keys_stack_nested_leaf | 0.4362ms | 61.6028μs | 16.2330 KOps/s | 16.1622 KOps/s | |
test_keys_stack_nested_locked | 0.4499ms | 76.0355μs | 13.1518 KOps/s | 13.0968 KOps/s | |
test_values | 65.4060μs | 0.8420μs | 1.1876 MOps/s | 1.1829 MOps/s | |
test_values_nested | 66.4110μs | 31.3467μs | 31.9013 KOps/s | 32.0224 KOps/s | |
test_values_nested_locked | 60.5500μs | 32.9403μs | 30.3580 KOps/s | 30.4742 KOps/s | |
test_values_nested_leaf | 54.5510μs | 33.7122μs | 29.6628 KOps/s | 29.7106 KOps/s | |
test_values_stack_nested | 59.6710μs | 31.7395μs | 31.5064 KOps/s | 31.6531 KOps/s | |
test_values_stack_nested_leaf | 75.1910μs | 34.2246μs | 29.2187 KOps/s | 29.0778 KOps/s | |
test_values_stack_nested_locked | 59.7510μs | 33.5109μs | 29.8411 KOps/s | 30.2148 KOps/s | |
test_membership | 1.6166μs | 0.5197μs | 1.9241 MOps/s | 1.9263 MOps/s | |
test_membership_nested | 13.3800μs | 1.8628μs | 536.8168 KOps/s | 543.1905 KOps/s | |
test_membership_nested_leaf | 11.8770μs | 1.8172μs | 550.3097 KOps/s | 536.3013 KOps/s | |
test_membership_stacked_nested | 35.3210μs | 1.9490μs | 513.0874 KOps/s | 514.0016 KOps/s | |
test_membership_stacked_nested_leaf | 30.4600μs | 1.9520μs | 512.2882 KOps/s | 522.6111 KOps/s | |
test_membership_nested_last | 38.0800μs | 2.7633μs | 361.8868 KOps/s | 364.8927 KOps/s | |
test_membership_nested_leaf_last | 25.3900μs | 2.7917μs | 358.2104 KOps/s | 361.2259 KOps/s | |
test_membership_stacked_nested_last | 42.4010μs | 3.3472μs | 298.7567 KOps/s | 268.1016 KOps/s | |
test_membership_stacked_nested_leaf_last | 0.7785ms | 3.1759μs | 314.8701 KOps/s | 269.8069 KOps/s | |
test_nested_getleaf | 34.2910μs | 5.9435μs | 168.2521 KOps/s | 167.4980 KOps/s | |
test_nested_get | 28.4500μs | 5.6376μs | 177.3818 KOps/s | 176.4344 KOps/s | |
test_stacked_getleaf | 30.8100μs | 5.9563μs | 167.8902 KOps/s | 167.0318 KOps/s | |
test_stacked_get | 28.2210μs | 5.6145μs | 178.1097 KOps/s | 176.6182 KOps/s | |
test_nested_getitemleaf | 32.8210μs | 6.0417μs | 165.5162 KOps/s | 164.4359 KOps/s | |
test_nested_getitem | 30.0700μs | 5.7267μs | 174.6207 KOps/s | 174.4036 KOps/s | |
test_stacked_getitemleaf | 31.3500μs | 6.0357μs | 165.6811 KOps/s | 164.5472 KOps/s | |
test_stacked_getitem | 26.3410μs | 5.7269μs | 174.6135 KOps/s | 174.9187 KOps/s | |
test_lock_nested | 9.1584ms | 0.3672ms | 2.7234 KOps/s | 2.7080 KOps/s | |
test_lock_stack_nested | 0.3764ms | 0.3302ms | 3.0289 KOps/s | 2.9893 KOps/s | |
test_unlock_nested | 0.6611ms | 0.3024ms | 3.3073 KOps/s | 3.3597 KOps/s | |
test_unlock_stack_nested | 0.3154ms | 0.2714ms | 3.6848 KOps/s | 3.6983 KOps/s | |
test_flatten_speed | 0.1046ms | 73.3030μs | 13.6420 KOps/s | 13.7461 KOps/s | |
test_unflatten_speed | 0.3377ms | 0.2873ms | 3.4812 KOps/s | 3.4272 KOps/s | |
test_common_ops | 1.7584ms | 0.5892ms | 1.6971 KOps/s | 1.6643 KOps/s | |
test_creation | 0.1204ms | 1.4929μs | 669.8547 KOps/s | 676.6796 KOps/s | |
test_creation_empty | 39.2410μs | 8.2591μs | 121.0787 KOps/s | 104.4622 KOps/s | |
test_creation_nested_1 | 1.6593ms | 9.6878μs | 103.2221 KOps/s | 90.6371 KOps/s | |
test_creation_nested_2 | 48.8510μs | 12.1293μs | 82.4450 KOps/s | 75.2270 KOps/s | |
test_clone | 58.4610μs | 10.5414μs | 94.8642 KOps/s | 99.2059 KOps/s | |
test_getitem[int] | 92.7821ms | 15.2685μs | 65.4943 KOps/s | 97.5402 KOps/s | |
test_getitem[slice_int] | 0.1103ms | 20.2091μs | 49.4827 KOps/s | 48.9503 KOps/s | |
test_getitem[range] | 0.1321ms | 35.9340μs | 27.8288 KOps/s | 28.1575 KOps/s | |
test_getitem[tuple] | 0.1045ms | 17.6389μs | 56.6928 KOps/s | 56.3888 KOps/s | |
test_getitem[list] | 0.2336ms | 31.8923μs | 31.3555 KOps/s | 31.6371 KOps/s | |
test_setitem_dim[int] | 39.4010μs | 18.3122μs | 54.6084 KOps/s | 56.3104 KOps/s | |
test_setitem_dim[slice_int] | 97.3710μs | 35.1315μs | 28.4645 KOps/s | 27.9596 KOps/s | |
test_setitem_dim[range] | 82.4210μs | 50.4190μs | 19.8338 KOps/s | 19.8001 KOps/s | |
test_setitem_dim[tuple] | 51.2910μs | 30.9215μs | 32.3400 KOps/s | 32.7637 KOps/s | |
test_setitem | 94.1020μs | 14.7527μs | 67.7842 KOps/s | 65.1968 KOps/s | |
test_set | 89.2510μs | 14.3294μs | 69.7868 KOps/s | 68.7660 KOps/s | |
test_set_shared | 1.5957ms | 0.1459ms | 6.8529 KOps/s | 6.9214 KOps/s | |
test_update | 0.9957ms | 17.3612μs | 57.5997 KOps/s | 53.4434 KOps/s | |
test_update_nested | 95.2810μs | 22.7634μs | 43.9302 KOps/s | 42.4510 KOps/s | |
test_update__nested | 0.1271ms | 23.9557μs | 41.7438 KOps/s | 42.9048 KOps/s | |
test_set_nested | 85.7610μs | 15.1247μs | 66.1168 KOps/s | 62.9976 KOps/s | |
test_set_nested_new | 94.3420μs | 18.1093μs | 55.2202 KOps/s | 56.3168 KOps/s | |
test_select | 96.1210μs | 29.8036μs | 33.5530 KOps/s | 34.1975 KOps/s | |
test_select_nested | 67.4910μs | 42.3229μs | 23.6279 KOps/s | 23.6845 KOps/s | |
test_exclude_nested | 93.5420μs | 57.4827μs | 17.3965 KOps/s | 17.1451 KOps/s | |
test_empty[True] | 0.3032ms | 0.2521ms | 3.9671 KOps/s | 3.9792 KOps/s | |
test_empty[False] | 3.4341μs | 0.7678μs | 1.3025 MOps/s | 1.3047 MOps/s | |
test_to | 85.3120μs | 55.0423μs | 18.1679 KOps/s | 17.8232 KOps/s | |
test_to_nonblocking | 78.3310μs | 45.5885μs | 21.9353 KOps/s | 22.0028 KOps/s | |
test_unbind_speed | 1.5721ms | 0.2300ms | 4.3479 KOps/s | 4.3151 KOps/s | |
test_unbind_speed_stack0 | 0.3073ms | 0.2287ms | 4.3725 KOps/s | 4.3083 KOps/s | |
test_unbind_speed_stack1 | 93.0841ms | 0.6513ms | 1.5354 KOps/s | 1.5319 KOps/s | |
test_split | 94.8592ms | 1.7011ms | 587.8417 Ops/s | 642.0571 Ops/s | |
test_chunk | 1.5476ms | 1.4353ms | 696.7122 Ops/s | 587.4610 Ops/s | |
test_consolidate[False-None] | 97.4198ms | 2.7751ms | 360.3420 Ops/s | 394.1560 Ops/s | |
test_consolidate[default-None] | 1.7908ms | 1.5871ms | 630.0840 Ops/s | 621.8761 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.8009ms | 1.6216ms | 616.6635 Ops/s | 612.2369 Ops/s | |
test_consolidate_njt[False-None] | 6.8788ms | 6.3364ms | 157.8193 Ops/s | 112.3847 Ops/s | |
test_to[False-False-None] | 1.8577ms | 1.7237ms | 580.1399 Ops/s | 586.5867 Ops/s | |
test_to[True-False-None] | 1.4001ms | 1.2359ms | 809.1050 Ops/s | 823.0252 Ops/s | |
test_to[within-False-None] | 0.3381s | 5.2388ms | 190.8841 Ops/s | 258.6629 Ops/s | |
test_to[True-default-None] | 5.3881ms | 5.0071ms | 199.7167 Ops/s | 197.3797 Ops/s | |
test_to_njt[False-False-None] | 7.0621ms | 6.7549ms | 148.0417 Ops/s | 143.4625 Ops/s | |
test_to_njt[True-False-None] | 5.8423ms | 5.4371ms | 183.9226 Ops/s | 179.1999 Ops/s | |
test_to_njt[within-False-None] | 12.1316ms | 11.7501ms | 85.1060 Ops/s | 82.3208 Ops/s | |
test_creation[device0] | 0.4651ms | 81.2486μs | 12.3079 KOps/s | 12.7789 KOps/s | |
test_creation_from_tensor | 0.4926ms | 84.5091μs | 11.8331 KOps/s | 12.2099 KOps/s | |
test_add_one[memmap_tensor0] | 0.4607ms | 7.0170μs | 142.5116 KOps/s | 145.7487 KOps/s | |
test_contiguous[memmap_tensor0] | 2.7345μs | 0.3873μs | 2.5821 MOps/s | 2.5404 MOps/s | |
test_stack[memmap_tensor0] | 31.1410μs | 4.2314μs | 236.3273 KOps/s | 232.9595 KOps/s | |
test_memmaptd_index | 1.8664ms | 0.2407ms | 4.1541 KOps/s | 4.1416 KOps/s | |
test_memmaptd_index_astensor | 0.8253ms | 0.3004ms | 3.3286 KOps/s | 3.3207 KOps/s | |
test_memmaptd_index_op | 0.9988ms | 0.5815ms | 1.7198 KOps/s | 1.6616 KOps/s | |
test_serialize_model | 0.1317s | 0.1306s | 7.6574 Ops/s | 7.6188 Ops/s | |
test_serialize_model_pickle | 1.3479s | 1.1851s | 0.8438 Ops/s | 0.8212 Ops/s | |
test_serialize_weights | 0.1319s | 0.1304s | 7.6677 Ops/s | 7.6153 Ops/s | |
test_serialize_weights_returnearly | 0.3774s | 67.0612ms | 14.9117 Ops/s | 10.5919 Ops/s | |
test_serialize_weights_pickle | 1.4745s | 1.2327s | 0.8112 Ops/s | 0.8387 Ops/s | |
test_reshape_pytree | 53.8510μs | 22.3262μs | 44.7905 KOps/s | 45.5258 KOps/s | |
test_reshape_td | 52.6210μs | 26.6260μs | 37.5572 KOps/s | 38.0967 KOps/s | |
test_view_pytree | 55.7210μs | 22.2527μs | 44.9383 KOps/s | 45.7024 KOps/s | |
test_view_td | 62.1410μs | 29.3372μs | 34.0864 KOps/s | 33.3265 KOps/s | |
test_unbind_pytree | 59.2810μs | 28.4804μs | 35.1119 KOps/s | 35.2717 KOps/s | |
test_unbind_td | 0.6355ms | 35.2031μs | 28.4066 KOps/s | 28.0174 KOps/s | |
test_split_pytree | 62.4010μs | 30.0955μs | 33.2276 KOps/s | 33.4044 KOps/s | |
test_split_td | 0.7949ms | 38.1611μs | 26.2047 KOps/s | 25.5353 KOps/s | |
test_add_pytree | 67.4910μs | 35.2577μs | 28.3626 KOps/s | 28.9313 KOps/s | |
test_add_td | 90.1110μs | 47.9926μs | 20.8365 KOps/s | 20.4727 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1807ms | 0.1177ms | 8.4994 KOps/s | 8.2701 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2182ms | 0.1257ms | 7.9583 KOps/s | 7.8870 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1327ms | 94.6304μs | 10.5674 KOps/s | 9.9062 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 1.1191ms | 0.1560ms | 6.4087 KOps/s | 6.3228 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 55.8010μs | 22.5517μs | 44.3426 KOps/s | 34.3122 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 58.7410μs | 26.3896μs | 37.8937 KOps/s | 37.6086 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2375ms | 65.5916μs | 15.2459 KOps/s | 14.9417 KOps/s | |
test_compile_copy_nested[pytree-eager] | 82.8510μs | 50.0644μs | 19.9743 KOps/s | 19.5778 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.2019ms | 0.1421ms | 7.0398 KOps/s | 7.1238 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.2998ms | 0.2084ms | 4.7986 KOps/s | 4.8020 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1481ms | 97.5700μs | 10.2491 KOps/s | 10.3723 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1041ms | 51.3372μs | 19.4790 KOps/s | 18.3600 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1752ms | 0.1368ms | 7.3076 KOps/s | 6.9662 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.5665ms | 0.5049ms | 1.9804 KOps/s | 1.9475 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3596ms | 0.2460ms | 4.0645 KOps/s | 3.9477 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1865ms | 0.1428ms | 7.0006 KOps/s | 6.6627 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1423ms | 60.6259μs | 16.4946 KOps/s | 15.5607 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1463ms | 97.9934μs | 10.2048 KOps/s | 9.6264 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4836ms | 0.4157ms | 2.4058 KOps/s | 2.3045 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1928ms | 0.1392ms | 7.1818 KOps/s | 7.4846 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 76.2610μs | 18.8551μs | 53.0361 KOps/s | 53.9262 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 65.6010μs | 27.3366μs | 36.5810 KOps/s | 36.3626 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1025ms | 68.7621μs | 14.5429 KOps/s | 14.5463 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1369ms | 51.2909μs | 19.4966 KOps/s | 19.3382 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.5697ms | 0.4333ms | 2.3079 KOps/s | 2.2844 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7484ms | 2.5753ms | 388.2978 Ops/s | 381.2226 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5682ms | 0.4253ms | 2.3512 KOps/s | 2.2829 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.8358ms | 2.7163ms | 368.1471 Ops/s | 370.7453 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.3581ms | 0.1154ms | 8.6646 KOps/s | 8.9724 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5910ms | 82.0770μs | 12.1837 KOps/s | 11.8831 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.3779ms | 0.1056ms | 9.4690 KOps/s | 9.5660 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.1247ms | 70.1039μs | 14.2645 KOps/s | 14.3573 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.1670ms | 0.1105ms | 9.0481 KOps/s | 9.0943 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1204ms | 68.6596μs | 14.5646 KOps/s | 14.2364 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.2587ms | 0.1000ms | 9.9974 KOps/s | 10.0675 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1411ms | 16.4927μs | 60.6328 KOps/s | 58.7572 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1627ms | 96.2567μs | 10.3889 KOps/s | 10.5804 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 47.4710μs | 15.7766μs | 63.3849 KOps/s | 63.9219 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1497ms | 95.6477μs | 10.4550 KOps/s | 10.4613 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 53.3710μs | 15.7327μs | 63.5619 KOps/s | 63.5371 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1557ms | 0.1010ms | 9.8972 KOps/s | 9.6673 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.6154ms | 16.4565μs | 60.7662 KOps/s | 59.9283 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1430ms | 95.4775μs | 10.4737 KOps/s | 10.4942 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 50.9110μs | 15.8379μs | 63.1398 KOps/s | 63.9975 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1560ms | 95.3516μs | 10.4875 KOps/s | 10.5191 KOps/s | |
test_compile_indexing[int-pytree-eager] | 48.8410μs | 15.7773μs | 63.3820 KOps/s | 63.9592 KOps/s | |
test_mod_add[eager] | 0.1497ms | 31.9164μs | 31.3319 KOps/s | 29.8940 KOps/s | |
test_mod_add[compile] | 0.3865ms | 76.2567μs | 13.1136 KOps/s | 13.1213 KOps/s | |
test_mod_add[compile-overhead] | 0.3157ms | 0.1657ms | 6.0342 KOps/s | 5.7490 KOps/s | |
test_mod_wrap[eager] | 0.3221ms | 0.2437ms | 4.1038 KOps/s | 4.0986 KOps/s | |
test_mod_wrap[compile] | 1.5679ms | 0.2789ms | 3.5850 KOps/s | 3.5605 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5216ms | 4.0491ms | 246.9711 Ops/s | 241.8535 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.8083ms | 1.3649ms | 732.6601 Ops/s | 677.1597 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.4699ms | 1.2635ms | 791.4513 Ops/s | 731.5239 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3699ms | 0.9220ms | 1.0846 KOps/s | 958.0745 Ops/s | |
test_seq_add[eager] | 0.5057ms | 98.3289μs | 10.1699 KOps/s | 10.3256 KOps/s | |
test_seq_add[compile] | 0.1817ms | 85.8556μs | 11.6475 KOps/s | 11.6985 KOps/s | |
test_seq_add[compile-overhead] | 0.5226ms | 0.1267ms | 7.8956 KOps/s | 7.8759 KOps/s | |
test_seq_wrap[eager] | 0.7856ms | 0.3831ms | 2.6102 KOps/s | 2.5777 KOps/s | |
test_seq_wrap[compile] | 0.3797ms | 0.2939ms | 3.4026 KOps/s | 3.3284 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2979ms | 0.2199ms | 4.5468 KOps/s | 4.5583 KOps/s | |
test_func_call_runtime[False-eager] | 0.8784ms | 0.7543ms | 1.3258 KOps/s | 1.2873 KOps/s | |
test_func_call_runtime[False-compile] | 0.8894ms | 0.7566ms | 1.3218 KOps/s | 1.3438 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4063ms | 0.3546ms | 2.8201 KOps/s | 2.8129 KOps/s | |
test_func_call_runtime[True-eager] | 1.0317ms | 0.8938ms | 1.1188 KOps/s | 1.0858 KOps/s | |
test_func_call_runtime[True-compile] | 0.9097ms | 0.7530ms | 1.3280 KOps/s | 1.2971 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5312ms | 0.3752ms | 2.6649 KOps/s | 2.6661 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8766ms | 0.7279ms | 1.3738 KOps/s | 1.3478 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8880ms | 0.7429ms | 1.3461 KOps/s | 1.3410 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4480ms | 0.3553ms | 2.8143 KOps/s | 2.8158 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.1341ms | 0.9935ms | 1.0066 KOps/s | 980.8138 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9536ms | 0.8106ms | 1.2337 KOps/s | 1.2540 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.6049ms | 0.3977ms | 2.5145 KOps/s | 2.4707 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5487ms | 2.0790ms | 480.9937 Ops/s | 478.8489 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.1822ms | 0.7919ms | 1.2629 KOps/s | 1.2453 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4459ms | 0.4000ms | 2.5003 KOps/s | 2.4744 KOps/s | |
test_distributed | 3.6096ms | 0.1747ms | 5.7255 KOps/s | 8.8582 KOps/s | |
test_tdmodule | 32.0200μs | 14.4279μs | 69.3101 KOps/s | 65.5955 KOps/s | |
test_tdmodule_dispatch | 87.0720μs | 28.2412μs | 35.4092 KOps/s | 34.0713 KOps/s | |
test_tdseq | 39.8500μs | 16.3133μs | 61.2999 KOps/s | 61.0235 KOps/s | |
test_tdseq_dispatch | 58.2410μs | 31.8864μs | 31.3614 KOps/s | 30.5972 KOps/s | |
test_instantiation_functorch | 1.6228ms | 1.5338ms | 651.9731 Ops/s | 647.0752 Ops/s | |
test_exec_functorch | 0.2112ms | 0.1501ms | 6.6632 KOps/s | 6.9853 KOps/s | |
test_exec_functional_call | 0.2580ms | 0.1511ms | 6.6197 KOps/s | 7.1202 KOps/s | |
test_exec_td_decorator | 0.3946ms | 0.1973ms | 5.0692 KOps/s | 5.4030 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8309ms | 0.7111ms | 1.4062 KOps/s | 1.4848 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8251ms | 0.7015ms | 1.4256 KOps/s | 1.4664 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7937ms | 0.6300ms | 1.5872 KOps/s | 1.6217 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7685ms | 0.6316ms | 1.5834 KOps/s | 1.6957 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.5331ms | 20.1653ms | 49.5901 Ops/s | 51.9841 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.3941ms | 20.1212ms | 49.6989 Ops/s | 51.8704 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 20.4276ms | 19.8827ms | 50.2949 Ops/s | 52.4493 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 20.5292ms | 19.2537ms | 51.9382 Ops/s | 52.2678 Ops/s | |
test_to_module_speed[True] | 2.1787ms | 0.9448ms | 1.0585 KOps/s | 1.0597 KOps/s | |
test_to_module_speed[False] | 1.0213ms | 0.9344ms | 1.0702 KOps/s | 1.0792 KOps/s | |
test_tc_init | 66.5410μs | 36.6598μs | 27.2778 KOps/s | 25.8074 KOps/s | |
test_tc_init_nested | 0.1117ms | 74.5634μs | 13.4114 KOps/s | 12.6998 KOps/s | |
test_tc_first_layer_tensor | 14.0873μs | 0.7086μs | 1.4113 MOps/s | 1.4155 MOps/s | |
test_tc_first_layer_nontensor | 24.6000μs | 2.3277μs | 429.6064 KOps/s | 439.3045 KOps/s | |
test_tc_second_layer_tensor | 8.6103μs | 1.4369μs | 695.9203 KOps/s | 707.7757 KOps/s | |
test_tc_second_layer_nontensor | 25.3810μs | 3.0579μs | 327.0266 KOps/s | 331.9561 KOps/s | |
test_unbind | 0.2276s | 12.0283ms | 83.1376 Ops/s | 150.8182 Ops/s | |
test_full_like | 9.4428ms | 9.1062ms | 109.8149 Ops/s | 108.3080 Ops/s | |
test_zeros_like | 4.8492ms | 4.3176ms | 231.6091 Ops/s | 231.0469 Ops/s | |
test_ones_like | 9.1784ms | 4.3359ms | 230.6316 Ops/s | 236.6159 Ops/s | |
test_clone | 6.6828ms | 6.3717ms | 156.9448 Ops/s | 156.6134 Ops/s | |
test_squeeze | 58.4510μs | 9.2526μs | 108.0773 KOps/s | 108.4997 KOps/s | |
test_unsqueeze | 0.1211ms | 70.9858μs | 14.0873 KOps/s | 14.2066 KOps/s | |
test_split | 0.4007ms | 0.1556ms | 6.4269 KOps/s | 6.3501 KOps/s | |
test_permute | 0.3269ms | 0.1734ms | 5.7662 KOps/s | 5.7373 KOps/s | |
test_stack | 51.6348ms | 50.6939ms | 19.7262 Ops/s | 19.7124 Ops/s | |
test_cat | 50.7155ms | 50.4571ms | 19.8188 Ops/s | 19.7706 Ops/s |
vmoens
added a commit
that referenced
this pull request
Nov 7, 2024
ghstack-source-id: 0b65b0a2dfb0cd7b5113e245c9444d3a0b55d085 Pull Request resolved: #1080
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
enhancement
New feature or request
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):