-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Doc] Add doc on export with nested keys #1085
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
vmoens
added a commit
that referenced
this pull request
Nov 13, 2024
ghstack-source-id: 9c95e2dba6751d93c20c66d0dba0d4219dc61c0b Pull Request resolved: #1085
facebook-github-bot
added
the
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
label
Nov 13, 2024
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 36.4980μs | 18.1656μs | 55.0492 KOps/s | 59.1778 KOps/s | |
test_plain_set_stack_nested | 75.4610μs | 18.1015μs | 55.2440 KOps/s | 58.4605 KOps/s | |
test_plain_set_nested_inplace | 57.9390μs | 19.8021μs | 50.4996 KOps/s | 53.5079 KOps/s | |
test_plain_set_stack_nested_inplace | 65.9330μs | 19.6561μs | 50.8747 KOps/s | 53.0978 KOps/s | |
test_items | 36.8080μs | 4.1184μs | 242.8122 KOps/s | 243.6537 KOps/s | |
test_items_nested | 0.4720ms | 0.3443ms | 2.9047 KOps/s | 2.9247 KOps/s | |
test_items_nested_locked | 0.7775ms | 0.3426ms | 2.9190 KOps/s | 2.9343 KOps/s | |
test_items_nested_leaf | 0.1267ms | 71.8537μs | 13.9172 KOps/s | 13.9170 KOps/s | |
test_items_stack_nested | 0.5600ms | 0.3458ms | 2.8919 KOps/s | 2.9083 KOps/s | |
test_items_stack_nested_leaf | 0.1364ms | 74.6056μs | 13.4038 KOps/s | 13.2599 KOps/s | |
test_items_stack_nested_locked | 0.6396ms | 0.3460ms | 2.8904 KOps/s | 2.9020 KOps/s | |
test_keys | 32.1700μs | 3.5074μs | 285.1139 KOps/s | 279.7986 KOps/s | |
test_keys_nested | 0.2574ms | 0.1362ms | 7.3409 KOps/s | 7.3941 KOps/s | |
test_keys_nested_locked | 1.6804ms | 0.1438ms | 6.9558 KOps/s | 7.1225 KOps/s | |
test_keys_nested_leaf | 0.2328ms | 0.1171ms | 8.5406 KOps/s | 8.5964 KOps/s | |
test_keys_stack_nested | 0.5271ms | 0.1377ms | 7.2646 KOps/s | 7.2948 KOps/s | |
test_keys_stack_nested_leaf | 0.2261ms | 0.1159ms | 8.6247 KOps/s | 8.6053 KOps/s | |
test_keys_stack_nested_locked | 0.2292ms | 0.1409ms | 7.0972 KOps/s | 7.0596 KOps/s | |
test_values | 8.5960μs | 1.0314μs | 969.5688 KOps/s | 954.9596 KOps/s | |
test_values_nested | 0.1041ms | 56.2475μs | 17.7786 KOps/s | 17.7358 KOps/s | |
test_values_nested_locked | 0.1030ms | 56.3342μs | 17.7512 KOps/s | 17.7573 KOps/s | |
test_values_nested_leaf | 0.4568ms | 60.7084μs | 16.4722 KOps/s | 16.2662 KOps/s | |
test_values_stack_nested | 0.1291ms | 57.5650μs | 17.3717 KOps/s | 17.2879 KOps/s | |
test_values_stack_nested_leaf | 0.1188ms | 60.5810μs | 16.5068 KOps/s | 16.6548 KOps/s | |
test_values_stack_nested_locked | 0.1096ms | 57.6151μs | 17.3566 KOps/s | 17.5970 KOps/s | |
test_membership | 12.4430μs | 0.9161μs | 1.0916 MOps/s | 1.1052 MOps/s | |
test_membership_nested | 42.2290μs | 2.8341μs | 352.8470 KOps/s | 360.0149 KOps/s | |
test_membership_nested_leaf | 43.9010μs | 2.8385μs | 352.2951 KOps/s | 354.5303 KOps/s | |
test_membership_stacked_nested | 46.6870μs | 2.8157μs | 355.1516 KOps/s | 354.2872 KOps/s | |
test_membership_stacked_nested_leaf | 29.2440μs | 2.8472μs | 351.2222 KOps/s | 358.1467 KOps/s | |
test_membership_nested_last | 56.9860μs | 4.1709μs | 239.7569 KOps/s | 245.6448 KOps/s | |
test_membership_nested_leaf_last | 20.7990μs | 4.1910μs | 238.6074 KOps/s | 243.0882 KOps/s | |
test_membership_stacked_nested_last | 50.2940μs | 13.6504μs | 73.2577 KOps/s | 226.4814 KOps/s | |
test_membership_stacked_nested_leaf_last | 31.6990μs | 13.8177μs | 72.3712 KOps/s | 245.5275 KOps/s | |
test_nested_getleaf | 0.1241ms | 11.2543μs | 88.8546 KOps/s | 92.3909 KOps/s | |
test_nested_get | 0.1291ms | 10.7453μs | 93.0636 KOps/s | 94.8823 KOps/s | |
test_stacked_getleaf | 40.6470μs | 11.1507μs | 89.6808 KOps/s | 90.7527 KOps/s | |
test_stacked_get | 45.7350μs | 10.5666μs | 94.6376 KOps/s | 94.0070 KOps/s | |
test_nested_getitemleaf | 53.1590μs | 11.6603μs | 85.7608 KOps/s | 88.2552 KOps/s | |
test_nested_getitem | 54.9120μs | 10.9267μs | 91.5186 KOps/s | 94.7652 KOps/s | |
test_stacked_getitemleaf | 42.3490μs | 11.6123μs | 86.1156 KOps/s | 87.5689 KOps/s | |
test_stacked_getitem | 51.3160μs | 10.8869μs | 91.8534 KOps/s | 94.1457 KOps/s | |
test_lock_nested | 2.6499ms | 0.4496ms | 2.2243 KOps/s | 1.7849 KOps/s | |
test_lock_stack_nested | 0.6446ms | 0.4044ms | 2.4730 KOps/s | 2.3218 KOps/s | |
test_unlock_nested | 0.9952ms | 0.3681ms | 2.7167 KOps/s | 2.6686 KOps/s | |
test_unlock_stack_nested | 0.5034ms | 0.3221ms | 3.1046 KOps/s | 2.9170 KOps/s | |
test_flatten_speed | 0.1604ms | 92.0995μs | 10.8578 KOps/s | 10.8454 KOps/s | |
test_unflatten_speed | 0.8616ms | 0.4892ms | 2.0442 KOps/s | 2.0714 KOps/s | |
test_common_ops | 3.2684ms | 0.7599ms | 1.3160 KOps/s | 1.3798 KOps/s | |
test_creation | 16.8710μs | 2.1033μs | 475.4513 KOps/s | 466.1922 KOps/s | |
test_creation_empty | 46.2560μs | 10.3305μs | 96.8007 KOps/s | 117.1606 KOps/s | |
test_creation_nested_1 | 65.5930μs | 13.1536μs | 76.0250 KOps/s | 87.1645 KOps/s | |
test_creation_nested_2 | 54.2610μs | 17.5063μs | 57.1222 KOps/s | 63.5160 KOps/s | |
test_clone | 64.0800μs | 13.4567μs | 74.3125 KOps/s | 76.8212 KOps/s | |
test_getitem[int] | 1.2424ms | 12.6440μs | 79.0890 KOps/s | 79.0421 KOps/s | |
test_getitem[slice_int] | 0.1362ms | 23.6753μs | 42.2381 KOps/s | 41.8904 KOps/s | |
test_getitem[range] | 0.1912ms | 46.9061μs | 21.3192 KOps/s | 20.7782 KOps/s | |
test_getitem[tuple] | 0.1289ms | 19.7816μs | 50.5520 KOps/s | 49.4537 KOps/s | |
test_getitem[list] | 0.1618ms | 42.1789μs | 23.7085 KOps/s | 22.9046 KOps/s | |
test_setitem_dim[int] | 54.7010μs | 24.6327μs | 40.5964 KOps/s | 38.6814 KOps/s | |
test_setitem_dim[slice_int] | 87.6130μs | 49.7169μs | 20.1139 KOps/s | 18.8182 KOps/s | |
test_setitem_dim[range] | 0.1580ms | 74.0474μs | 13.5049 KOps/s | 13.6468 KOps/s | |
test_setitem_dim[tuple] | 63.6790μs | 38.8033μs | 25.7710 KOps/s | 24.3932 KOps/s | |
test_setitem | 63.8590μs | 19.8144μs | 50.4684 KOps/s | 53.7423 KOps/s | |
test_set | 90.5780μs | 19.4002μs | 51.5458 KOps/s | 55.4881 KOps/s | |
test_set_shared | 5.2480ms | 0.1663ms | 6.0115 KOps/s | 6.1397 KOps/s | |
test_update | 0.1435ms | 22.1367μs | 45.1739 KOps/s | 51.5180 KOps/s | |
test_update_nested | 0.1233ms | 31.2108μs | 32.0402 KOps/s | 34.0267 KOps/s | |
test_update__nested | 0.3660ms | 32.4255μs | 30.8399 KOps/s | 30.5077 KOps/s | |
test_set_nested | 0.1060ms | 21.7784μs | 45.9170 KOps/s | 49.9110 KOps/s | |
test_set_nested_new | 0.1243ms | 26.4686μs | 37.7806 KOps/s | 40.2640 KOps/s | |
test_select | 0.2194ms | 41.9692μs | 23.8270 KOps/s | 24.1879 KOps/s | |
test_select_nested | 0.1190ms | 61.6285μs | 16.2263 KOps/s | 16.6137 KOps/s | |
test_exclude_nested | 0.1582ms | 75.7105μs | 13.2082 KOps/s | 13.2151 KOps/s | |
test_empty[True] | 0.5633ms | 0.3522ms | 2.8390 KOps/s | 2.8391 KOps/s | |
test_empty[False] | 11.8220μs | 1.2590μs | 794.2561 KOps/s | 804.0764 KOps/s | |
test_unbind_speed | 0.4636ms | 0.2678ms | 3.7335 KOps/s | 3.7430 KOps/s | |
test_unbind_speed_stack0 | 0.4958ms | 0.2549ms | 3.9229 KOps/s | 3.6167 KOps/s | |
test_unbind_speed_stack1 | 99.3551ms | 0.7416ms | 1.3485 KOps/s | 1.5838 KOps/s | |
test_split | 2.5903ms | 1.5889ms | 629.3822 Ops/s | 563.4149 Ops/s | |
test_chunk | 97.8825ms | 1.8909ms | 528.8618 Ops/s | 571.8623 Ops/s | |
test_consolidate_njt[False-None] | 8.4451ms | 8.1318ms | 122.9734 Ops/s | 120.8286 Ops/s | |
test_creation[device0] | 0.2121ms | 89.8004μs | 11.1358 KOps/s | 11.0593 KOps/s | |
test_creation_from_tensor | 0.2366ms | 94.2629μs | 10.6086 KOps/s | 10.6443 KOps/s | |
test_add_one[memmap_tensor0] | 0.1912ms | 4.8939μs | 204.3350 KOps/s | 208.3983 KOps/s | |
test_contiguous[memmap_tensor0] | 19.5360μs | 0.5224μs | 1.9144 MOps/s | 1.9095 MOps/s | |
test_stack[memmap_tensor0] | 27.4010μs | 3.4608μs | 288.9537 KOps/s | 294.7756 KOps/s | |
test_memmaptd_index | 0.9296ms | 0.2351ms | 4.2537 KOps/s | 4.1744 KOps/s | |
test_memmaptd_index_astensor | 0.6310ms | 0.3136ms | 3.1889 KOps/s | 3.1726 KOps/s | |
test_memmaptd_index_op | 0.9362ms | 0.5696ms | 1.7557 KOps/s | 1.8559 KOps/s | |
test_serialize_model | 0.1298s | 0.1147s | 8.7155 Ops/s | 7.5475 Ops/s | |
test_serialize_model_pickle | 0.4692s | 0.4027s | 2.4835 Ops/s | 2.4876 Ops/s | |
test_serialize_weights | 0.2167s | 0.1261s | 7.9323 Ops/s | 8.6814 Ops/s | |
test_serialize_weights_returnearly | 0.1741s | 0.1598s | 6.2567 Ops/s | 6.4628 Ops/s | |
test_serialize_weights_pickle | 0.4629s | 0.4030s | 2.4811 Ops/s | 1.1203 Ops/s | |
test_serialize_weights_filesystem | 0.1526s | 0.1431s | 6.9903 Ops/s | 6.9704 Ops/s | |
test_serialize_model_filesystem | 0.1648s | 0.1542s | 6.4863 Ops/s | 6.3310 Ops/s | |
test_reshape_pytree | 0.1002ms | 27.6786μs | 36.1290 KOps/s | 36.6170 KOps/s | |
test_reshape_td | 79.7280μs | 34.1527μs | 29.2803 KOps/s | 30.6619 KOps/s | |
test_view_pytree | 74.2890μs | 27.2426μs | 36.7072 KOps/s | 37.1651 KOps/s | |
test_view_td | 81.5820μs | 37.9613μs | 26.3427 KOps/s | 26.2388 KOps/s | |
test_unbind_pytree | 69.1890μs | 30.2611μs | 33.0458 KOps/s | 33.0538 KOps/s | |
test_unbind_td | 0.3624ms | 39.4027μs | 25.3789 KOps/s | 25.7891 KOps/s | |
test_split_pytree | 0.1105ms | 30.3289μs | 32.9718 KOps/s | 33.4183 KOps/s | |
test_split_td | 0.5385ms | 45.2143μs | 22.1169 KOps/s | 22.2668 KOps/s | |
test_add_pytree | 80.0990μs | 36.5995μs | 27.3227 KOps/s | 27.7275 KOps/s | |
test_add_td | 0.1672ms | 52.6210μs | 19.0038 KOps/s | 18.7643 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1480ms | 62.5413μs | 15.9894 KOps/s | 15.9242 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3811ms | 0.1620ms | 6.1724 KOps/s | 5.9473 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.3612ms | 47.3975μs | 21.0982 KOps/s | 21.4176 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.5984ms | 0.1215ms | 8.2294 KOps/s | 8.2524 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 0.5510ms | 25.8890μs | 38.6265 KOps/s | 37.8458 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1244ms | 54.3565μs | 18.3971 KOps/s | 18.1471 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1481ms | 81.7633μs | 12.2304 KOps/s | 12.1340 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.3227ms | 68.7795μs | 14.5392 KOps/s | 13.9792 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1825ms | 0.1048ms | 9.5435 KOps/s | 9.4432 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.4259ms | 0.2005ms | 4.9888 KOps/s | 4.9655 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1276ms | 45.5841μs | 21.9375 KOps/s | 22.0511 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.5094ms | 61.2334μs | 16.3309 KOps/s | 15.4703 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2229ms | 0.1028ms | 9.7243 KOps/s | 9.6158 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3501ms | 0.2032ms | 4.9216 KOps/s | 4.9746 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.7252ms | 0.2100ms | 4.7618 KOps/s | 4.6415 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.1749ms | 0.1044ms | 9.5816 KOps/s | 9.4006 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1446ms | 55.6787μs | 17.9602 KOps/s | 17.8396 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1007ms | 46.6152μs | 21.4522 KOps/s | 22.0886 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 1.3155ms | 0.1619ms | 6.1750 KOps/s | 6.2937 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.2611ms | 0.1034ms | 9.6753 KOps/s | 9.5378 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 59.8310μs | 21.6758μs | 46.1344 KOps/s | 46.6003 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1298ms | 59.5771μs | 16.7850 KOps/s | 16.6250 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1531ms | 82.6819μs | 12.0945 KOps/s | 12.0489 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1765ms | 73.5227μs | 13.6012 KOps/s | 14.2300 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3251ms | 0.2084ms | 4.7987 KOps/s | 4.6720 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.6706ms | 1.2836ms | 779.0372 Ops/s | 782.0202 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.3690ms | 0.2060ms | 4.8539 KOps/s | 4.8512 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.8682ms | 0.7855ms | 1.2731 KOps/s | 1.2875 KOps/s | |
test_compile_assign_and_add_stack[compile] | 1.5141ms | 0.4716ms | 2.1203 KOps/s | 2.1709 KOps/s | |
test_compile_assign_and_add_stack[eager] | 2.8439ms | 2.5410ms | 393.5441 Ops/s | 404.3178 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 88.9450μs | 35.9583μs | 27.8100 KOps/s | 27.9096 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5068ms | 31.8864μs | 31.3613 KOps/s | 30.3632 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 77.3440μs | 29.2388μs | 34.2011 KOps/s | 34.4612 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 91.7200μs | 24.1732μs | 41.3681 KOps/s | 41.0702 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2888ms | 30.5232μs | 32.7620 KOps/s | 33.6180 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.1151ms | 23.9795μs | 41.7023 KOps/s | 41.0842 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1295ms | 52.9850μs | 18.8733 KOps/s | 19.4253 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.5589ms | 19.5868μs | 51.0549 KOps/s | 49.7688 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1053ms | 44.9513μs | 22.2463 KOps/s | 22.5125 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 68.6480μs | 19.4262μs | 51.4768 KOps/s | 52.7548 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.5744ms | 45.3116μs | 22.0694 KOps/s | 21.5186 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 68.4780μs | 19.2853μs | 51.8531 KOps/s | 52.0817 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1313ms | 53.8536μs | 18.5689 KOps/s | 18.6521 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8644ms | 19.5321μs | 51.1978 KOps/s | 49.3185 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1154ms | 44.8734μs | 22.2849 KOps/s | 22.1888 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 70.6110μs | 19.1577μs | 52.1982 KOps/s | 52.3473 KOps/s | |
test_compile_indexing[int-pytree-compile] | 99.4570μs | 45.3683μs | 22.0418 KOps/s | 22.3242 KOps/s | |
test_compile_indexing[int-pytree-eager] | 67.1140μs | 19.0313μs | 52.5449 KOps/s | 52.4486 KOps/s | |
test_mod_add[eager] | 89.8270μs | 25.6616μs | 38.9688 KOps/s | 41.2519 KOps/s | |
test_mod_add[compile] | 0.1210ms | 44.3623μs | 22.5416 KOps/s | 22.6376 KOps/s | |
test_mod_add[compile-overhead] | 0.1058ms | 43.1100μs | 23.1965 KOps/s | 22.2131 KOps/s | |
test_mod_wrap[eager] | 0.3345ms | 0.2062ms | 4.8505 KOps/s | 4.7622 KOps/s | |
test_mod_wrap[compile] | 1.9770ms | 0.2028ms | 4.9310 KOps/s | 4.7497 KOps/s | |
test_mod_wrap[compile-overhead] | 2.0858ms | 0.2011ms | 4.9733 KOps/s | 4.8932 KOps/s | |
test_mod_wrap_and_backward[eager] | 18.6654ms | 12.7135ms | 78.6565 Ops/s | 82.1615 Ops/s | |
test_mod_wrap_and_backward[compile] | 15.8905ms | 12.5918ms | 79.4168 Ops/s | 73.8515 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 17.8905ms | 14.3922ms | 69.4820 Ops/s | 76.4820 Ops/s | |
test_seq_add[eager] | 0.1773ms | 90.4494μs | 11.0559 KOps/s | 11.1378 KOps/s | |
test_seq_add[compile] | 0.1374ms | 59.6804μs | 16.7559 KOps/s | 16.4657 KOps/s | |
test_seq_add[compile-overhead] | 0.5823ms | 58.3186μs | 17.1472 KOps/s | 16.8276 KOps/s | |
test_seq_wrap[eager] | 0.5345ms | 0.3846ms | 2.6003 KOps/s | 2.6054 KOps/s | |
test_seq_wrap[compile] | 0.6509ms | 0.2311ms | 4.3271 KOps/s | 4.4255 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3791ms | 0.2233ms | 4.4792 KOps/s | 4.3796 KOps/s | |
test_func_call_runtime[False-eager] | 1.3914ms | 0.5341ms | 1.8724 KOps/s | 1.8402 KOps/s | |
test_func_call_runtime[False-compile] | 0.9687ms | 0.4299ms | 2.3263 KOps/s | 2.3207 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.8162ms | 0.4377ms | 2.2849 KOps/s | 2.3200 KOps/s | |
test_func_call_runtime[True-eager] | 1.0128ms | 0.7468ms | 1.3390 KOps/s | 1.3174 KOps/s | |
test_func_call_runtime[True-compile] | 0.8457ms | 0.4723ms | 2.1175 KOps/s | 2.1281 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.7465ms | 0.4657ms | 2.1473 KOps/s | 2.1371 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.9612ms | 0.5290ms | 1.8903 KOps/s | 1.8960 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8688ms | 0.4314ms | 2.3179 KOps/s | 2.3630 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.5925ms | 0.4261ms | 2.3466 KOps/s | 2.3261 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0665ms | 0.8632ms | 1.1584 KOps/s | 1.1257 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8467ms | 0.4980ms | 2.0078 KOps/s | 2.0290 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.7042ms | 0.4883ms | 2.0481 KOps/s | 2.0190 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.6018ms | 1.8772ms | 532.7180 Ops/s | 523.0491 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 1.0648ms | 0.5170ms | 1.9341 KOps/s | 1.9334 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.8692ms | 0.5072ms | 1.9715 KOps/s | 1.9120 KOps/s | |
test_distributed | 0.3676ms | 0.1272ms | 7.8588 KOps/s | 7.5894 KOps/s | |
test_tdmodule | 37.0400μs | 17.9517μs | 55.7049 KOps/s | 57.2165 KOps/s | |
test_tdmodule_dispatch | 57.6270μs | 35.1305μs | 28.4653 KOps/s | 29.6107 KOps/s | |
test_tdseq | 52.4680μs | 21.1894μs | 47.1934 KOps/s | 51.6029 KOps/s | |
test_tdseq_dispatch | 84.2870μs | 41.4052μs | 24.1516 KOps/s | 26.4710 KOps/s | |
test_instantiation_functorch | 1.7709ms | 1.5150ms | 660.0604 Ops/s | 638.6139 Ops/s | |
test_exec_functorch | 0.4194ms | 0.1820ms | 5.4948 KOps/s | 5.5145 KOps/s | |
test_exec_functional_call | 0.3262ms | 0.1720ms | 5.8130 KOps/s | 5.7457 KOps/s | |
test_exec_td_decorator | 0.5103ms | 0.2239ms | 4.4656 KOps/s | 4.3291 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8656ms | 0.6298ms | 1.5878 KOps/s | 1.5276 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 1.0111ms | 0.6340ms | 1.5772 KOps/s | 1.5586 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7540ms | 0.5221ms | 1.9154 KOps/s | 1.9126 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.8185ms | 0.5232ms | 1.9114 KOps/s | 1.9143 KOps/s | |
test_to_module_speed[True] | 2.4876ms | 1.3178ms | 758.8308 Ops/s | 762.1887 Ops/s | |
test_to_module_speed[False] | 1.7411ms | 1.2848ms | 778.3605 Ops/s | 775.4965 Ops/s | |
test_tc_init | 80.7610μs | 44.9903μs | 22.2270 KOps/s | 24.1265 KOps/s | |
test_tc_init_nested | 0.1532ms | 87.4819μs | 11.4309 KOps/s | 11.6468 KOps/s | |
test_tc_first_layer_tensor | 38.0710μs | 1.5854μs | 630.7462 KOps/s | 651.3461 KOps/s | |
test_tc_first_layer_nontensor | 45.8550μs | 4.7058μs | 212.5037 KOps/s | 209.5071 KOps/s | |
test_tc_second_layer_tensor | 25.5380μs | 2.8887μs | 346.1727 KOps/s | 357.5127 KOps/s | |
test_tc_second_layer_nontensor | 54.9520μs | 6.0640μs | 164.9077 KOps/s | 164.9603 KOps/s | |
test_unbind | 0.2377s | 13.1311ms | 76.1553 Ops/s | 79.5685 Ops/s | |
test_full_like | 18.0750ms | 12.6602ms | 78.9874 Ops/s | 126.1551 Ops/s | |
test_zeros_like | 11.0435ms | 7.2793ms | 137.3762 Ops/s | 325.5591 Ops/s | |
test_ones_like | 15.6896ms | 8.9808ms | 111.3487 Ops/s | 279.2304 Ops/s | |
test_clone | 13.9543ms | 10.1643ms | 98.3840 Ops/s | 171.4897 Ops/s | |
test_squeeze | 65.0610μs | 11.4967μs | 86.9818 KOps/s | 83.8284 KOps/s | |
test_unsqueeze | 0.3391ms | 86.4820μs | 11.5631 KOps/s | 11.2229 KOps/s | |
test_split | 0.3334ms | 0.1862ms | 5.3704 KOps/s | 5.0803 KOps/s | |
test_permute | 0.3474ms | 0.2133ms | 4.6888 KOps/s | 4.5805 KOps/s | |
test_stack | 28.0831ms | 25.3803ms | 39.4007 Ops/s | 37.6397 Ops/s | |
test_cat | 27.7997ms | 25.4211ms | 39.3374 Ops/s | 38.6855 Ops/s |
|
Name | Max | Mean | Ops | Ops on Repo HEAD
|
Change |
---|---|---|---|---|---|
test_plain_set_nested | 33.1120μs | 10.1470μs | 98.5512 KOps/s | 96.5537 KOps/s | |
test_plain_set_stack_nested | 43.7120μs | 10.2240μs | 97.8093 KOps/s | 96.0099 KOps/s | |
test_plain_set_nested_inplace | 41.1820μs | 11.0459μs | 90.5311 KOps/s | 89.0064 KOps/s | |
test_plain_set_stack_nested_inplace | 94.4760μs | 11.0724μs | 90.3147 KOps/s | 89.0011 KOps/s | |
test_items | 30.0220μs | 3.0836μs | 324.2959 KOps/s | 339.3715 KOps/s | |
test_items_nested | 0.3471ms | 0.3208ms | 3.1168 KOps/s | 3.1716 KOps/s | |
test_items_nested_locked | 0.4603ms | 0.3225ms | 3.1003 KOps/s | 3.0978 KOps/s | |
test_items_nested_leaf | 81.7240μs | 57.9632μs | 17.2523 KOps/s | 17.1776 KOps/s | |
test_items_stack_nested | 0.3977ms | 0.3231ms | 3.0947 KOps/s | 3.1385 KOps/s | |
test_items_stack_nested_leaf | 88.9950μs | 59.5566μs | 16.7907 KOps/s | 16.6931 KOps/s | |
test_items_stack_nested_locked | 0.3649ms | 0.3277ms | 3.0520 KOps/s | 3.0993 KOps/s | |
test_keys | 30.1110μs | 3.4904μs | 286.4966 KOps/s | 287.2399 KOps/s | |
test_keys_nested | 97.1460μs | 70.1090μs | 14.2635 KOps/s | 14.2502 KOps/s | |
test_keys_nested_locked | 0.7225ms | 75.3495μs | 13.2715 KOps/s | 13.1211 KOps/s | |
test_keys_nested_leaf | 88.7760μs | 61.3020μs | 16.3127 KOps/s | 16.2937 KOps/s | |
test_keys_stack_nested | 99.8960μs | 70.5177μs | 14.1808 KOps/s | 14.0909 KOps/s | |
test_keys_stack_nested_leaf | 0.2363ms | 61.3676μs | 16.2953 KOps/s | 16.0587 KOps/s | |
test_keys_stack_nested_locked | 0.1255ms | 75.8018μs | 13.1923 KOps/s | 13.0610 KOps/s | |
test_values | 5.1737μs | 0.8433μs | 1.1858 MOps/s | 1.1818 MOps/s | |
test_values_nested | 72.4650μs | 31.7039μs | 31.5418 KOps/s | 31.5949 KOps/s | |
test_values_nested_locked | 61.5840μs | 33.1832μs | 30.1358 KOps/s | 30.1615 KOps/s | |
test_values_nested_leaf | 65.7240μs | 34.2050μs | 29.2355 KOps/s | 29.5113 KOps/s | |
test_values_stack_nested | 0.1074ms | 32.0359μs | 31.2150 KOps/s | 30.9488 KOps/s | |
test_values_stack_nested_leaf | 62.3230μs | 34.7844μs | 28.7485 KOps/s | 28.9564 KOps/s | |
test_values_stack_nested_locked | 0.1012ms | 33.8804μs | 29.5156 KOps/s | 29.6710 KOps/s | |
test_membership | 1.8781μs | 0.5088μs | 1.9653 MOps/s | 1.9600 MOps/s | |
test_membership_nested | 14.6210μs | 1.8878μs | 529.7299 KOps/s | 526.3749 KOps/s | |
test_membership_nested_leaf | 28.0920μs | 1.8553μs | 538.9960 KOps/s | 544.6213 KOps/s | |
test_membership_stacked_nested | 33.6120μs | 2.0164μs | 495.9446 KOps/s | 515.3778 KOps/s | |
test_membership_stacked_nested_leaf | 25.4010μs | 1.9609μs | 509.9808 KOps/s | 512.9427 KOps/s | |
test_membership_nested_last | 35.7730μs | 2.8236μs | 354.1547 KOps/s | 360.0135 KOps/s | |
test_membership_nested_leaf_last | 35.2820μs | 2.7958μs | 357.6774 KOps/s | 359.8043 KOps/s | |
test_membership_stacked_nested_last | 32.5220μs | 5.3356μs | 187.4206 KOps/s | 284.3248 KOps/s | |
test_membership_stacked_nested_leaf_last | 27.1520μs | 5.2653μs | 189.9245 KOps/s | 284.9581 KOps/s | |
test_nested_getleaf | 35.4320μs | 6.0374μs | 165.6354 KOps/s | 169.3894 KOps/s | |
test_nested_get | 38.0320μs | 5.7078μs | 175.1989 KOps/s | 176.5566 KOps/s | |
test_stacked_getleaf | 37.1120μs | 6.0744μs | 164.6243 KOps/s | 167.7135 KOps/s | |
test_stacked_get | 25.4520μs | 5.7611μs | 173.5779 KOps/s | 176.2487 KOps/s | |
test_nested_getitemleaf | 30.8920μs | 6.0805μs | 164.4593 KOps/s | 164.6292 KOps/s | |
test_nested_getitem | 40.0530μs | 5.7799μs | 173.0145 KOps/s | 173.6848 KOps/s | |
test_stacked_getitemleaf | 30.5510μs | 6.0951μs | 164.0658 KOps/s | 164.8707 KOps/s | |
test_stacked_getitem | 25.8620μs | 5.7753μs | 173.1516 KOps/s | 174.9702 KOps/s | |
test_lock_nested | 9.5550ms | 0.3757ms | 2.6620 KOps/s | 2.7008 KOps/s | |
test_lock_stack_nested | 0.3704ms | 0.3389ms | 2.9508 KOps/s | 2.9553 KOps/s | |
test_unlock_nested | 0.7194ms | 0.3130ms | 3.1944 KOps/s | 3.2405 KOps/s | |
test_unlock_stack_nested | 0.3247ms | 0.2784ms | 3.5914 KOps/s | 3.6181 KOps/s | |
test_flatten_speed | 0.1082ms | 73.2165μs | 13.6581 KOps/s | 13.8097 KOps/s | |
test_unflatten_speed | 0.3294ms | 0.2903ms | 3.4447 KOps/s | 3.4477 KOps/s | |
test_common_ops | 1.7365ms | 0.5833ms | 1.7144 KOps/s | 1.7025 KOps/s | |
test_creation | 99.9260μs | 1.4748μs | 678.0674 KOps/s | 676.3143 KOps/s | |
test_creation_empty | 32.9020μs | 6.5692μs | 152.2248 KOps/s | 144.3122 KOps/s | |
test_creation_nested_1 | 30.5810μs | 8.1391μs | 122.8639 KOps/s | 116.5879 KOps/s | |
test_creation_nested_2 | 41.7730μs | 10.6908μs | 93.5385 KOps/s | 92.1618 KOps/s | |
test_clone | 56.7140μs | 10.8276μs | 92.3567 KOps/s | 87.0744 KOps/s | |
test_getitem[int] | 94.8782ms | 16.1897μs | 61.7677 KOps/s | 89.7604 KOps/s | |
test_getitem[slice_int] | 0.1115ms | 20.7339μs | 48.2301 KOps/s | 48.3227 KOps/s | |
test_getitem[range] | 0.1288ms | 37.3463μs | 26.7764 KOps/s | 25.8146 KOps/s | |
test_getitem[tuple] | 0.1004ms | 17.8477μs | 56.0295 KOps/s | 54.1287 KOps/s | |
test_getitem[list] | 0.2259ms | 33.1004μs | 30.2111 KOps/s | 28.9965 KOps/s | |
test_setitem_dim[int] | 38.3120μs | 18.5136μs | 54.0144 KOps/s | 51.9232 KOps/s | |
test_setitem_dim[slice_int] | 61.2530μs | 37.5918μs | 26.6016 KOps/s | 26.5881 KOps/s | |
test_setitem_dim[range] | 77.6350μs | 52.3830μs | 19.0901 KOps/s | 18.6893 KOps/s | |
test_setitem_dim[tuple] | 60.3140μs | 32.0399μs | 31.2111 KOps/s | 31.3759 KOps/s | |
test_setitem | 76.3050μs | 14.7854μs | 67.6345 KOps/s | 65.0181 KOps/s | |
test_set | 78.8350μs | 14.4095μs | 69.3986 KOps/s | 67.2966 KOps/s | |
test_set_shared | 1.8384ms | 0.1475ms | 6.7803 KOps/s | 6.7122 KOps/s | |
test_update | 0.3183ms | 16.4937μs | 60.6294 KOps/s | 58.6871 KOps/s | |
test_update_nested | 88.8750μs | 21.5052μs | 46.5003 KOps/s | 45.7319 KOps/s | |
test_update__nested | 0.4510ms | 25.0126μs | 39.9798 KOps/s | 39.3120 KOps/s | |
test_set_nested | 93.1750μs | 16.5989μs | 60.2451 KOps/s | 62.3631 KOps/s | |
test_set_nested_new | 99.8260μs | 17.6788μs | 56.5649 KOps/s | 55.1754 KOps/s | |
test_select | 0.2082ms | 30.8768μs | 32.3868 KOps/s | 33.2666 KOps/s | |
test_select_nested | 75.7640μs | 41.8317μs | 23.9053 KOps/s | 24.1562 KOps/s | |
test_exclude_nested | 95.8060μs | 58.7530μs | 17.0204 KOps/s | 16.8931 KOps/s | |
test_empty[True] | 0.3068ms | 0.2540ms | 3.9374 KOps/s | 3.9698 KOps/s | |
test_empty[False] | 3.3422μs | 0.7383μs | 1.3545 MOps/s | 1.3427 MOps/s | |
test_to | 83.8050μs | 55.1633μs | 18.1280 KOps/s | 18.2861 KOps/s | |
test_to_nonblocking | 0.1080ms | 48.3275μs | 20.6921 KOps/s | 21.6118 KOps/s | |
test_unbind_speed | 0.2811ms | 0.2344ms | 4.2656 KOps/s | 4.3036 KOps/s | |
test_unbind_speed_stack0 | 0.2742ms | 0.2355ms | 4.2468 KOps/s | 4.2656 KOps/s | |
test_unbind_speed_stack1 | 95.7920ms | 0.6511ms | 1.5360 KOps/s | 1.5462 KOps/s | |
test_split | 97.2017ms | 1.5894ms | 629.1713 Ops/s | 616.8234 Ops/s | |
test_chunk | 99.2729ms | 1.5949ms | 626.9944 Ops/s | 564.7831 Ops/s | |
test_consolidate[False-None] | 98.9949ms | 2.8328ms | 353.0047 Ops/s | 380.9037 Ops/s | |
test_consolidate[default-None] | 1.7832ms | 1.6754ms | 596.8699 Ops/s | 585.4601 Ops/s | |
test_consolidate[reduce-overhead-None] | 1.7951ms | 1.7090ms | 585.1335 Ops/s | 575.8957 Ops/s | |
test_consolidate_njt[False-None] | 6.7808ms | 6.4881ms | 154.1276 Ops/s | 154.3975 Ops/s | |
test_to[False-False-None] | 1.7630ms | 1.6635ms | 601.1344 Ops/s | 604.8978 Ops/s | |
test_to[True-False-None] | 1.5843ms | 1.3280ms | 753.0290 Ops/s | 775.6307 Ops/s | |
test_to[within-False-None] | 4.1723ms | 3.9087ms | 255.8402 Ops/s | 249.1144 Ops/s | |
test_to[True-default-None] | 5.3422ms | 5.1196ms | 195.3291 Ops/s | 194.8812 Ops/s | |
test_to_njt[False-False-None] | 7.3122ms | 6.8824ms | 145.2978 Ops/s | 145.0854 Ops/s | |
test_to_njt[True-False-None] | 5.5762ms | 5.4544ms | 183.3380 Ops/s | 183.8119 Ops/s | |
test_to_njt[within-False-None] | 12.5085ms | 12.0433ms | 83.0337 Ops/s | 83.2651 Ops/s | |
test_creation[device0] | 0.4835ms | 78.0752μs | 12.8082 KOps/s | 12.4663 KOps/s | |
test_creation_from_tensor | 0.5239ms | 84.1280μs | 11.8867 KOps/s | 11.9133 KOps/s | |
test_add_one[memmap_tensor0] | 0.4525ms | 6.8055μs | 146.9394 KOps/s | 144.2380 KOps/s | |
test_contiguous[memmap_tensor0] | 2.1076μs | 0.4108μs | 2.4345 MOps/s | 2.4628 MOps/s | |
test_stack[memmap_tensor0] | 37.4830μs | 4.3218μs | 231.3836 KOps/s | 218.9056 KOps/s | |
test_memmaptd_index | 1.6140ms | 0.2409ms | 4.1504 KOps/s | 3.9865 KOps/s | |
test_memmaptd_index_astensor | 0.6081ms | 0.3000ms | 3.3338 KOps/s | 3.2503 KOps/s | |
test_memmaptd_index_op | 1.0365ms | 0.5574ms | 1.7941 KOps/s | 1.7016 KOps/s | |
test_serialize_model | 0.1315s | 0.1303s | 7.6764 Ops/s | 7.6768 Ops/s | |
test_serialize_model_pickle | 1.3501s | 1.2122s | 0.8249 Ops/s | 0.8250 Ops/s | |
test_serialize_weights | 0.1303s | 0.1296s | 7.7141 Ops/s | 7.7285 Ops/s | |
test_serialize_weights_returnearly | 45.7506ms | 41.1024ms | 24.3295 Ops/s | 14.5057 Ops/s | |
test_serialize_weights_pickle | 1.3810s | 1.1953s | 0.8366 Ops/s | 0.8187 Ops/s | |
test_reshape_pytree | 60.7540μs | 21.9681μs | 45.5205 KOps/s | 43.5428 KOps/s | |
test_reshape_td | 61.9040μs | 26.6340μs | 37.5460 KOps/s | 37.6722 KOps/s | |
test_view_pytree | 0.1373ms | 21.9080μs | 45.6455 KOps/s | 44.3586 KOps/s | |
test_view_td | 68.3240μs | 29.3583μs | 34.0619 KOps/s | 32.2643 KOps/s | |
test_unbind_pytree | 65.1940μs | 28.1559μs | 35.5166 KOps/s | 34.9968 KOps/s | |
test_unbind_td | 0.7741ms | 36.1798μs | 27.6398 KOps/s | 27.6237 KOps/s | |
test_split_pytree | 91.4250μs | 29.5628μs | 33.8263 KOps/s | 32.5247 KOps/s | |
test_split_td | 1.0058ms | 38.7271μs | 25.8217 KOps/s | 25.3876 KOps/s | |
test_add_pytree | 78.8040μs | 34.4722μs | 29.0088 KOps/s | 27.8401 KOps/s | |
test_add_td | 89.4550μs | 43.3170μs | 23.0856 KOps/s | 20.3112 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.2726ms | 0.1225ms | 8.1663 KOps/s | 8.0371 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2153ms | 0.1238ms | 8.0800 KOps/s | 8.0016 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1580ms | 95.5956μs | 10.4607 KOps/s | 10.0629 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2949ms | 0.1505ms | 6.6454 KOps/s | 6.6209 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 98.2860μs | 24.1603μs | 41.3902 KOps/s | 41.6063 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 57.6730μs | 26.9046μs | 37.1684 KOps/s | 36.9605 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1865ms | 64.7237μs | 15.4503 KOps/s | 15.0838 KOps/s | |
test_compile_copy_nested[pytree-eager] | 90.9860μs | 50.1905μs | 19.9241 KOps/s | 19.9334 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.1821ms | 0.1432ms | 6.9847 KOps/s | 6.8354 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3019ms | 0.2086ms | 4.7939 KOps/s | 4.9082 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.2405ms | 98.7313μs | 10.1285 KOps/s | 10.2697 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1926ms | 51.1732μs | 19.5415 KOps/s | 19.7663 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.1942ms | 0.1367ms | 7.3140 KOps/s | 7.0311 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6313ms | 0.4831ms | 2.0699 KOps/s | 2.0365 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.3537ms | 0.2481ms | 4.0306 KOps/s | 4.0434 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.2469ms | 0.1460ms | 6.8498 KOps/s | 6.9157 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1857ms | 61.1560μs | 16.3516 KOps/s | 15.8463 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.2384ms | 99.7412μs | 10.0259 KOps/s | 9.7677 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.4716ms | 0.4096ms | 2.4413 KOps/s | 2.4604 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.1829ms | 0.1391ms | 7.1887 KOps/s | 7.3618 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 54.0930μs | 20.0560μs | 49.8604 KOps/s | 51.1523 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1226ms | 26.6795μs | 37.4819 KOps/s | 36.4073 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1181ms | 69.1783μs | 14.4554 KOps/s | 14.2193 KOps/s | |
test_compile_copy_flat[pytree-eager] | 81.2650μs | 51.4272μs | 19.4450 KOps/s | 19.2042 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 1.6024ms | 0.3895ms | 2.5671 KOps/s | 2.2653 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7351ms | 2.6621ms | 375.6481 Ops/s | 378.0984 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 1.5895ms | 0.4321ms | 2.3141 KOps/s | 2.2879 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 2.7437ms | 2.6788ms | 373.3051 Ops/s | 372.0957 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.7266ms | 0.1135ms | 8.8137 KOps/s | 8.2654 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.5487ms | 78.7258μs | 12.7023 KOps/s | 12.0091 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.6160ms | 0.1056ms | 9.4719 KOps/s | 9.0407 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 0.2182ms | 67.4926μs | 14.8164 KOps/s | 13.6979 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2439ms | 0.1065ms | 9.3904 KOps/s | 8.8535 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 0.2164ms | 67.8297μs | 14.7428 KOps/s | 13.7786 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1561ms | 0.1008ms | 9.9190 KOps/s | 9.9297 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1488ms | 17.0039μs | 58.8099 KOps/s | 56.3971 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.2237ms | 96.0300μs | 10.4134 KOps/s | 10.2220 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 48.8630μs | 15.8160μs | 63.2270 KOps/s | 61.7365 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2428ms | 96.6285μs | 10.3489 KOps/s | 10.2170 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 49.0430μs | 15.6848μs | 63.7559 KOps/s | 59.9282 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.2067ms | 0.1013ms | 9.8734 KOps/s | 9.8284 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.5808ms | 16.9872μs | 58.8678 KOps/s | 58.0519 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2444ms | 96.6128μs | 10.3506 KOps/s | 10.2635 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1581ms | 15.8049μs | 63.2717 KOps/s | 62.0046 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.2707ms | 96.7438μs | 10.3366 KOps/s | 10.1844 KOps/s | |
test_compile_indexing[int-pytree-eager] | 84.0950μs | 15.8197μs | 63.2125 KOps/s | 61.4878 KOps/s | |
test_mod_add[eager] | 75.2540μs | 30.4630μs | 32.8268 KOps/s | 31.9960 KOps/s | |
test_mod_add[compile] | 0.3799ms | 76.8804μs | 13.0072 KOps/s | 13.0972 KOps/s | |
test_mod_add[compile-overhead] | 0.3206ms | 0.1640ms | 6.0964 KOps/s | 5.8284 KOps/s | |
test_mod_wrap[eager] | 0.3554ms | 0.2413ms | 4.1434 KOps/s | 3.9553 KOps/s | |
test_mod_wrap[compile] | 1.6151ms | 0.2885ms | 3.4668 KOps/s | 3.5321 KOps/s | |
test_mod_wrap[compile-overhead] | 7.3906ms | 3.7863ms | 264.1101 Ops/s | 262.5025 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.5817ms | 1.3729ms | 728.4083 Ops/s | 684.0154 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.3489ms | 1.2604ms | 793.4145 Ops/s | 727.7363 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3911ms | 0.9201ms | 1.0869 KOps/s | 938.7251 Ops/s | |
test_seq_add[eager] | 0.1656ms | 97.0371μs | 10.3053 KOps/s | 10.3429 KOps/s | |
test_seq_add[compile] | 0.2631ms | 86.3965μs | 11.5745 KOps/s | 11.6556 KOps/s | |
test_seq_add[compile-overhead] | 0.2740ms | 0.1328ms | 7.5295 KOps/s | 7.8738 KOps/s | |
test_seq_wrap[eager] | 0.4938ms | 0.3773ms | 2.6503 KOps/s | 2.6168 KOps/s | |
test_seq_wrap[compile] | 0.4338ms | 0.3125ms | 3.2001 KOps/s | 3.3346 KOps/s | |
test_seq_wrap[compile-overhead] | 0.3617ms | 0.2233ms | 4.4789 KOps/s | 4.4623 KOps/s | |
test_func_call_runtime[False-eager] | 1.2227ms | 0.7820ms | 1.2787 KOps/s | 1.3316 KOps/s | |
test_func_call_runtime[False-compile] | 0.8769ms | 0.7417ms | 1.3483 KOps/s | 1.3507 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4033ms | 0.3579ms | 2.7942 KOps/s | 2.7938 KOps/s | |
test_func_call_runtime[True-eager] | 1.0533ms | 0.9076ms | 1.1018 KOps/s | 1.0990 KOps/s | |
test_func_call_runtime[True-compile] | 0.9705ms | 0.7643ms | 1.3083 KOps/s | 1.3127 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5033ms | 0.3789ms | 2.6390 KOps/s | 2.6452 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8798ms | 0.7522ms | 1.3295 KOps/s | 1.3355 KOps/s | |
test_func_call_cm_runtime[False-compile] | 1.1309ms | 0.7429ms | 1.3460 KOps/s | 1.3393 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.7697ms | 0.3599ms | 2.7783 KOps/s | 2.7739 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.4394ms | 1.0023ms | 997.7357 Ops/s | 982.0657 Ops/s | |
test_func_call_cm_runtime[True-compile] | 1.1875ms | 0.7890ms | 1.2674 KOps/s | 1.2633 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4732ms | 0.4037ms | 2.4770 KOps/s | 2.4486 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5306ms | 2.0868ms | 479.1965 Ops/s | 475.8587 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9380ms | 0.7999ms | 1.2502 KOps/s | 1.2429 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.5076ms | 0.4068ms | 2.4580 KOps/s | 2.4364 KOps/s | |
test_distributed | 6.0240ms | 0.2342ms | 4.2704 KOps/s | 8.6239 KOps/s | |
test_tdmodule | 99.3560μs | 13.4995μs | 74.0768 KOps/s | 75.2015 KOps/s | |
test_tdmodule_dispatch | 43.8930μs | 25.9234μs | 38.5751 KOps/s | 37.5524 KOps/s | |
test_tdseq | 28.5710μs | 14.8156μs | 67.4964 KOps/s | 66.1880 KOps/s | |
test_tdseq_dispatch | 47.3630μs | 28.9024μs | 34.5992 KOps/s | 33.4448 KOps/s | |
test_instantiation_functorch | 1.7221ms | 1.5527ms | 644.0234 Ops/s | 634.8705 Ops/s | |
test_exec_functorch | 0.2952ms | 0.1491ms | 6.7049 KOps/s | 6.7619 KOps/s | |
test_exec_functional_call | 0.2000ms | 0.1395ms | 7.1686 KOps/s | 7.0317 KOps/s | |
test_exec_td_decorator | 0.3706ms | 0.1858ms | 5.3807 KOps/s | 5.3346 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.8235ms | 0.6848ms | 1.4602 KOps/s | 1.4804 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8617ms | 0.6883ms | 1.4529 KOps/s | 1.4781 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7777ms | 0.6021ms | 1.6608 KOps/s | 1.6770 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7601ms | 0.6054ms | 1.6517 KOps/s | 1.6772 KOps/s | |
test_vmap_transformer_speed_decorator[True-True] | 19.6480ms | 19.1562ms | 52.2025 Ops/s | 52.2558 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.0422ms | 19.3957ms | 51.5578 Ops/s | 52.0415 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 19.3948ms | 19.0782ms | 52.4159 Ops/s | 52.5483 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.5640ms | 19.0673ms | 52.4457 Ops/s | 52.4215 Ops/s | |
test_to_module_speed[True] | 1.3133ms | 0.9352ms | 1.0693 KOps/s | 1.0687 KOps/s | |
test_to_module_speed[False] | 1.3654ms | 0.9230ms | 1.0834 KOps/s | 1.0793 KOps/s | |
test_tc_init | 74.6450μs | 34.4009μs | 29.0690 KOps/s | 30.0003 KOps/s | |
test_tc_init_nested | 0.4544ms | 69.9359μs | 14.2988 KOps/s | 14.6776 KOps/s | |
test_tc_first_layer_tensor | 54.4004μs | 0.7040μs | 1.4204 MOps/s | 1.4181 MOps/s | |
test_tc_first_layer_nontensor | 18.7410μs | 2.4421μs | 409.4920 KOps/s | 432.0806 KOps/s | |
test_tc_second_layer_tensor | 99.4860μs | 1.4253μs | 701.6029 KOps/s | 696.4832 KOps/s | |
test_tc_second_layer_nontensor | 0.3794ms | 3.1916μs | 313.3220 KOps/s | 330.1123 KOps/s | |
test_unbind | 0.2253s | 9.8406ms | 101.6195 Ops/s | 148.1572 Ops/s | |
test_full_like | 10.4890ms | 9.5714ms | 104.4777 Ops/s | 101.5904 Ops/s | |
test_zeros_like | 4.8977ms | 4.3890ms | 227.8418 Ops/s | 113.3268 Ops/s | |
test_ones_like | 9.4797ms | 7.3074ms | 136.8484 Ops/s | 227.0226 Ops/s | |
test_clone | 7.7567ms | 6.7716ms | 147.6748 Ops/s | 144.8250 Ops/s | |
test_squeeze | 58.0530μs | 9.3245μs | 107.2443 KOps/s | 107.2626 KOps/s | |
test_unsqueeze | 0.4690ms | 73.7629μs | 13.5570 KOps/s | 13.7614 KOps/s | |
test_split | 0.3910ms | 0.1590ms | 6.2910 KOps/s | 6.2506 KOps/s | |
test_permute | 0.6073ms | 0.1907ms | 5.2425 KOps/s | 5.3514 KOps/s | |
test_stack | 52.8574ms | 51.9531ms | 19.2481 Ops/s | 18.8300 Ops/s | |
test_cat | 52.3091ms | 51.5668ms | 19.3923 Ops/s | 19.1664 Ops/s |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Stack from ghstack (oldest at bottom):