[CI] Fix windows wheels #1006
Merged
vmoens added a commit that referenced this pull request Sep 23, 2024
ghstack-source-id: e5c9fd8a8534fef623982fe435cadaf0a9c4703a
Pull Request resolved: #1006
facebook-github-bot added the CLA Signed label Sep 23, 2024
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 44.7130μs | 18.9368μs | 52.8072 KOps/s | 50.6421 KOps/s | |
test_plain_set_stack_nested | 47.9200μs | 19.1687μs | 52.1683 KOps/s | 49.2244 KOps/s | |
test_plain_set_nested_inplace | 57.8690μs | 20.7765μs | 48.1314 KOps/s | 46.8178 KOps/s | |
test_plain_set_stack_nested_inplace | 69.2000μs | 20.8835μs | 47.8846 KOps/s | 46.4384 KOps/s | |
test_items | 99.5360μs | 4.2003μs | 238.0800 KOps/s | 243.7985 KOps/s | |
test_items_nested | 0.7000ms | 0.3716ms | 2.6912 KOps/s | 2.7484 KOps/s | |
test_items_nested_locked | 0.4381ms | 0.3672ms | 2.7236 KOps/s | 2.7655 KOps/s | |
test_items_nested_leaf | 0.1470ms | 67.7062μs | 14.7697 KOps/s | 14.4723 KOps/s | |
test_items_stack_nested | 0.5172ms | 0.3717ms | 2.6905 KOps/s | 2.7131 KOps/s | |
test_items_stack_nested_leaf | 0.1179ms | 71.4127μs | 14.0031 KOps/s | 13.7042 KOps/s | |
test_items_stack_nested_locked | 0.6801ms | 0.3849ms | 2.5982 KOps/s | 2.7249 KOps/s | |
test_keys | 47.6490μs | 3.5729μs | 279.8863 KOps/s | 284.3278 KOps/s | |
test_keys_nested | 0.1420ms | 0.1008ms | 9.9231 KOps/s | 9.6401 KOps/s | |
test_keys_nested_locked | 0.7157ms | 0.1050ms | 9.5247 KOps/s | 9.3909 KOps/s | |
test_keys_nested_leaf | 0.1617ms | 83.6563μs | 11.9537 KOps/s | 11.6273 KOps/s | |
test_keys_stack_nested | 0.1462ms | 0.1012ms | 9.8784 KOps/s | 9.7977 KOps/s | |
test_keys_stack_nested_leaf | 0.1628ms | 84.9737μs | 11.7683 KOps/s | 11.7792 KOps/s | |
test_keys_stack_nested_locked | 0.1484ms | 0.1070ms | 9.3452 KOps/s | 9.3297 KOps/s | |
test_values | 5.8168μs | 1.0428μs | 958.9647 KOps/s | 951.3652 KOps/s | |
test_values_nested | 0.1364ms | 75.2114μs | 13.2958 KOps/s | 13.5008 KOps/s | |
test_values_nested_locked | 0.1371ms | 75.2902μs | 13.2819 KOps/s | 13.5209 KOps/s | |
test_values_nested_leaf | 0.1186ms | 61.7561μs | 16.1927 KOps/s | 15.8430 KOps/s | |
test_values_stack_nested | 0.1237ms | 76.7305μs | 13.0326 KOps/s | 13.4494 KOps/s | |
test_values_stack_nested_leaf | 0.1149ms | 62.0233μs | 16.1230 KOps/s | 16.2201 KOps/s | |
test_values_stack_nested_locked | 0.1424ms | 76.7714μs | 13.0257 KOps/s | 13.4528 KOps/s | |
test_membership | 3.6799μs | 0.7710μs | 1.2970 MOps/s | 1.1377 MOps/s | |
test_membership_nested | 20.8390μs | 2.8336μs | 352.9067 KOps/s | 342.2286 KOps/s | |
test_membership_nested_leaf | 20.1670μs | 2.8123μs | 355.5763 KOps/s | 363.9504 KOps/s | |
test_membership_stacked_nested | 25.6080μs | 2.8315μs | 353.1673 KOps/s | 364.0319 KOps/s | |
test_membership_stacked_nested_leaf | 27.2210μs | 2.8732μs | 348.0475 KOps/s | 362.3386 KOps/s | |
test_membership_nested_last | 22.3620μs | 4.0373μs | 247.6928 KOps/s | 252.4880 KOps/s | |
test_membership_nested_leaf_last | 24.3650μs | 4.0076μs | 249.5283 KOps/s | 250.6667 KOps/s | |
test_membership_stacked_nested_last | 23.3940μs | 3.9931μs | 250.4294 KOps/s | 163.1146 KOps/s | |
test_membership_stacked_nested_leaf_last | 29.7760μs | 4.0245μs | 248.4759 KOps/s | 164.0957 KOps/s | |
test_nested_getleaf | 40.8260μs | 10.6604μs | 93.8052 KOps/s | 91.6647 KOps/s | |
test_nested_get | 31.6000μs | 10.2333μs | 97.7201 KOps/s | 95.0011 KOps/s | |
test_stacked_getleaf | 34.6650μs | 10.6146μs | 94.2096 KOps/s | 93.7222 KOps/s | |
test_stacked_get | 34.8660μs | 10.3025μs | 97.0633 KOps/s | 95.0708 KOps/s | |
test_nested_getitemleaf | 38.2610μs | 11.2299μs | 89.0476 KOps/s | 85.3071 KOps/s | |
test_nested_getitem | 32.5500μs | 10.5013μs | 95.2259 KOps/s | 92.5898 KOps/s | |
test_stacked_getitemleaf | 34.5650μs | 11.1955μs | 89.3216 KOps/s | 88.3783 KOps/s | |
test_stacked_getitem | 30.4460μs | 10.4940μs | 95.2930 KOps/s | 95.3636 KOps/s | |
test_lock_nested | 83.0334ms | 0.5709ms | 1.7515 KOps/s | 2.0399 KOps/s | |
test_lock_stack_nested | 0.8670ms | 0.4572ms | 2.1870 KOps/s | 2.2239 KOps/s | |
test_unlock_nested | 85.0411ms | 0.4882ms | 2.0482 KOps/s | 2.4463 KOps/s | |
test_unlock_stack_nested | 0.5688ms | 0.3703ms | 2.7009 KOps/s | 2.7116 KOps/s | |
test_flatten_speed | 0.1766ms | 86.9901μs | 11.4956 KOps/s | 11.2287 KOps/s | |
test_unflatten_speed | 0.8207ms | 0.4660ms | 2.1459 KOps/s | 2.1308 KOps/s | |
test_common_ops | 6.2163ms | 1.0520ms | 950.5513 Ops/s | 935.2376 Ops/s | |
test_creation | 26.4590μs | 2.1387μs | 467.5708 KOps/s | 463.9625 KOps/s | |
test_creation_empty | 42.2500μs | 15.6576μs | 63.8668 KOps/s | 64.6022 KOps/s | |
test_creation_nested_1 | 68.9970μs | 18.9546μs | 52.7576 KOps/s | 52.1483 KOps/s | |
test_creation_nested_2 | 64.9310μs | 22.7457μs | 43.9643 KOps/s | 42.6337 KOps/s | |
test_clone | 1.2895ms | 17.2401μs | 58.0043 KOps/s | 56.9877 KOps/s | |
test_getitem[int] | 0.8488ms | 16.9280μs | 59.0739 KOps/s | 57.1217 KOps/s | |
test_getitem[slice_int] | 0.1344ms | 30.5875μs | 32.6931 KOps/s | 31.7941 KOps/s | |
test_getitem[range] | 0.1718ms | 57.8640μs | 17.2819 KOps/s | 17.3671 KOps/s | |
test_getitem[tuple] | 0.1303ms | 25.4296μs | 39.3242 KOps/s | 38.8379 KOps/s | |
test_getitem[list] | 0.1767ms | 53.0748μs | 18.8413 KOps/s | 18.6471 KOps/s | |
test_setitem_dim[int] | 56.5560μs | 31.8156μs | 31.4312 KOps/s | 30.1866 KOps/s | |
test_setitem_dim[slice_int] | 0.1116ms | 60.1273μs | 16.6314 KOps/s | 16.4019 KOps/s | |
test_setitem_dim[range] | 0.1796ms | 83.6103μs | 11.9602 KOps/s | 11.7900 KOps/s | |
test_setitem_dim[tuple] | 75.7210μs | 48.5038μs | 20.6169 KOps/s | 19.9262 KOps/s | |
test_setitem | 69.8300μs | 27.4819μs | 36.3876 KOps/s | 34.9140 KOps/s | |
test_set | 0.1394ms | 26.7029μs | 37.4491 KOps/s | 36.9010 KOps/s | |
test_set_shared | 1.3008ms | 0.2102ms | 4.7574 KOps/s | 4.7499 KOps/s | |
test_update | 0.1439ms | 32.8103μs | 30.4782 KOps/s | 30.7283 KOps/s | |
test_update_nested | 0.1183ms | 43.2317μs | 23.1312 KOps/s | 23.3731 KOps/s | |
test_update__nested | 0.1269ms | 34.2247μs | 29.2187 KOps/s | 28.7363 KOps/s | |
test_set_nested | 0.1271ms | 29.5338μs | 33.8595 KOps/s | 33.1862 KOps/s | |
test_set_nested_new | 82.0130μs | 34.6460μs | 28.8633 KOps/s | 28.0319 KOps/s | |
test_select | 1.2196ms | 52.8045μs | 18.9378 KOps/s | 18.6337 KOps/s | |
test_select_nested | 0.1278ms | 59.0553μs | 16.9333 KOps/s | 16.4402 KOps/s | |
test_exclude_nested | 0.1597ms | 74.9682μs | 13.3390 KOps/s | 12.9929 KOps/s | |
test_empty[True] | 0.4922ms | 0.3187ms | 3.1375 KOps/s | 3.0995 KOps/s | |
test_empty[False] | 5.7907μs | 1.1991μs | 833.9846 KOps/s | 836.9779 KOps/s | |
test_unbind_speed | 0.3905ms | 0.3060ms | 3.2682 KOps/s | 3.3133 KOps/s | |
test_unbind_speed_stack0 | 0.4326ms | 0.2959ms | 3.3800 KOps/s | 3.4292 KOps/s | |
test_unbind_speed_stack1 | 92.6478ms | 0.8095ms | 1.2353 KOps/s | 1.4974 KOps/s | |
test_split | 81.5529ms | 2.1617ms | 462.5898 Ops/s | 456.4482 Ops/s | |
test_chunk | 3.1064ms | 2.0118ms | 497.0630 Ops/s | 458.4147 Ops/s | |
test_creation[device0] | 0.2409ms | 0.1160ms | 8.6192 KOps/s | 8.5840 KOps/s | |
test_creation_from_tensor | 3.0316ms | 0.1162ms | 8.6090 KOps/s | 8.2710 KOps/s | |
test_add_one[memmap_tensor0] | 84.1580μs | 7.0741μs | 141.3602 KOps/s | 139.3530 KOps/s | |
test_contiguous[memmap_tensor0] | 29.2450μs | 1.9232μs | 519.9714 KOps/s | 524.6382 KOps/s | |
test_stack[memmap_tensor0] | 86.4720μs | 5.5649μs | 179.6963 KOps/s | 179.5274 KOps/s | |
test_memmaptd_index | 1.2202ms | 0.3944ms | 2.5353 KOps/s | 2.4104 KOps/s | |
test_memmaptd_index_astensor | 0.7410ms | 0.4694ms | 2.1303 KOps/s | 2.0564 KOps/s | |
test_memmaptd_index_op | 1.6128ms | 0.9517ms | 1.0508 KOps/s | 1.0253 KOps/s | |
test_serialize_model | 0.2143s | 0.1292s | 7.7381 Ops/s | 8.3948 Ops/s | |
test_serialize_model_pickle | 0.4454s | 0.3946s | 2.5343 Ops/s | 2.5505 Ops/s | |
test_serialize_weights | 0.1200s | 0.1110s | 9.0085 Ops/s | 8.6954 Ops/s | |
test_serialize_weights_returnearly | 0.1709s | 0.1568s | 6.3782 Ops/s | 6.1699 Ops/s | |
test_serialize_weights_pickle | 0.4572s | 0.3931s | 2.5439 Ops/s | 2.2157 Ops/s | |
test_serialize_weights_filesystem | 0.2221s | 0.1508s | 6.6292 Ops/s | 6.4131 Ops/s | |
test_serialize_model_filesystem | 0.1578s | 0.1479s | 6.7626 Ops/s | 6.5518 Ops/s | |
test_reshape_pytree | 80.7510μs | 39.3263μs | 25.4283 KOps/s | 25.0662 KOps/s | |
test_reshape_td | 95.1980μs | 45.8802μs | 21.7959 KOps/s | 21.4300 KOps/s | |
test_view_pytree | 88.9070μs | 38.8748μs | 25.7236 KOps/s | 25.1099 KOps/s | |
test_view_td | 0.2259ms | 53.3767μs | 18.7348 KOps/s | 18.9467 KOps/s | |
test_unbind_pytree | 98.2240μs | 36.5398μs | 27.3674 KOps/s | 28.0528 KOps/s | |
test_unbind_td | 0.3111ms | 44.9698μs | 22.2371 KOps/s | 22.4887 KOps/s | |
test_split_pytree | 95.9680μs | 38.2594μs | 26.1374 KOps/s | 26.4212 KOps/s | |
test_split_td | 0.4490ms | 56.8541μs | 17.5889 KOps/s | 17.0556 KOps/s | |
test_add_pytree | 0.1478ms | 45.2303μs | 22.1091 KOps/s | 21.8483 KOps/s | |
test_add_td | 0.1529ms | 75.0754μs | 13.3199 KOps/s | 13.3919 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.1409ms | 56.5410μs | 17.6863 KOps/s | 17.9998 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.3690ms | 0.1735ms | 5.7648 KOps/s | 5.6878 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.1754ms | 56.6892μs | 17.6400 KOps/s | 18.0095 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.3375ms | 0.1411ms | 7.0892 KOps/s | 6.9959 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 49.5330μs | 21.2251μs | 47.1141 KOps/s | 48.1971 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1529ms | 68.2354μs | 14.6551 KOps/s | 14.6576 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.1522ms | 77.3846μs | 12.9225 KOps/s | 12.9935 KOps/s | |
test_compile_copy_nested[pytree-eager] | 0.1407ms | 70.1505μs | 14.2551 KOps/s | 14.2634 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3194ms | 0.1738ms | 5.7548 KOps/s | 5.8389 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3624ms | 0.1873ms | 5.3376 KOps/s | 5.2144 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 92.3440μs | 45.3314μs | 22.0597 KOps/s | 21.3321 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1429ms | 68.6043μs | 14.5763 KOps/s | 14.5541 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.2401ms | 0.1757ms | 5.6909 KOps/s | 5.7842 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.3804ms | 0.2850ms | 3.5082 KOps/s | 3.4228 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.4597ms | 0.2008ms | 4.9797 KOps/s | 4.8375 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3389ms | 0.1730ms | 5.7796 KOps/s | 5.7676 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1420ms | 62.4837μs | 16.0042 KOps/s | 16.0979 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1285ms | 46.9721μs | 21.2892 KOps/s | 21.5313 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.3135ms | 0.2311ms | 4.3273 KOps/s | 4.2518 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3188ms | 0.1743ms | 5.7376 KOps/s | 5.7524 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 0.2272ms | 0.1046ms | 9.5557 KOps/s | 9.6890 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 0.1225ms | 58.7439μs | 17.0230 KOps/s | 17.3698 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1706ms | 79.9941μs | 12.5009 KOps/s | 12.6399 KOps/s | |
test_compile_copy_flat[pytree-eager] | 0.1392ms | 70.3422μs | 14.2162 KOps/s | 14.2889 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 0.3864ms | 0.1961ms | 5.0988 KOps/s | 5.1924 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 2.7559ms | 1.6567ms | 603.5985 Ops/s | 608.5758 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 0.2820ms | 0.1925ms | 5.1938 KOps/s | 5.2115 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 1.3263ms | 1.0972ms | 911.4026 Ops/s | 916.6134 Ops/s | |
test_compile_assign_and_add_stack[compile] | 0.7336ms | 0.4126ms | 2.4238 KOps/s | 2.4123 KOps/s | |
test_compile_assign_and_add_stack[eager] | 3.9482ms | 3.5684ms | 280.2353 Ops/s | 279.1129 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 90.6500μs | 32.7299μs | 30.5531 KOps/s | 29.6891 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.8593ms | 47.3147μs | 21.1351 KOps/s | 20.6177 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 67.5160μs | 28.3319μs | 35.2959 KOps/s | 33.1085 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 80.1000μs | 29.6884μs | 33.6831 KOps/s | 33.9617 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 73.8780μs | 28.3346μs | 35.2926 KOps/s | 33.0056 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 69.1690μs | 29.5366μs | 33.8563 KOps/s | 34.4046 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1568ms | 71.4301μs | 13.9997 KOps/s | 13.8370 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.4469ms | 27.4353μs | 36.4494 KOps/s | 35.7960 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1493ms | 67.1019μs | 14.9027 KOps/s | 14.5615 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 78.6970μs | 23.4253μs | 42.6889 KOps/s | 42.6403 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.1499ms | 66.3119μs | 15.0803 KOps/s | 14.8020 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 78.6970μs | 23.6530μs | 42.2779 KOps/s | 42.8773 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1104ms | 70.5792μs | 14.1685 KOps/s | 13.7822 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.8396ms | 26.7017μs | 37.4508 KOps/s | 36.0477 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.1597ms | 66.7682μs | 14.9772 KOps/s | 14.6249 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 64.2000μs | 23.2922μs | 42.9328 KOps/s | 43.6519 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1651ms | 65.7666μs | 15.2053 KOps/s | 14.7707 KOps/s | |
test_compile_indexing[int-pytree-eager] | 59.1710μs | 23.0952μs | 43.2990 KOps/s | 43.3399 KOps/s | |
test_mod_add[eager] | 59.2310μs | 23.3473μs | 42.8314 KOps/s | 41.8968 KOps/s | |
test_mod_add[compile] | 83.9970μs | 37.7505μs | 26.4897 KOps/s | 25.6303 KOps/s | |
test_mod_add[compile-overhead] | 84.2080μs | 37.9909μs | 26.3221 KOps/s | 25.2594 KOps/s | |
test_mod_wrap[eager] | 0.4255ms | 0.1997ms | 5.0078 KOps/s | 4.9588 KOps/s | |
test_mod_wrap[compile] | 0.4287ms | 0.2298ms | 4.3514 KOps/s | 4.3427 KOps/s | |
test_mod_wrap[compile-overhead] | 0.4326ms | 0.2258ms | 4.4296 KOps/s | 4.4071 KOps/s | |
test_mod_wrap_and_backward[eager] | 12.0379ms | 10.5252ms | 95.0100 Ops/s | 91.8074 Ops/s | |
test_mod_wrap_and_backward[compile] | 12.5134ms | 10.7037ms | 93.4253 Ops/s | 85.4859 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 12.2436ms | 10.9116ms | 91.6454 Ops/s | 85.3843 Ops/s | |
test_seq_add[eager] | 0.2770ms | 89.0602μs | 11.2284 KOps/s | 11.6980 KOps/s | |
test_seq_add[compile] | 0.1240ms | 63.7468μs | 15.6871 KOps/s | 15.6001 KOps/s | |
test_seq_add[compile-overhead] | 0.1079ms | 61.7211μs | 16.2019 KOps/s | 15.7720 KOps/s | |
test_seq_wrap[eager] | 0.5085ms | 0.3663ms | 2.7300 KOps/s | 2.7759 KOps/s | |
test_seq_wrap[compile] | 0.8926ms | 0.2619ms | 3.8190 KOps/s | 3.7634 KOps/s | |
test_seq_wrap[compile-overhead] | 1.0321ms | 0.2726ms | 3.6685 KOps/s | 3.7266 KOps/s | |
test_func_call_runtime[False-eager] | 0.6628ms | 0.5095ms | 1.9627 KOps/s | 1.9623 KOps/s | |
test_func_call_runtime[False-compile] | 1.0399ms | 0.4997ms | 2.0014 KOps/s | 2.0279 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 1.0299ms | 0.4945ms | 2.0223 KOps/s | 2.0289 KOps/s | |
test_func_call_runtime[True-eager] | 1.2551ms | 0.7325ms | 1.3651 KOps/s | 1.3833 KOps/s | |
test_func_call_runtime[True-compile] | 0.6544ms | 0.5038ms | 1.9849 KOps/s | 1.9591 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.7126ms | 0.5000ms | 2.0000 KOps/s | 1.9578 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.6285ms | 0.5039ms | 1.9846 KOps/s | 1.9990 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.6412ms | 0.4944ms | 2.0225 KOps/s | 2.0232 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.6065ms | 0.4900ms | 2.0409 KOps/s | 2.0260 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.3381ms | 0.8533ms | 1.1719 KOps/s | 1.1787 KOps/s | |
test_func_call_cm_runtime[True-compile] | 0.8334ms | 0.7142ms | 1.4001 KOps/s | 1.3794 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 1.2278ms | 0.7156ms | 1.3974 KOps/s | 1.3781 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.3046ms | 1.8289ms | 546.7767 Ops/s | 530.6623 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 2.5831ms | 1.8951ms | 527.6819 Ops/s | 527.2431 Ops/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 2.6141ms | 1.8963ms | 527.3560 Ops/s | 524.5766 Ops/s | |
test_distributed | 0.2264ms | 0.1241ms | 8.0595 KOps/s | 7.8874 KOps/s | |
test_tdmodule | 51.3860μs | 17.0582μs | 58.6227 KOps/s | 61.4856 KOps/s | |
test_tdmodule_dispatch | 61.7560μs | 33.1406μs | 30.1745 KOps/s | 31.3697 KOps/s | |
test_tdseq | 35.5270μs | 18.6097μs | 53.7355 KOps/s | 53.3325 KOps/s | |
test_tdseq_dispatch | 70.3820μs | 38.4542μs | 26.0049 KOps/s | 26.6291 KOps/s | |
test_instantiation_functorch | 1.8847ms | 1.5583ms | 641.7099 Ops/s | 632.1417 Ops/s | |
test_instantiation_td | 1.9738ms | 1.1664ms | 857.3707 Ops/s | 852.4408 Ops/s | |
test_exec_functorch | 0.3400ms | 0.1814ms | 5.5142 KOps/s | 5.4546 KOps/s | |
test_exec_functional_call | 0.3517ms | 0.1679ms | 5.9552 KOps/s | 5.8011 KOps/s | |
test_exec_td | 0.2493ms | 0.1625ms | 6.1523 KOps/s | 5.9146 KOps/s | |
test_exec_td_decorator | 1.1959ms | 0.2190ms | 4.5668 KOps/s | 4.4069 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.9524ms | 0.6367ms | 1.5707 KOps/s | 1.5805 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7631ms | 0.6351ms | 1.5746 KOps/s | 1.5891 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.7866ms | 0.4964ms | 2.0144 KOps/s | 2.0454 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6551ms | 0.4961ms | 2.0157 KOps/s | 2.0360 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 0.9447ms | 0.6187ms | 1.6164 KOps/s | 1.6328 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.9460ms | 0.6194ms | 1.6144 KOps/s | 1.6307 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7529ms | 0.5104ms | 1.9591 KOps/s | 1.9527 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.9084ms | 0.5111ms | 1.9564 KOps/s | 1.9457 KOps/s | |
test_to_module_speed[True] | 2.1121ms | 1.3149ms | 760.4871 Ops/s | 764.8971 Ops/s | |
test_to_module_speed[False] | 1.7615ms | 1.2772ms | 782.9395 Ops/s | 789.1886 Ops/s | |
test_tc_init | 86.7730μs | 42.4159μs | 23.5761 KOps/s | 23.4115 KOps/s | |
test_tc_init_nested | 0.1412ms | 83.7362μs | 11.9423 KOps/s | 11.9400 KOps/s | |
test_tc_first_layer_tensor | 22.0810μs | 1.5527μs | 644.0411 KOps/s | 654.4185 KOps/s | |
test_tc_first_layer_nontensor | 43.6240μs | 4.8070μs | 208.0286 KOps/s | 213.8229 KOps/s | |
test_tc_second_layer_tensor | 18.2940μs | 2.8405μs | 352.0521 KOps/s | 357.7640 KOps/s | |
test_tc_second_layer_nontensor | 27.8620μs | 6.1170μs | 163.4795 KOps/s | 164.7317 KOps/s | |
test_unbind | 0.4556s | 12.9038ms | 77.4968 Ops/s | 74.7872 Ops/s | |
test_full_like | 7.2921ms | 6.6309ms | 150.8080 Ops/s | 149.9393 Ops/s | |
test_zeros_like | 3.0221ms | 2.5540ms | 391.5362 Ops/s | 389.3320 Ops/s | |
test_ones_like | 3.3844ms | 2.9978ms | 333.5791 Ops/s | 168.5809 Ops/s | |
test_clone | 4.8706ms | 4.5754ms | 218.5613 Ops/s | 131.2008 Ops/s | |
test_squeeze | 55.7240μs | 12.1286μs | 82.4499 KOps/s | 77.7010 KOps/s | |
test_unsqueeze | 0.3330ms | 92.9762μs | 10.7554 KOps/s | 10.8731 KOps/s | |
test_split | 0.3358ms | 0.1978ms | 5.0551 KOps/s | 5.1661 KOps/s | |
test_permute | 0.3238ms | 0.2193ms | 4.5593 KOps/s | 4.4049 KOps/s | |
test_stack | 31.4845ms | 24.0275ms | 41.6190 Ops/s | 42.1654 Ops/s | |
test_cat | 27.3639ms | 23.2931ms | 42.9313 Ops/s | 42.5671 Ops/s |
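
The Change column is empty in the table as captured here. As a rough aid for reading the rows, the sketch below computes a relative difference between the Ops and Ops on Repo HEAD columns. It assumes "change" means (PR - HEAD) / HEAD, which is an illustrative choice and not necessarily how the benchmark bot defines the column; `parse_ops` and `relative_change` are hypothetical helper names.

```python
import re

# Multipliers for the throughput units that appear in the table.
_UNITS = {"Ops/s": 1.0, "KOps/s": 1e3, "MOps/s": 1e6}


def parse_ops(text: str) -> float:
    """Convert a string such as '52.8072 KOps/s' to operations per second."""
    match = re.fullmatch(r"\s*([0-9.]+)\s*([KM]?Ops/s)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized throughput: {text!r}")
    value, unit = match.groups()
    return float(value) * _UNITS[unit]


def relative_change(pr_ops: str, head_ops: str) -> float:
    """Fractional change of the PR throughput relative to HEAD (assumed definition)."""
    pr, head = parse_ops(pr_ops), parse_ops(head_ops)
    return (pr - head) / head


# Values copied from the test_plain_set_nested row of the table above.
change = relative_change("52.8072 KOps/s", "50.6421 KOps/s")
print(f"test_plain_set_nested: {change:+.2%}")
```

Single-run differences of a few percent are often within run-to-run noise on shared CI machines, so a value computed this way is a hint to look closer, not evidence of a regression or speedup on its own.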
Name | Max | Mean | Ops | Ops on Repo HEAD | Change |
---|---|---|---|---|---|
test_plain_set_nested | 63.4810μs | 13.6069μs | 73.4923 KOps/s | 70.2448 KOps/s | |
test_plain_set_stack_nested | 32.1310μs | 13.5377μs | 73.8677 KOps/s | 69.7560 KOps/s | |
test_plain_set_nested_inplace | 0.1128ms | 14.4472μs | 69.2175 KOps/s | 66.1771 KOps/s | |
test_plain_set_stack_nested_inplace | 44.0410μs | 14.6430μs | 68.2919 KOps/s | 66.5036 KOps/s | |
test_items | 29.1810μs | 2.8601μs | 349.6383 KOps/s | 343.1253 KOps/s | |
test_items_nested | 0.3551ms | 0.3246ms | 3.0811 KOps/s | 3.0928 KOps/s | |
test_items_nested_locked | 0.3821ms | 0.3293ms | 3.0363 KOps/s | 3.0408 KOps/s | |
test_items_nested_leaf | 80.4520μs | 55.5719μs | 17.9947 KOps/s | 17.8360 KOps/s | |
test_items_stack_nested | 0.3962ms | 0.3322ms | 3.0100 KOps/s | 3.0721 KOps/s | |
test_items_stack_nested_leaf | 81.9820μs | 57.3410μs | 17.4395 KOps/s | 17.5510 KOps/s | |
test_items_stack_nested_locked | 0.3668ms | 0.3320ms | 3.0122 KOps/s | 3.0679 KOps/s | |
test_keys | 25.7300μs | 3.4233μs | 292.1190 KOps/s | 284.1720 KOps/s | |
test_keys_nested | 0.1171ms | 54.3453μs | 18.4008 KOps/s | 18.2843 KOps/s | |
test_keys_nested_locked | 2.6080ms | 61.4813μs | 16.2651 KOps/s | 15.9159 KOps/s | |
test_keys_nested_leaf | 80.6820μs | 46.8799μs | 21.3311 KOps/s | 21.6996 KOps/s | |
test_keys_stack_nested | 87.1020μs | 56.6573μs | 17.6500 KOps/s | 17.5877 KOps/s | |
test_keys_stack_nested_leaf | 84.9810μs | 48.2717μs | 20.7161 KOps/s | 20.8006 KOps/s | |
test_keys_stack_nested_locked | 92.2520μs | 61.8315μs | 16.1730 KOps/s | 16.1054 KOps/s | |
test_values | 4.7052μs | 0.8381μs | 1.1931 MOps/s | 1.1987 MOps/s | |
test_values_nested | 68.4420μs | 40.9814μs | 24.4013 KOps/s | 24.5043 KOps/s | |
test_values_nested_locked | 68.1220μs | 42.7624μs | 23.3850 KOps/s | 23.3332 KOps/s | |
test_values_nested_leaf | 62.0410μs | 35.7046μs | 28.0076 KOps/s | 28.2980 KOps/s | |
test_values_stack_nested | 72.5320μs | 41.8599μs | 23.8892 KOps/s | 24.1756 KOps/s | |
test_values_stack_nested_leaf | 66.0410μs | 36.0934μs | 27.7059 KOps/s | 27.9969 KOps/s | |
test_values_stack_nested_locked | 85.2520μs | 43.5607μs | 22.9565 KOps/s | 22.9152 KOps/s | |
test_membership | 1.6596μs | 0.5041μs | 1.9839 MOps/s | 2.0047 MOps/s | |
test_membership_nested | 13.4250μs | 1.9033μs | 525.4054 KOps/s | 506.7147 KOps/s | |
test_membership_nested_leaf | 11.8637μs | 1.8635μs | 536.6326 KOps/s | 524.5538 KOps/s | |
test_membership_stacked_nested | 28.1110μs | 2.0147μs | 496.3441 KOps/s | 513.3812 KOps/s | |
test_membership_stacked_nested_leaf | 18.2100μs | 1.9966μs | 500.8398 KOps/s | 516.6043 KOps/s | |
test_membership_nested_last | 38.7810μs | 2.8376μs | 352.4159 KOps/s | 362.6859 KOps/s | |
test_membership_nested_leaf_last | 29.5910μs | 2.8220μs | 354.3606 KOps/s | 314.5672 KOps/s | |
test_membership_stacked_nested_last | 38.7510μs | 3.4408μs | 290.6321 KOps/s | 311.2717 KOps/s | |
test_membership_stacked_nested_leaf_last | 29.8110μs | 3.4502μs | 289.8402 KOps/s | 301.2950 KOps/s | |
test_nested_getleaf | 41.2510μs | 6.0454μs | 165.4148 KOps/s | 165.9031 KOps/s | |
test_nested_get | 43.1110μs | 5.7012μs | 175.4005 KOps/s | 178.0945 KOps/s | |
test_stacked_getleaf | 37.7600μs | 6.0526μs | 165.2183 KOps/s | 165.2471 KOps/s | |
test_stacked_get | 35.3610μs | 5.6267μs | 177.7232 KOps/s | 177.5370 KOps/s | |
test_nested_getitemleaf | 41.8710μs | 6.1384μs | 162.9098 KOps/s | 163.4747 KOps/s | |
test_nested_getitem | 29.8710μs | 5.7292μs | 174.5441 KOps/s | 173.9660 KOps/s | |
test_stacked_getitemleaf | 53.3710μs | 6.0818μs | 164.4237 KOps/s | 164.0957 KOps/s | |
test_stacked_getitem | 32.4910μs | 5.6764μs | 176.1676 KOps/s | 175.4009 KOps/s | |
test_lock_nested | 5.1956ms | 0.4272ms | 2.3406 KOps/s | 2.3333 KOps/s | |
test_lock_stack_nested | 0.4315ms | 0.3826ms | 2.6136 KOps/s | 2.5613 KOps/s | |
test_unlock_nested | 0.7688ms | 0.3634ms | 2.7518 KOps/s | 2.7083 KOps/s | |
test_unlock_stack_nested | 0.3820ms | 0.3222ms | 3.1040 KOps/s | 2.9884 KOps/s | |
test_flatten_speed | 0.1463ms | 68.7798μs | 14.5392 KOps/s | 14.3412 KOps/s | |
test_unflatten_speed | 0.3887ms | 0.2816ms | 3.5509 KOps/s | 3.3793 KOps/s | |
test_common_ops | 1.6138ms | 1.3334ms | 749.9532 Ops/s | 797.2723 Ops/s | |
test_creation | 18.4300μs | 1.4519μs | 688.7613 KOps/s | 664.4970 KOps/s | |
test_creation_empty | 44.2210μs | 15.1836μs | 65.8607 KOps/s | 61.2116 KOps/s | |
test_creation_nested_1 | 51.5910μs | 16.7571μs | 59.6762 KOps/s | 56.2776 KOps/s | |
test_creation_nested_2 | 57.4110μs | 20.8856μs | 47.8799 KOps/s | 46.4192 KOps/s | |
test_clone | 67.9520μs | 31.0819μs | 32.1731 KOps/s | 31.4847 KOps/s | |
test_getitem[int] | 1.4574ms | 18.4070μs | 54.3270 KOps/s | 61.1068 KOps/s | |
test_getitem[slice_int] | 0.1249ms | 31.5412μs | 31.7046 KOps/s | 33.5998 KOps/s | |
test_getitem[range] | 0.1544ms | 0.1087ms | 9.2020 KOps/s | 9.1263 KOps/s | |
test_getitem[tuple] | 0.1233ms | 24.4409μs | 40.9150 KOps/s | 36.5954 KOps/s | |
test_getitem[list] | 0.1959ms | 98.6328μs | 10.1386 KOps/s | 9.3388 KOps/s | |
test_setitem_dim[int] | 66.6010μs | 45.1856μs | 22.1309 KOps/s | 19.6865 KOps/s | |
test_setitem_dim[slice_int] | 0.1010ms | 67.9331μs | 14.7204 KOps/s | 13.4207 KOps/s | |
test_setitem_dim[range] | 0.1950ms | 0.1271ms | 7.8705 KOps/s | 7.3444 KOps/s | |
test_setitem_dim[tuple] | 87.8240μs | 61.5529μs | 16.2462 KOps/s | 14.8651 KOps/s | |
test_setitem | 78.2630μs | 41.5912μs | 24.0436 KOps/s | 21.6002 KOps/s | |
test_set | 75.0730μs | 40.6988μs | 24.5708 KOps/s | 22.1352 KOps/s | |
test_set_shared | 0.3829ms | 52.4528μs | 19.0648 KOps/s | 17.9363 KOps/s | |
test_update | 0.1076ms | 54.0112μs | 18.5147 KOps/s | 17.9205 KOps/s | |
test_update_nested | 0.1571ms | 57.1210μs | 17.5067 KOps/s | 15.7238 KOps/s | |
test_update__nested | 0.1158ms | 59.3361μs | 16.8531 KOps/s | 15.2228 KOps/s | |
test_set_nested | 78.8720μs | 43.6407μs | 22.9144 KOps/s | 20.4984 KOps/s | |
test_set_nested_new | 88.4920μs | 46.5405μs | 21.4866 KOps/s | 19.3727 KOps/s | |
test_select | 0.1033ms | 59.4258μs | 16.8277 KOps/s | 14.9308 KOps/s | |
test_select_nested | 0.6090ms | 42.0140μs | 23.8016 KOps/s | 22.3762 KOps/s | |
test_exclude_nested | 99.9820μs | 58.2719μs | 17.1609 KOps/s | 15.5798 KOps/s | |
test_empty[True] | 0.3073ms | 0.2425ms | 4.1239 KOps/s | 4.0105 KOps/s | |
test_empty[False] | 3.7971μs | 0.7491μs | 1.3349 MOps/s | 1.3294 MOps/s | |
test_to | 57.0810μs | 24.4862μs | 40.8393 KOps/s | 41.4734 KOps/s | |
test_to_nonblocking | 59.4610μs | 23.3414μs | 42.8423 KOps/s | 43.6929 KOps/s | |
test_unbind_speed | 0.3401ms | 0.2858ms | 3.4991 KOps/s | 3.5277 KOps/s | |
test_unbind_speed_stack0 | 0.3212ms | 0.2798ms | 3.5744 KOps/s | 3.5250 KOps/s | |
test_unbind_speed_stack1 | 99.1894ms | 0.7058ms | 1.4167 KOps/s | 1.3664 KOps/s | |
test_split | 99.7283ms | 2.2255ms | 449.3361 Ops/s | 444.8484 Ops/s | |
test_chunk | 0.1004s | 2.2185ms | 450.7456 Ops/s | 445.0396 Ops/s | |
test_creation[device0] | 0.3564ms | 0.1309ms | 7.6371 KOps/s | 7.4881 KOps/s | |
test_creation_from_tensor | 0.3435ms | 0.1335ms | 7.4902 KOps/s | 7.3920 KOps/s | |
test_add_one[memmap_tensor0] | 0.1805ms | 10.0853μs | 99.1543 KOps/s | 100.0849 KOps/s | |
test_contiguous[memmap_tensor0] | 23.8610μs | 2.2060μs | 453.3013 KOps/s | 451.2061 KOps/s | |
test_stack[memmap_tensor0] | 36.7810μs | 6.9770μs | 143.3286 KOps/s | 144.9345 KOps/s | |
test_memmaptd_index | 1.2358ms | 0.4344ms | 2.3018 KOps/s | 2.2000 KOps/s | |
test_memmaptd_index_astensor | 0.9952ms | 0.4960ms | 2.0161 KOps/s | 1.9734 KOps/s | |
test_memmaptd_index_op | 1.4454ms | 1.0357ms | 965.5028 Ops/s | 889.9751 Ops/s | |
test_serialize_model | 0.1311s | 0.1294s | 7.7258 Ops/s | 7.6981 Ops/s | |
test_serialize_model_pickle | 1.3499s | 1.2132s | 0.8243 Ops/s | 0.8245 Ops/s | |
test_serialize_weights | 0.2292s | 0.1431s | 6.9886 Ops/s | 7.7680 Ops/s | |
test_serialize_weights_returnearly | 0.2315s | 58.3685ms | 17.1325 Ops/s | 18.6339 Ops/s | |
test_serialize_weights_pickle | 1.3716s | 1.2168s | 0.8218 Ops/s | 0.8242 Ops/s | |
test_reshape_pytree | 73.1010μs | 36.5332μs | 27.3724 KOps/s | 28.2297 KOps/s | |
test_reshape_td | 86.8720μs | 42.0578μs | 23.7768 KOps/s | 22.9123 KOps/s | |
test_view_pytree | 77.3920μs | 36.2243μs | 27.6058 KOps/s | 27.6801 KOps/s | |
test_view_td | 87.8420μs | 47.5227μs | 21.0426 KOps/s | 21.1736 KOps/s | |
test_unbind_pytree | 63.7320μs | 35.1401μs | 28.4575 KOps/s | 28.4231 KOps/s | |
test_unbind_td | 0.4699ms | 44.0386μs | 22.7073 KOps/s | 22.4522 KOps/s | |
test_split_pytree | 83.3320μs | 46.5235μs | 21.4945 KOps/s | 20.6429 KOps/s | |
test_split_td | 0.6756ms | 56.1837μs | 17.7988 KOps/s | 15.6637 KOps/s | |
test_add_pytree | 0.1010ms | 58.0567μs | 17.2245 KOps/s | 16.0947 KOps/s | |
test_add_td | 0.1547ms | 93.4809μs | 10.6974 KOps/s | 9.8770 KOps/s | |
test_compile_add_one_nested[tensordict-compile] | 0.4186ms | 0.2104ms | 4.7520 KOps/s | 4.7316 KOps/s | |
test_compile_add_one_nested[tensordict-eager] | 0.2632ms | 0.1496ms | 6.6846 KOps/s | 6.6750 KOps/s | |
test_compile_add_one_nested[pytree-compile] | 0.2303ms | 0.1444ms | 6.9269 KOps/s | 6.6636 KOps/s | |
test_compile_add_one_nested[pytree-eager] | 0.2801ms | 0.1847ms | 5.4130 KOps/s | 5.1327 KOps/s | |
test_compile_copy_nested[tensordict-compile] | 55.5110μs | 21.5953μs | 46.3063 KOps/s | 47.9459 KOps/s | |
test_compile_copy_nested[tensordict-eager] | 0.1015ms | 43.4793μs | 22.9995 KOps/s | 22.9800 KOps/s | |
test_compile_copy_nested[pytree-compile] | 0.2379ms | 64.8000μs | 15.4321 KOps/s | 15.3771 KOps/s | |
test_compile_copy_nested[pytree-eager] | 80.4920μs | 49.5476μs | 20.1826 KOps/s | 20.1570 KOps/s | |
test_compile_add_one_flat[tensordict-compile] | 0.3671ms | 0.3178ms | 3.1465 KOps/s | 3.0821 KOps/s | |
test_compile_add_one_flat[tensordict-eager] | 0.3047ms | 0.2068ms | 4.8365 KOps/s | 4.8600 KOps/s | |
test_compile_add_one_flat[tensorclass-compile] | 0.1727ms | 0.1269ms | 7.8779 KOps/s | 7.5766 KOps/s | |
test_compile_add_one_flat[tensorclass-eager] | 0.1020ms | 59.6026μs | 16.7778 KOps/s | 16.2057 KOps/s | |
test_compile_add_one_flat[pytree-compile] | 0.4018ms | 0.3190ms | 3.1349 KOps/s | 3.1142 KOps/s | |
test_compile_add_one_flat[pytree-eager] | 0.6803ms | 0.6342ms | 1.5767 KOps/s | 1.6294 KOps/s | |
test_compile_add_self_flat[tensordict-eager] | 0.2985ms | 0.2470ms | 4.0487 KOps/s | 4.0625 KOps/s | |
test_compile_add_self_flat[tensordict-compile] | 0.3792ms | 0.3171ms | 3.1538 KOps/s | 3.0949 KOps/s | |
test_compile_add_self_flat[tensorclass-eager] | 0.1187ms | 70.6753μs | 14.1492 KOps/s | 13.4423 KOps/s | |
test_compile_add_self_flat[tensorclass-compile] | 0.1795ms | 0.1272ms | 7.8630 KOps/s | 7.5168 KOps/s | |
test_compile_add_self_flat[pytree-eager] | 0.6077ms | 0.5380ms | 1.8587 KOps/s | 1.8993 KOps/s | |
test_compile_add_self_flat[pytree-compile] | 0.3617ms | 0.3189ms | 3.1358 KOps/s | 3.1076 KOps/s | |
test_compile_copy_flat[tensordict-compile] | 86.7420μs | 18.0177μs | 55.5010 KOps/s | 55.1424 KOps/s | |
test_compile_copy_flat[tensordict-eager] | 59.3020μs | 27.2419μs | 36.7081 KOps/s | 36.8327 KOps/s | |
test_compile_copy_flat[pytree-compile] | 0.1033ms | 69.0248μs | 14.4876 KOps/s | 14.0632 KOps/s | |
test_compile_copy_flat[pytree-eager] | 81.7420μs | 51.0322μs | 19.5955 KOps/s | 19.4354 KOps/s | |
test_compile_assign_and_add[tensordict-compile] | 2.3971ms | 0.8356ms | 1.1967 KOps/s | 1.1048 KOps/s | |
test_compile_assign_and_add[tensordict-eager] | 3.5001ms | 3.2403ms | 308.6087 Ops/s | 315.1045 Ops/s | |
test_compile_assign_and_add[pytree-compile] | 2.3374ms | 0.8190ms | 1.2210 KOps/s | 1.1184 KOps/s | |
test_compile_assign_and_add[pytree-eager] | 3.5223ms | 3.3026ms | 302.7905 Ops/s | 313.6156 Ops/s | |
test_compile_indexing[tensor-tensordict-compile] | 0.1595ms | 0.1096ms | 9.1273 KOps/s | 9.0734 KOps/s | |
test_compile_indexing[tensor-tensordict-eager] | 0.1931ms | 64.3140μs | 15.5487 KOps/s | 15.9324 KOps/s | |
test_compile_indexing[tensor-tensorclass-compile] | 0.1685ms | 0.1041ms | 9.6061 KOps/s | 9.7632 KOps/s | |
test_compile_indexing[tensor-tensorclass-eager] | 94.3720μs | 45.5921μs | 21.9336 KOps/s | 23.0484 KOps/s | |
test_compile_indexing[tensor-pytree-compile] | 0.2086ms | 0.1071ms | 9.3410 KOps/s | 9.6575 KOps/s | |
test_compile_indexing[tensor-pytree-eager] | 91.0720μs | 45.7576μs | 21.8543 KOps/s | 22.9277 KOps/s | |
test_compile_indexing[slice-tensordict-compile] | 0.1891ms | 0.1442ms | 6.9365 KOps/s | 7.2452 KOps/s | |
test_compile_indexing[slice-tensordict-eager] | 0.1784ms | 25.2662μs | 39.5785 KOps/s | 38.2639 KOps/s | |
test_compile_indexing[slice-tensorclass-compile] | 0.1934ms | 0.1358ms | 7.3635 KOps/s | 7.5929 KOps/s | |
test_compile_indexing[slice-tensorclass-eager] | 56.0920μs | 21.0516μs | 47.5024 KOps/s | 47.1669 KOps/s | |
test_compile_indexing[slice-pytree-compile] | 0.2285ms | 0.1369ms | 7.3054 KOps/s | 7.5139 KOps/s | |
test_compile_indexing[slice-pytree-eager] | 52.5710μs | 20.9967μs | 47.6266 KOps/s | 46.5491 KOps/s | |
test_compile_indexing[int-tensordict-compile] | 0.1859ms | 0.1391ms | 7.1911 KOps/s | 7.2005 KOps/s | |
test_compile_indexing[int-tensordict-eager] | 0.4965ms | 25.4409μs | 39.3068 KOps/s | 38.1292 KOps/s | |
test_compile_indexing[int-tensorclass-compile] | 0.2443ms | 0.1322ms | 7.5632 KOps/s | 7.5324 KOps/s | |
test_compile_indexing[int-tensorclass-eager] | 0.1061ms | 23.4814μs | 42.5869 KOps/s | 47.3940 KOps/s | |
test_compile_indexing[int-pytree-compile] | 0.1918ms | 0.1356ms | 7.3748 KOps/s | 7.5388 KOps/s | |
test_compile_indexing[int-pytree-eager] | 64.8810μs | 20.7964μs | 48.0852 KOps/s | 47.5728 KOps/s | |
test_mod_add[eager] | 74.6720μs | 32.5515μs | 30.7205 KOps/s | 30.9717 KOps/s | |
test_mod_add[compile] | 0.1211ms | 69.8851μs | 14.3092 KOps/s | 14.1720 KOps/s | |
test_mod_add[compile-overhead] | 0.2666ms | 0.1354ms | 7.3833 KOps/s | 6.6415 KOps/s | |
test_mod_wrap[eager] | 0.3582ms | 0.2502ms | 3.9960 KOps/s | 4.0579 KOps/s | |
test_mod_wrap[compile] | 1.4336ms | 0.2951ms | 3.3886 KOps/s | 3.3530 KOps/s | |
test_mod_wrap[compile-overhead] | 7.5940ms | 4.0260ms | 248.3841 Ops/s | 247.6779 Ops/s | |
test_mod_wrap_and_backward[eager] | 1.6395ms | 1.3573ms | 736.7318 Ops/s | 684.9489 Ops/s | |
test_mod_wrap_and_backward[compile] | 1.5784ms | 1.3167ms | 759.4684 Ops/s | 697.2193 Ops/s | |
test_mod_wrap_and_backward[compile-overhead] | 1.3440ms | 0.9122ms | 1.0962 KOps/s | 937.9654 Ops/s | |
test_seq_add[eager] | 0.1837ms | 99.5723μs | 10.0430 KOps/s | 9.9064 KOps/s | |
test_seq_add[compile] | 0.6972ms | 84.9570μs | 11.7707 KOps/s | 12.4246 KOps/s | |
test_seq_add[compile-overhead] | 0.1523ms | 0.1139ms | 8.7827 KOps/s | 8.7520 KOps/s | |
test_seq_wrap[eager] | 0.4400ms | 0.3797ms | 2.6339 KOps/s | 2.5346 KOps/s | |
test_seq_wrap[compile] | 0.3770ms | 0.3137ms | 3.1874 KOps/s | 3.1461 KOps/s | |
test_seq_wrap[compile-overhead] | 0.2980ms | 0.2255ms | 4.4338 KOps/s | 4.5007 KOps/s | |
test_func_call_runtime[False-eager] | 0.7895ms | 0.7363ms | 1.3581 KOps/s | 1.2607 KOps/s | |
test_func_call_runtime[False-compile] | 1.0881ms | 0.7846ms | 1.2745 KOps/s | 1.2374 KOps/s | |
test_func_call_runtime[False-compile-overhead] | 0.4004ms | 0.3592ms | 2.7842 KOps/s | 2.7035 KOps/s | |
test_func_call_runtime[True-eager] | 0.9825ms | 0.8977ms | 1.1139 KOps/s | 1.0908 KOps/s | |
test_func_call_runtime[True-compile] | 0.8810ms | 0.8211ms | 1.2179 KOps/s | 1.1769 KOps/s | |
test_func_call_runtime[True-compile-overhead] | 0.5222ms | 0.3936ms | 2.5404 KOps/s | 2.5060 KOps/s | |
test_func_call_cm_runtime[False-eager] | 0.8111ms | 0.7338ms | 1.3628 KOps/s | 1.2920 KOps/s | |
test_func_call_cm_runtime[False-compile] | 0.8983ms | 0.7808ms | 1.2807 KOps/s | 1.2350 KOps/s | |
test_func_call_cm_runtime[False-compile-overhead] | 0.4092ms | 0.3616ms | 2.7654 KOps/s | 2.6960 KOps/s | |
test_func_call_cm_runtime[True-eager] | 1.0928ms | 0.9998ms | 1.0002 KOps/s | 985.2992 Ops/s | |
test_func_call_cm_runtime[True-compile] | 0.9444ms | 0.8482ms | 1.1790 KOps/s | 1.1570 KOps/s | |
test_func_call_cm_runtime[True-compile-overhead] | 0.4587ms | 0.4162ms | 2.4027 KOps/s | 2.3613 KOps/s | |
test_vmap_func_call_cm_runtime[eager] | 2.5270ms | 2.0705ms | 482.9859 Ops/s | 477.6399 Ops/s | |
test_vmap_func_call_cm_runtime[compile] | 0.9553ms | 0.9004ms | 1.1106 KOps/s | 1.1386 KOps/s | |
test_vmap_func_call_cm_runtime[compile-overhead] | 0.4813ms | 0.4241ms | 2.3578 KOps/s | 2.3404 KOps/s | |
test_distributed | 4.0519ms | 0.2345ms | 4.2637 KOps/s | 8.5092 KOps/s | |
test_tdmodule | 51.4710μs | 13.6870μs | 73.0623 KOps/s | 65.9930 KOps/s | |
test_tdmodule_dispatch | 67.0810μs | 27.1556μs | 36.8249 KOps/s | 33.4452 KOps/s | |
test_tdseq | 36.5010μs | 14.8007μs | 67.5644 KOps/s | 62.1798 KOps/s | |
test_tdseq_dispatch | 60.3710μs | 29.8890μs | 33.4571 KOps/s | 30.6796 KOps/s | |
test_instantiation_functorch | 2.0184ms | 1.8692ms | 534.9903 Ops/s | 533.0794 Ops/s | |
test_instantiation_td | 1.8120ms | 1.1972ms | 835.2787 Ops/s | 826.1761 Ops/s | |
test_exec_functorch | 0.2555ms | 0.2108ms | 4.7433 KOps/s | 4.7114 KOps/s | |
test_exec_functional_call | 0.2491ms | 0.2127ms | 4.7015 KOps/s | 4.6776 KOps/s | |
test_exec_td | 0.2883ms | 0.2177ms | 4.5938 KOps/s | 4.5017 KOps/s | |
test_exec_td_decorator | 1.1504ms | 0.2578ms | 3.8795 KOps/s | 3.7730 KOps/s | |
test_vmap_mlp_speed[True-True] | 0.7589ms | 0.6845ms | 1.4610 KOps/s | 1.4384 KOps/s | |
test_vmap_mlp_speed[True-False] | 0.7594ms | 0.6856ms | 1.4585 KOps/s | 1.4446 KOps/s | |
test_vmap_mlp_speed[False-True] | 0.6633ms | 0.5778ms | 1.7306 KOps/s | 1.7150 KOps/s | |
test_vmap_mlp_speed[False-False] | 0.6644ms | 0.5794ms | 1.7260 KOps/s | 1.7108 KOps/s | |
test_vmap_mlp_speed_decorator[True-True] | 1.2546ms | 0.6889ms | 1.4516 KOps/s | 1.4733 KOps/s | |
test_vmap_mlp_speed_decorator[True-False] | 0.8105ms | 0.6727ms | 1.4865 KOps/s | 1.4716 KOps/s | |
test_vmap_mlp_speed_decorator[False-True] | 0.7175ms | 0.5925ms | 1.6877 KOps/s | 1.6701 KOps/s | |
test_vmap_mlp_speed_decorator[False-False] | 0.7428ms | 0.5912ms | 1.6916 KOps/s | 1.6695 KOps/s | |
test_vmap_transformer_speed[True-True] | 8.6380ms | 8.3264ms | 120.1000 Ops/s | 118.3316 Ops/s | |
test_vmap_transformer_speed[True-False] | 8.7237ms | 8.3480ms | 119.7895 Ops/s | 118.6359 Ops/s | |
test_vmap_transformer_speed[False-True] | 8.4908ms | 8.1897ms | 122.1045 Ops/s | 121.3994 Ops/s | |
test_vmap_transformer_speed[False-False] | 8.4950ms | 8.2172ms | 121.6961 Ops/s | 121.6025 Ops/s | |
test_vmap_transformer_speed_decorator[True-True] | 20.2277ms | 19.7980ms | 50.5102 Ops/s | 51.0782 Ops/s | |
test_vmap_transformer_speed_decorator[True-False] | 20.3576ms | 19.9424ms | 50.1443 Ops/s | 51.0792 Ops/s | |
test_vmap_transformer_speed_decorator[False-True] | 20.0880ms | 19.6075ms | 51.0008 Ops/s | 51.5615 Ops/s | |
test_vmap_transformer_speed_decorator[False-False] | 19.5095ms | 19.3674ms | 51.6332 Ops/s | 51.4700 Ops/s | |
test_to_module_speed[True] | 1.1663ms | 0.9378ms | 1.0664 KOps/s | 1.0613 KOps/s | |
test_to_module_speed[False] | 1.3257ms | 0.9130ms | 1.0953 KOps/s | 1.0800 KOps/s | |
test_tc_init | 65.2420μs | 34.0931μs | 29.3315 KOps/s | 27.5807 KOps/s | |
test_tc_init_nested | 0.1148ms | 71.0185μs | 14.0808 KOps/s | 13.6939 KOps/s | |
test_tc_first_layer_tensor | 9.0330μs | 0.6781μs | 1.4747 MOps/s | 1.4601 MOps/s | |
test_tc_first_layer_nontensor | 24.9110μs | 2.2574μs | 442.9782 KOps/s | 440.4209 KOps/s | |
test_tc_second_layer_tensor | 34.3007μs | 1.3877μs | 720.6210 KOps/s | 733.9660 KOps/s | |
test_tc_second_layer_nontensor | 28.1810μs | 2.9573μs | 338.1428 KOps/s | 338.8215 KOps/s | |
test_unbind | 0.2025s | 13.0979ms | 76.3479 Ops/s | 96.9112 Ops/s | |
test_full_like | 0.6634ms | 0.5715ms | 1.7498 KOps/s | 1.7334 KOps/s | |
test_zeros_like | 0.2590ms | 0.1979ms | 5.0533 KOps/s | 5.0508 KOps/s | |
test_ones_like | 0.2331ms | 0.1978ms | 5.0556 KOps/s | 5.0561 KOps/s | |
test_clone | 0.4444ms | 0.4148ms | 2.4108 KOps/s | 2.4198 KOps/s | |
test_squeeze | 31.6510μs | 10.0525μs | 99.4778 KOps/s | 99.0518 KOps/s | |
test_unsqueeze | 0.2306ms | 75.6948μs | 13.2109 KOps/s | 13.3296 KOps/s | |
test_split | 0.4376ms | 0.1599ms | 6.2527 KOps/s | 6.2792 KOps/s | |
test_permute | 0.2859ms | 0.1802ms | 5.5486 KOps/s | 5.3772 KOps/s | |
test_stack | 1.2673ms | 0.8572ms | 1.1666 KOps/s | 1.1390 KOps/s | |
test_cat | 1.2550ms | 1.2314ms | 812.0593 Ops/s | 811.6231 Ops/s | |
Labels: CI, CLA Signed
Stack from ghstack (oldest at bottom):