Replies: 3 comments 7 replies
-
@nbrady-techempower Let me show you the numbers of running a branch of techempower with server container's restarted (naively implemented, but it's just to show the difference) vs master. Just as reminder , the order of execution of tests:
The plaintext one, given that's the last one and due to how the JVM compilation works, can have a huge variance in its results because of the previous executed (very different) workloads, see: these results coming from our CI (that's using bare metal/idle/stable hw):
If you have a CI there, you can reproduce yourself just by running a plaintext alone vs all and checking plaintext results. |
Beta Was this translation helpful? Give feedback.
-
i think it is better to re-use the existing server for all tests as it currently does - this is the intended purpose of the benchmarks afaict - to test frameworks across a wide range of different tasks, not to see how optimized they can be for each specific task. there is at least one framework already that re-compiles itself based on profiling information so it can better fit the expected load of the test. imho, this tends to obscure any advantage dynamic/JIT languages have over statically compiled ones, which should be something we can learn from these benchmarks. 🤷♂️ |
Beta Was this translation helpful? Give feedback.
-
But now there are fws than use a different variant per test, and benefiting from the JIT. So we need to clarify the rules. |
Beta Was this translation helpful? Give feedback.
-
Reading the documentation it seems that the server (docker) istance is kept running across different test types (eg db, update, json, plaintext,...).
Both native and JIT technology stacks can have trouble to keep performance stable across different runs of very different test types:
To keep results between types more stable would be better to restart the server (docker) machine before each validation/warmup/prime run, in order to have repeatable results independent by the order of test types, wdyt?
Beta Was this translation helpful? Give feedback.
All reactions