-
Notifications
You must be signed in to change notification settings - Fork 5.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Benchmark downloader (script + CI job) #12818
Conversation
fa1ddc6
to
f41e05c
Compare
This is now done and ready for review. |
"${scripts_dir}/externalTests/benchmark_diff.py" table \ | ||
--output-format markdown \ | ||
--style humanized \ | ||
base-branch/summarized-benchmarks-*.json \ | ||
summarized-benchmarks.json > diff/benchmark-diff-summarized-table-markdown-humanized.md | ||
"${scripts_dir}/externalTests/benchmark_diff.py" table \ | ||
--output-format markdown \ | ||
--style absolute \ | ||
base-branch/summarized-benchmarks-*.json \ | ||
summarized-benchmarks.json > diff/benchmark-diff-summarized-table-markdown-absolute.md | ||
"${scripts_dir}/externalTests/benchmark_diff.py" inplace \ | ||
--style absolute \ | ||
base-branch/summarized-benchmarks-*.json \ | ||
summarized-benchmarks.json > diff/benchmark-diff-summarized-inplace-absolute.md | ||
"${scripts_dir}/externalTests/benchmark_diff.py" inplace \ | ||
--style absolute \ | ||
base-branch/all-benchmarks-*.json \ | ||
all-benchmarks.json > diff/benchmark-diff-all-table-inplace-absolute.md |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We might not need all of these. benchmark-diff-summarized-table-markdown-humanized.md
will probably be the most useful one in practice. It will soon be possible to include extra columns with absolute values in it as well.
On the other hand the overhead of having all of them is negligible and also this acts as an end to end test for the script so I included them here anyway.
ad76d0f
to
838be6d
Compare
60b45d0
to
b3e3669
Compare
838be6d
to
9dfa818
Compare
b3e3669
to
fce9efc
Compare
9dfa818
to
9c91771
Compare
9c91771
to
0fea909
Compare
Note, about the
This is a limitation of the script with regard to PRs that are based one on the other. The job tries to access artifacts from the same job on the base PR. If both were pushed at the same time, it's likely that that base job has not finished yet. In that case if just needs to be reran after the base job finishes. |
0fea909
to
71d9f08
Compare
71d9f08
to
1cf3fec
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a lot of infrastructure for just downloading some files :-). But probably not much one can do to avoid that...
I haven't reviewed the python scripts at all, but - as usual for these PRs - I'd say it's fine to merge based on them working properly in CI - and it being nice to have them, but them being non-critical.
If anyone is up for going through the python scripts once, though, it wouldn't hurt.
Unfortunately, dealing with APIs reliably is just verbose. This is not the first time I need to access their APIs and I always end up with a really ugly Bash script. We really needed some wrappers for that that we can later reuse as needed. CircleCI actually has its own CLI utility called Also, to be fair, it adds more value than just "downloading some files". It automates finding the exact runs that you need which is more tedious than it seems at first. E.g. if your branch is based on |
1cf3fec
to
69781f8
Compare
69781f8
to
45dffe5
Compare
Depends on #12804.Merged.The script makes several CirecleCI and Github API requests to get the JSON with benchmark results for a specific PR or branch. I think it will be very handy for manual use, though my primary goal here is to use it in
c_ext_benchmarks
job to get benchmarks from the base PR and provide the diff. The PR extends that job to generate such diffs and attach them as artifacts.After the experiences with #12181 and ethereum/solc-bin#35 I decided to use Python to make this more maintainable. This way I'll be able to cover it with tests, and, compared to my earlier Bash scripts, it was easy to make a module with helpers that we'll be able to reuse in other situations that require API access. While it isn't that short as a whole, it's more concise compared to the previous Bash scripts - the main logic specific to this task is concentrated in one function and the rest are reusable helpers and command-line argument definitions (which is more like docs than real code).
I'm on purpose not trying to do too much error handling to keep things simple. I.e. it either works or you get an exception. Only the cases that would not just crash on unexpected response (like finding no matching results) are explicitly checked.
This is still work in progress. I still need toEverything is done:Usage of the script
To get benchmarks from your branch and from develop, you just run this:
This stores
summarized-benchmarks-<branch>-<commit hash>.json
andall-benchmarks-<branch>-<commit hash>.json
in the current directory. Now you can run the script from #12804 on the files.If you want to compare someone else's PR you probably have the PR number handy - you can pass that to the script using the
--pr
option instead of the branch.You can also use
--base-of-pr
option to get artifacts from the branch your PR is based on (instead ofdevelop
). By default the script finds a CI run matching not only the base branch name but also the exact commit. This guarantees that even if the base branch has moved on and has extra commits, your comparison will not be influenced by these commits.Sometimes you might still want to get the benchmark from the latest state of the base branch, e.g. if there was no successful run on the commit you're based on and you know that any extra commits are not going to distort results all that much - in that case you can use
--any-commit
flag to bypass the commit check.