SYSTEMDS-3534 Cache for build phase in bin and recode encoder #2036
…cache to the build class
…ng without specifying bin method
Thanks for the changes, @Mayaryin and @ingunnaf. I will have a detailed look in the next few days.
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #2036 +/- ##
=======================================
Coverage ? 68.84%
Complexity ? 40755
=======================================
Files ? 1447
Lines ? 161755
Branches ? 31432
=======================================
Hits ? 111363
Misses ? 41315
Partials ? 9077
☔ View full report in Codecov by Sentry.
…onfusion about thread numbers
We added a unit test and tests for multithreading. We attempted to test the cache via a DML script and a test dataset, but did not succeed. Our results from the inner tests, as well as the test data generation script and the DML scripts, can be found in this repository: https://github.com/Mayaryin/systemds_test_data
Thanks @Mayaryin. I will take a look. Can you please summarize the performance comparison of single-threaded, multi-threaded, with and without cache in this PR to track it in the same place? You can maintain the scripts and setups in the other repository, but just mention the speedups here.
Also, please ensure that all tests pass.
I am unsure which tests exactly fail. When building locally, the build fails due to Javadoc warnings; this can be avoided by setting the corresponding option to false in the Javadoc plugin configuration in the POM. Looking at the warnings, it seems that comments are missing in many classes, not only in the ones we added. Could we have a look at it together in a call this week?
Looks like builtin.part2 is failing. You can see the backtrace if you click Details. Take a look and try to reproduce the failure locally by running the same test. If you are still unsure, we can have a call later this week. By the way, I reran the tests to rule out any intermittent issues.
I've seen that there is a problem with the dedup function, but this also occurs in at least one other PR.
All of the tests have passed now.
Thank you, @Mayaryin and @ingunnaf, for your contribution. The list of TODOs includes:
Future work outside the scope of this PR:
We implemented a Caffeine cache that is currently used by the bin and recode encoders. We tested and compared the runtime with and without caching in a separate test class. We measure the first encoding run separately from the rest of our measurements, because it initializes the cache and takes longer. We average over 10 runs for each setting.
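The measurement setup described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: a `ConcurrentHashMap` stands in for the Caffeine cache so the example has no external dependencies, and the class name `BuildCacheBenchmark`, the key format, and the equi-width `buildBins` routine are all hypothetical.

```java
import java.util.Map;
import java.util.Random;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

public class BuildCacheBenchmark {
    // Stand-in for the Caffeine cache (assumption): key -> build-phase metadata.
    static final Map<String, double[]> CACHE = new ConcurrentHashMap<>();
    // Counts how often the expensive build phase actually ran.
    static final AtomicInteger BUILD_COUNT = new AtomicInteger();

    // Hypothetical build phase: equi-width bin boundaries for one column.
    static double[] buildBins(double[] column, int numBins) {
        BUILD_COUNT.incrementAndGet();
        double min = Double.POSITIVE_INFINITY, max = Double.NEGATIVE_INFINITY;
        for (double v : column) {
            min = Math.min(min, v);
            max = Math.max(max, v);
        }
        double width = (max - min) / numBins;
        double[] bounds = new double[numBins + 1];
        for (int i = 0; i <= numBins; i++)
            bounds[i] = min + i * width;
        return bounds;
    }

    // Runs the build only on a cache miss; later calls reuse the stored result.
    static double[] cachedBuild(String key, double[] column, int numBins) {
        return CACHE.computeIfAbsent(key, k -> buildBins(column, numBins));
    }

    // Wall-clock time of one action in nanoseconds.
    static long timeNanos(Runnable action) {
        long start = System.nanoTime();
        action.run();
        return System.nanoTime() - start;
    }

    public static void main(String[] args) {
        double[] column = new Random(42).doubles(1_000_000).toArray();
        int runs = 10;

        // First run: fills the cache, so it is reported separately.
        long cold = timeNanos(() -> cachedBuild("col0/bin", column, 10));

        // Remaining runs: cache hits, averaged over 10 repetitions.
        long warmTotal = 0;
        for (int i = 0; i < runs; i++)
            warmTotal += timeNanos(() -> cachedBuild("col0/bin", column, 10));

        // Baseline without caching, averaged over the same number of runs.
        long uncachedTotal = 0;
        for (int i = 0; i < runs; i++)
            uncachedTotal += timeNanos(() -> buildBins(column, 10));

        System.out.printf("first (cold) run: %d ns%n", cold);
        System.out.printf("avg cached run:   %d ns%n", warmTotal / runs);
        System.out.printf("avg uncached run: %d ns%n", uncachedTotal / runs);
    }
}
```

Reporting the cold run on its own keeps the one-time cost of populating the cache from skewing the cached average. Absolute timings from a loop like this are only indicative, since JIT warm-up and garbage collection also affect them.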