-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BEAM-7209][BEAM-9351][BEAM-9428] Upgrade Hive to version 3.1.3 #17749
Conversation
* This eliminated the pentaho dependency
Can one of the admins verify this patch? |
2 similar comments
Can one of the admins verify this patch? |
Can one of the admins verify this patch? |
Finally able to make the integration test run: The current integration test is written in a way that needs both metastore and hiveserver2 services running and essentially operating the same database. Did not find a way to make both services run locally (in local embedded mode, metastore or hiveserver2 will occupy the same Derby database and cannot be setup at the same time.) The test is conducted by manually doing setup of HCataogIOIT (thus commenting out these parts in HCataogIOIT.java), running a metastore service and then run the integration test. |
retest this please |
R: @TheNeuralBit |
Codecov Report
@@ Coverage Diff @@
## master #17749 +/- ##
==========================================
+ Coverage 74.01% 74.05% +0.04%
==========================================
Files 695 696 +1
Lines 91798 92195 +397
==========================================
+ Hits 67941 68274 +333
- Misses 22611 22675 +64
Partials 1246 1246
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
Run GoPortable PreCommit |
Known build flake "warning: Memory constraints are impeding performance; please increase max heap size." unrelated to this change. |
Do we expose hive in any of our public APIs? I was under the impression that we do, so we can't just bump a major version without risking breaking users. |
Yes we do. The HCatalog read and write transforms is in the type of One of the motivation of this change is the old pentaho dependencies that hive 2.x depended on has been removed from repos gradually and generated build issues to the Beam several times ([BEAM-6811]; #17734 (BEAM-14298)) and this upgrade may be considered to increase its priority. |
I see, please also update CHANGES.md to note this as a breaking change and recommend users override the Hive version if they need to. A couple more questions:
|
The changes are only involved in unit tests. Unit tests needs to run with Hive 3.x. The artifact built should not be affected.
Probably not. Hive 2.1.0 used Calcite 1.6.0 and now Hive 3.1.3 uses Calcite 1.16.0 as dependencies. Looked at Beam release history vendored Calcite started on v1.20.0 since the release of Beam 2.16.0 and the reason of vendoring were not related to Hive. Will confirm with others. |
retest this please |
commenting |
is it possible to move hive-site.xml to the test resources folder to make sure all hive dependencies are in the test side of the world? |
Agree. Those files were marked as |
retest this please |
R: @johnjcasey |
LGTM |
Run PythonLint PreCommit |
…he#17749) * [BEAM-9351] Upgrade Hive to version 3.1.2 * This eliminated the pentaho dependency * fix auth issue in test * Add change log * move internal test only files to test * clean up original workaround: Hive 3.1.3 upgraded to log4j 2.17.1
…he#17749) * [BEAM-9351] Upgrade Hive to version 3.1.2 * This eliminated the pentaho dependency * fix auth issue in test * Add change log * move internal test only files to test * clean up original workaround: Hive 3.1.3 upgraded to log4j 2.17.1
…he#17749) * [BEAM-9351] Upgrade Hive to version 3.1.2 * This eliminated the pentaho dependency * fix auth issue in test * Add change log * move internal test only files to test * clean up original workaround: Hive 3.1.3 upgraded to log4j 2.17.1
Please add a meaningful description for your change here
Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:
R: @username
).[BEAM-XXX] Fixes bug in ApproximateQuantiles
, where you replaceBEAM-XXX
with the appropriate JIRA issue, if applicable. This will automatically link the pull request to the issue.CHANGES.md
with noteworthy changes.See the Contributor Guide for more tips on how to make review process smoother.
To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md
GitHub Actions Tests Status (on master branch)
See CI.md for more information about GitHub Actions CI.